Learning Library
Curated articles on AI, agents, cloud infrastructure, and the tools shaping how we build software. Updated automatically from top sources.
2015 articles
AWS Bedrock to require sharing data with Anthropic for Mythos and future models
Rich Sutton on AI creativity and discovery
German ruling declares Google liable for false answers in AI Overviews
If Claude Fable stops helping you, you'll never know
macOS Container Machines
From data to decisions: how LSEG is scaling trusted AI
See how LSEG uses OpenAI to scale trusted AI across its global business, accelerating insights, shrinking release cycles, and empowering 4,000 employe…
Initial impressions of Claude Fable 5
llm 0.32a3
Setting a custom price for a model in AgentsView
Scale Robot Reinforcement Learning with NVIDIA Isaac Lab on Amazon SageMaker AI
In this post, we show how to train robot policies for the Unitree H1 humanoid with NVIDIA Isaac Lab on Amazon SageMaker AI across two compute options:…
Grit: Rewriting Git in Rust with agents
Git real: AI agents aren’t just for solo developers anymore
In the first week of June, three vendors pushed coding agents past the single-developer loop. The three launches sit at…
Can Voice Agents Handle Bilingual Customers? Benchmarking Frontier ASR on Code-Switched Speech
Anthropic launches Claude Mythos/Fable 5, but you better try it soon
On Tuesday, Anthropic launched Fable 5, its first generally available Mythos-class model. Fable 5 is essentially the highly capable Mythos…
Ultrafast machine learning on FPGAs via Kolmogorov-Arnold Networks
Delivering Lifecycle Control for AI Infrastructure at Scale with NVIDIA DGX Spark Enterprise Manageability
As AI infrastructure scales, enterprise expectations for operational maturity are increasing. Organizations expect these systems to be provisionable,…
Spring is 23 years old. AI just made it a security emergency.
AI is rewriting the rules of software security, and the Java ecosystem, the backbone of enterprise computing for more…
CEOs who think AI replaces their employees are just bad CEOs
Model Quantization: Turn FP8 Checkpoints into High-Performance Inference Engines with NVIDIA TensorRT
Converting a quantized checkpoint into an NVIDIA TensorRT engine bridges the gap between model optimization and production deployment, enabling faster…
Claude Fable 5
Hands-free first notice of loss: Using Strands Agents and Amazon Bedrock AgentCore Browser Tool for intelligent claims intake
In this post, we demonstrate how a hands-free FNOL intake system combines agents built with the Strands Agents SDK for domain reasoning with Amazon Be…
Accelerating Federated Learning Research with AI Agents and NVIDIA FLARE Auto-FL
Federated learning (FL) research often begins with a deceptively simple question: What should we try next? A new aggregation rule, a FedProx coefficie…
This AI agent startup ditched Anthropic for DeepSeek — and says it’s saving millions
The biggest blocker to sustainable AI deployment has emerged as inference cost. GitHub recently abandoned its flat-rate Copilot subscription in…
Build an agentic incident triage assistant with Amazon Quick and New Relic
This post shows engineering teams how to apply that principle to one of the most time-sensitive workflows in engineering: incident triage. You will bu…
Introducing North Mini Code: Cohere’s First Model For Developers
Evaluate Clinical ASR Models Faster with Agent Skills and NVIDIA Nemotron Speech
Training a speech AI model to correctly recognize or synthesize clinical terminology is surprisingly difficult. Drug names like Acetaminophen, Amlodip…
When your data model is the bottleneck: lessons from Medium’s feature store
“Keep readers reading” is the not-so-simple goal of Medium’s recommendations system. To predict what’s most likely to appeal to a…
12 GW announced. 5 GW under construction. What happens next?
The Gap Between the Press Release and the Power Grid Back in February, I wrote about what I called the “Data Center Rebell…
How long before we stop reading the code?
Don’t kill the code reviews; just move the human checkpoint upstream to reviewing intent, specs, plans, constraints, and acceptance criteria.
How engineers at Nextdoor use Codex to build without limits
How engineers at Nextdoor use Codex with GPT-5.5 to investigate hard-to-reproduce issues, build across platforms, and focus on product outcomes.
Solving secret sprawl in multi-account Kubernetes with External Secrets Operator
Infrastructure provisioning in Kubernetes has become increasingly automated, but secret management often remains a challenge as environments grow. Org…
How an Agent Built a 3D Paris Gallery by Chaining Two Hugging Face Spaces
What Codex unlocks for Notion
How Notion uses Codex to one-shot specs, build AI Voice Input for the web, and multiply engineering power across small teams.
The tokenmaxxing party is over, and Revenium is mopping up
For the past 18 months, the corporate approach to artificial intelligence has been a gold rush. The mandate was simple:…
Industrial policy for the Intelligence Age
Explore our ambitious, people-first industrial policy ideas for the AI era—focused on expanding opportunity, sharing prosperity, and building resilien…
Migrating Your GitHub CI to Hugging Face Jobs
Siri AI at WWDC 2026
How AI is solving the memory crunch it created
Memory has replaced compute as a primary constraint for modern tech teams. A perfect storm of hardware architecture limitations, semiconductor…
Microsoft’s pitch to enterprises: Ditch Azure Repos for GitHub, despite its rocky reliability record
GitHub hasn’t had an easy year. The platform has been hit by repeated outages affecting core services, including the…
Claude Code’s biggest upgrade yet ran 5 agents at once — here’s what happened
Anthropic shipped Claude Opus 4.8 on May 28, and with it came dynamic workflows in Claude Code. This fully testable…
Train Models Faster with JAX and MaxText Using NVFP4 on NVIDIA Blackwell
Pre-training frontier LLMs comes down to throughput. When training spans trillions of tokens across thousands of accelerators, every percentage point…
Why Anthropic just doubled Claude Cowork limits at no charge
Anthropic has announced a limited-time promotion that doubles users’ five-hour usage limits in Claude Cowork. Announced on Monday, it’s sure…
Unlocking AI flexibility in Europe: A guide to cross-region inference for EU data processing and model access
With access to the latest generative AI models and high-performance accelerated compute in high global demand, AWS customers need tools to take advant…
It’s safe to close your laptop now: Hosting coding agents on Amazon Bedrock AgentCore
Amazon Bedrock AgentCore Runtime gives each agent session its own isolated microVM with a persistent workspace, secure tool access through Gateway, an…
Better decisions at scale: How mathematical optimization delivers where intuition fails
In this post, we introduce mathematical optimization, explain how it fits within the broader AI landscape, and showcase real-world success stories whe…
End-to-end encrypted ML inference with Amazon SageMaker AI and FHE
This blog has previously discussed FHE for ML inference in the post Enable fully homomorphic encryption with Amazon SageMaker endpoints for secure, re…
Amazon Quick ARNs: Cross-account migration and namespace permissions
In this post, we cover the structure of Amazon Quick ARNs and provide a practical mental model for working with them. By the end, you can look at an A…
Evaluate your Amazon Nova Sonic voice agent at scale, no microphone required
In this post, we walk you through the Nova Sonic Test Harness, an open source framework that we built to solve both problems. It serves as a rapid ite…
Lies we tell ourselves about email addresses
Confidential submission of draft S-1 to the SEC
OpenAI confirms a confidential S-1 submission to the SEC and has not yet determined timing for further action.
For years, Apache Cassandra handed this work to your team — 6.0 takes it back
Apache Cassandra releases tend to be evaluated on roughly the same terms. What can it do now that it couldn’t…
“A dangerous combination”: The 2 factors that can “corrupt” AI agent workflows
Almost everyone’s workplace experience is now set to welcome AI-agent-driven actions through the applications we use daily, and this rapid evolution…
With Foundry, Microsoft bets the enterprise AI battle is about reliability, not capability
The agentic AI wave has produced no shortage of impressive demos. What it has produced less of is agents that…
Microsoft unlocks Visual Studio for developers left behind by its own AI
Microsoft used its Build 2026 conference last week to announce a series of updates to its flagship Visual Studio IDE…
Breaking free of a single datacenter: Practical geo-distributed AI operations with the k0smos platforms
Breaking the single datacenter assumption: Modern AI architectures are built on the assumption of centralized, homogeneous data centers. In reality, in…
Benchmarking KubeVirt performance with virtbench
Organizations migrating VM estates from traditional hypervisors to KubeVirt often discover that many Kubernetes observability tools were originally de…
Built to benefit everyone: our plan
A vision for the future of AI, focusing on access, safety, and shared prosperity as OpenAI works to ensure AGI benefits everyone.
Introducing the OpenAI Economic Research Exchange
OpenAI launches the Economic Research Exchange to study AI’s impact on jobs, productivity, and the economy. Applications are now open for selected res…
The Open Source Community is backing OpenEnv for Agentic RL
datasette-agent-edit 0.1a0
AI teams now deploy 1,000 times a month. Your pipeline wasn’t built for that.
There’s mounting evidence that AI coding tools are delivering on their less outlandish promises. With adoption shifting from 76% in…
Microsoft just made the agent runtime free — and kept everything around it
Microsoft has the engineers to build its own agent runtime. At Build 2026 last week, it chose not to, shipping…
“Whoever builds the most joyous product wins”: The agent war begins
At Snowflake Summit 26 this week in San Francisco, the conversation moved into a new direction. If last year’s focus…
Netlify CTO Dana Lawson: Writing code is no longer the job
“I’ve been doing this since the ’90s, decades of building guardrails to prevent humans from breaking production. We’re now…
From Jupyter Notebook to production: How to ship AI systems that actually work
Moving from experimentation to production in AI requires a transformation of mindset, architecture, and engineering discipline. There’s no API wrapper…
OpenClaw used Gavriel Cohen’s code and exposed the AI Agent accountability problem
I’m Matt Burns, Chief Content Officer at Insight Media Group. Each week, I round up the most important AI developments…
OpenAI Help: Lockdown Mode
Replit shows how vibe coding is getting its own financial stack — and a path to profit
Making apps is easier than it’s ever been, but making money from them is another matter entirely. While almost anyone…
Cloudflare acqui-hires VoidZero: Did a piece of the open web just stabilize, or become more brittle?
Cloud network security and content delivery network company Cloudflare announced its acquisition of VoidZero this week, and VoidZero founder Evan…
The latest AI news we announced in May 2026
Here are Google’s latest AI updates from May 2026
AI enthusiasts are in a race against time, AI skeptics are in a race against entropy
Cursor cuts prices and adds enterprise spend controls amid “tokenomics” reckoning
If there’s one big takeaway from the AI coding space this week, it’s that the era of flat-rate, all-you-can-code pricing…
Google Gemma 4 12B nearly matches 26B benchmarks — and runs on your laptop
Google has introduced Gemma 4 12B, a new model designed to bring high-performance, multi-modal intelligence to standard laptops. Small enough…
Snowflake thinks it knows what’s really slowing developers down
Ready or not, the agentic enterprise is here, and the key to enabling it efficiently is being debated from various…
Nemotron 3.5 Content Safety: Customizable Multimodal Safety for Global Enterprise AI
Identity and Access Management Whitepaper
As cloud native architectures become more distributed, dynamic, and automated, identity increasingly becomes the new security perimeter. Traditional a…
NVIDIA Nemotron 3 Ultra now available on Amazon SageMaker JumpStart
Deploy NVIDIA Nemotron 3 Ultra on Amazon SageMaker JumpStart. Get 5x faster inference and 30% lower cost for agentic AI workloads with this frontier r…
Quoting Emanuel Maiberg, 404 Media
NVIDIA Nemotron 3 Ultra Powers Faster, More Efficient Reasoning for Long-Running Agents
Single-turn chatbots are evolving into long-running agents that can reason, maintain context, use tools, and run efficiently across many turns to comp…
How Endava is redesigning software delivery around AI agents
Learn how Endava is using AI agents, ChatGPT Enterprise, and Codex to accelerate software delivery, automate workflows, and build an AI-native culture…
Securing CI/CD for an open source project: Controlling who runs what
Part one: The last twelve months have been rough on the open source supply chain. Axios was compromised on npm and shipped a remote access trojan insid…
Dreaming: Better memory for a more helpful ChatGPT
ChatGPT introduces a new memory system to better remember preferences, keeping context fresh and relevant across conversations.
Biodefense in the Intelligence Age
An action plan for AI-powered biological resilience
Designing the hf CLI as an agent-optimized way to work with the Hub
Inspektor Gadget: Results from the first security audit
Inspektor Gadget, the open source eBPF-based toolkit for Kubernetes observability and Linux host inspection, has completed its first independent secur…
How to build self-driving AI operations on Amazon Bedrock at scale
In this post, we introduce Amazon Bedrock Ops Alert, a three-layer automated monitoring solution that proactively detects operational issues, dynamica…
Fundamental’s Large Tabular Model NEXUS is now available on Amazon SageMaker JumpStart
In this post, we show you how to get started with NEXUS on Amazon SageMaker JumpStart, walk through the deployment process, and demonstrate how to run…
Reducing container cold start times using SOCI index on DLAMI and DLC
In this post, we look at how to use SOCI on publicly available Deep Learning AMIs and Containers, when to use the various SOCI modes provided by the t…
Improve your agent’s tool-calling accuracy with SFT and DPO on Amazon SageMaker AI
In this post, you learn how to use Supervised Fine-Tuning (SFT) and Direct Preference Optimization (DPO) together to improve the tool-calling accuracy…
Using Muon Optimizer with DeepSpeed
TL;DR DeepSpeed now supports Muon Optimizer! Muon Optimizer has gained great momentum with significant adoption from frontier AI Labs. One of those AI…
Introducing new capabilities to GPT-Rosalind
GPT-Rosalind advances life sciences research with enhanced biological reasoning, medicinal chemistry expertise, genomics analysis, and experimental wo…
5 ways Google Search can level up your thrift and vintage shopping
Uncover second-hand scores with AI tools in Google Search and Shopping.
Your Enterprise Data Deserves Better Than a Chatbot
Large language models and their multimodal variants remain the foundation models most people encounter first. That makes sense. Text, images, audio, a…
Direct Preference Optimization Beyond Chatbots
Uber Caps Usage of AI Tools Like Claude Code to Manage Costs
How Wasmer used Codex to build a Node.js runtime for the edge
See how Wasmer used Codex with GPT-5.5 to build a Node.js runtime for the edge, accelerating development 10x to 20x and shipping in weeks instead of m…
A blueprint for democratic governance of frontier AI
OpenAI outlines a blueprint for U.S. governance of frontier AI, proposing a federal framework for safety, resilience, and national security.
OpenAI public policy agenda
OpenAI outlines its public policy agenda for AI, including safety, youth protection, workforce transition, and global standards to ensure AI benefits…
Adding MCP Tools to Reachy Mini
Microsoft's new MAI models
datasette-agent-micropython 0.1a0
Build Personal AI Agents on Windows PCs with New Tools from Microsoft and NVIDIA
AI agents are changing how you interact with your PC. Creators, developers, and AI enthusiasts are already using these agents extensively to assist wi…
The art and science of hyperparameter optimization on Amazon Nova Forge
Fine-tuning for domain-specific tasks means improving performance in one area without degrading the model’s general capabilities, and getting that bal…
Object detection with Amazon Nova 2 Lite
In this post, we'll walk through implementing object detection with Amazon Nova 2 Lite. You'll learn how to deploy an object detection application usi…
Deploy Self-Evolving Agents for Faster, More Secure Research with a Hermes Agent and NVIDIA NemoClaw
AI agents are a powerful tool for synthesizing data to accelerate research, summarize information, and help teams make decisions faster. But combining…
How Baz improved its AI Agent Code Review accuracy using Amazon Bedrock AgentCore
This post walks through how Baz built their Spec Review agent using Amazon Bedrock and Amazon Bedrock AgentCore. We'll cover the architecture decision…
Holo3.1: Fast & Local Computer Use Agents
The smartest AI teams are moving past chatbots
Your Enterprise Data Deserves Better Than a Chatbot Large language models and their multimodal variants remain the foundat…
Travelers deploys AI-powered claims countrywide with OpenAI
Travelers built an AI-powered Claim Assistant with OpenAI to guide customers through filing claims, provide 24/7 support, and scale operations during…
Mumbai Maha Mahotsav – KubeCon + CloudNativeCon India edition
Welcome to Mumbai – the City of Dreams, where ambition is the only dress code – and the host city for KubeCon + CloudNativeCon India 2026. As a co-cha…
Cloud native is now AI-native: Engineering production-ready AI
At KubeCon + CloudNativeCon Europe in Amsterdam from March 23-26, CNCF brought together a roundtable with experts in the cloud native ecosystem, inclu…
Codex for every role, tool, and workflow
Discover new Codex plugins, sites, and annotations that help analysts, marketers, designers, investors, and other teams get more done with AI.
Advancing youth safety and opportunity through global leadership
OpenAI calls for global action on youth AI safety, proposing an international institute to strengthen safeguards, standards, and opportunities for you…
Building a secure auth code flow setup using AgentCore Gateway with MCP clients
This post demonstrates how to implement Open Authorization (OAuth) Code flow as an inbound authorization mechanism for MCP servers hosted on Amazon Be…
Codex is becoming a productivity tool for everyone
The Next Era of Knowledge Work report explores how Codex is transforming productivity through AI-powered research, data analysis, workflow automation,…
Deploy Agentic-Ready AI at the Edge with Memory Efficiency in NVIDIA JetPack 7.2
As AI agents move from the digital world to the physical environment, they can readily use NVIDIA Jetson to accelerate real-world deployment with opti…
Reference your own AWS Secrets Manager secrets in Amazon Bedrock AgentCore Identity
Today, we’re excited to announce the ability to reference a secret in AWS Secrets Manager for AgentCore Identity, so you can reference your own precon…
Run Local AI Agents with Faster Models and Multi-Node Clustering on NVIDIA DGX Spark
The rise of autonomous, long-running AI agents has introduced a new class of compute demand, namely tasks that maintain large context windows, spawn c…
Transforming rare cancer research with Amazon Quick: Integrating biomedical databases for breakthrough discoveries
In this post, we walk through how to use Amazon Quick Research to integrate biomedical data sources for rare cancer research. The walkthrough uses ped…
Hackers Simply Asked Meta AI to Give Them Access to High-Profile Instagram Accounts. It Worked
Our views on AI policy and political advocacy
Our approach to AI policy and political advocacy, transparency, support for thoughtful regulation and AI safety, and that no outside political group s…
How we used Gemini to build Google I/O 2026
Learn how Googlers used AI to produce Google I/O 2026.
Introducing Mellum2: A 12B Mixture-of-Experts Model by JetBrains
How LinkedIn Uses PyTorch to Solve Extreme-Scale Optimization Problems
TL;DR: This case study demonstrates how LinkedIn re-architected its distributed linear programming solver, DuaLip, by developing a GPU-accelerated PyT…
Beyond LLMs: Why Scalable Enterprise AI Adoption Depends on Agent Logic
Building the infrastructure for the Intelligence Age in Michigan
OpenAI breaks ground on a 1GW data center project in Michigan as part of Stargate, building AI infrastructure to expand access, create jobs, and suppo…
Dynamic configuration for cloud native Swift services
Modern Swift services increasingly run alongside the same cloud native infrastructure stacks that power much of today’s Kubernetes ecosystem — includi…
OpenAI frontier models and Codex are now available on AWS
OpenAI frontier models and Codex are now generally available on AWS, giving enterprises a new path to build with OpenAI through the AWS environments,…
How to Post-Train Autonomous Vehicle Models in Closed-Loop with NVIDIA Alpamayo
Developing autonomous vehicle (AV) policies requires bridging an important gap between training and deployment. Vision-language-action (VLA) models th…
Develop Physical AI Reasoning, World, and Action Models with NVIDIA Cosmos 3
Physical AI systems must understand the real world before they can act within it. Robots, autonomous vehicles, and smart spaces need to understand wha…
Advancing AI Infrastructure for Agentic AI with NVIDIA DOCA In-Silicon Security
The AI era is driving a new class of infrastructure: AI factories that transform data into intelligence for autonomous AI agents operating at unpreced…
NVIDIA Vera CPU Sets a New Standard for Agentic Workloads in AI Factories
Each wave of AI has created a new scaling law. Pretraining scaled intelligence through larger datasets, more parameters, and massively parallel GPU sy…
NVIDIA DSX OS Delivers Open, Modular Software for Operating AI Factories at Scale
AI is now essential infrastructure, powered by AI factories that generate intelligence in the form of tokens. As demand grows, these factories must sc…
The solution might be cancelling my AI subscription
How we contain Claude across products
DynoSim: Simulating the Pareto Frontier
Modern LLM serving is hard to tune because each deployment is a stack of interacting choices: model backend, tensor-parallel shape, prefill/decode spl…
Take our I/O 2026 quiz, vibe coded in Google AI Studio.
We used Google AI Studio to vibe code a quiz about our top I/O 2026 announcements.
9 demos of Gemini Omni and Gemini 3.5 in action
Watch 9 videos showing the capabilities of Gemini Omni and Gemini 3.5, announced at Google I/O 2026.
How to Automate AI Model Documentation with the NVIDIA MCG Toolkit
As AI models grow in complexity and regulatory scrutiny intensifies under frameworks including California’s AB-2013 and the EU AI Act, software teams…
Check out real-life AI prototypes from the Futures Lab.
University of Waterloo students develop AI prototypes like sign language tutors to reshape the future of education and work.
Boston Children’s uses AI to unlock new diagnoses
Boston Children’s Hospital uses OpenAI technology to improve patient care, reduce operational burden, and help diagnose more than 40 rare disease case…
How Braintrust turns customer requests into code with Codex
How Braintrust engineers use Codex with GPT-5.5 to run experiments and code faster.
Building a cloud native internal developer platform with Kubernetes, GitOps, and supply chain security
Modern software delivery is no longer constrained by application code — it is constrained by the platform that runs it. This article presents the desi…
Strengthening societal resilience with Rosalind Biodefense
OpenAI launches Rosalind Biodefense, expanding trusted access to GPT-Rosalind for vetted developers and U.S. government partners advancing biodefense,…
Run Step 3.7 Flash on NVIDIA GPUs with Enterprise-Ready Multimodal AI
AI applications are moving beyond text generation to multimodal systems that can perceive, search, and reason across images, documents, video, and…
A shared playbook for trustworthy third party evaluations
OpenAI shares guidance on third-party AI evaluations, covering how to assess model capabilities, safeguards, and validity for frontier systems.
Profiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler
Catch up on 12 major I/O 2026 moments
Here are 12 of the biggest Google I/O 2026 keynote moments, including news about Gemini Omni, Gemini 3.5 Flash and more.
How Endava builds an agentic organization with Codex
Learn how Endava uses Codex to build an agentic organization, accelerating software delivery and reducing requirements analysis from weeks to hours.
OpenAI’s Frontier Governance Framework
Explore OpenAI’s Frontier Governance Framework and how our AI safety, security, and risk practices align with emerging EU and California regulations.
MUFG aims to become AI-native with OpenAI
MUFG uses ChatGPT Enterprise to build an AI-native organization, improve workflows, and deliver new AI-powered financial services at scale.
NVIDIA Dynamo Snapshot: Fast Startup for Inference Workloads on Kubernetes
The cold-start problem: In production inference deployments, demand fluctuates over time, requiring inference replicas to scale elastically. However,…
NVIDIA Blackwell Sets STAC-AI Record for LLM Inference in Finance
Large language models (LLMs) are revolutionizing the financial trading landscape by enabling sophisticated analysis of vast amounts of unstructured da…
Why Is PyTorch Compile So Fast: Kernel Fusion
When you use PyTorch’s compiler, your model runs faster, up to 10x faster. But what’s actually happening? Without compilation, the GPU runs a kernel,…
What’s New for Game Developers in NVIDIA RTX: DLSS 4.5 for UE5 and Multilingual AI Characters
NVIDIA RTX provides game developers with direct paths to AI-driven characters, frame generation, and ray-traced rendering. This post walks through a m…
Up to 580tps! New Speed Record of Qwen3.5-397B-A17B on GPU for Agentic Workloads with TokenSpeed
TL;DR: The TokenSpeed inference engine achieved a record-breaking 580 tps running the Qwen3.5-397B-A17B model on GPUs. This extreme performance for ag…
Beyond the Demo: What Real AI Agents Actually Do at Work
I am always on the lookout for new AI agents and applications that operate outside the coding world. By agent, I mean a system that can take a goal, u…
Cisco and OpenAI redefine enterprise engineering with Codex
Cisco and OpenAI are redefining enterprise engineering with Codex, helping Cisco scale AI-native development, accelerate AI Defense work, and automate…
Building self-improving tax agents with Codex
See how OpenAI, Thrive, and Crete built a self-improving tax agent with Codex, automating filings, improving accuracy, and accelerating workflows.
Alibaba Cloud Joins the PyTorch Foundation as a Platinum Member
The PyTorch Foundation, a community-driven hub for open source AI under the Linux Foundation, is announcing today that Alibaba Cloud has joined as a…
Election information and safeguards in 2026
Ahead of global elections, we’re helping people access information, supporting cyber defenders, and increasing AI transparency
Warp’s big bet on building open source with GPT-5.5
Warp uses GPT-5.5 and OpenAI models to coordinate coding agents across local, cloud, and open-source development workflows.
Reachy Mini goes fully local
Shipping a Trillion Parameters With a Hub Bucket: Delta Weight Sync in TRL
Extract More Kernel Performance with NVIDIA CompileIQ Auto-Tuning
NVIDIA CompileIQ tackles one of the hardest problems in performance engineering: finding the compiler options that unlock the best performance for a s…
Develop High-Performance GPU Kernels in C++ with NVIDIA CUDA Tile
Developers can now use NVIDIA CUDA Tile programming within large existing C++ GPU codebases to develop highly optimized GPU kernels using tile-based…
NVIDIA CUDA 13.3 Enhances GPU Development with Tile Programming in C++, Compiler Autotuning, and Python Updates
NVIDIA CUDA 13.3 brings new capabilities and performance optimizations to developers across the CUDA ecosystem. The launch of NVIDIA CUDA Tile program…
Run Key Genomics and Protein Folding Workloads Faster with NVIDIA RTX PRO 4500 Blackwell
Precision medicine depends on two fundamental capabilities: understanding disease at the genomic level and identifying treatments at the molecular lev…
TLX Block Attention: A Warp-Specialized Blackwell Kernel for Fixed-Block Sparse Self-Attention
Code available at: https://github.com/facebookresearch/ads_model_kernel_library In this post, we present the design of TLX Block Attention — a Triton…
What Upwork, DoorDash, Meta, EY, and Fundrise reveal about agents
Beyond the Demo: What Real AI Agents Actually Do at Work I am always on the lookout for new AI agents and applications tha…
The Vatican’s AI Principles: What You Need to Know
The Vatican’s recent encyclical, Magnifica Humanitas, introduces a moral framework that challenges how technology leaders evaluate artificial intellig…
OpenAI, Grupo Folha and Grupo UOL announce strategic content partnership
OpenAI partners with Grupo Folha and Grupo UOL to bring trusted Brazilian journalism to ChatGPT, expanding access to news with attribution and transpa…
Harness, Scaffold, and the AI Agent Terms Worth Getting Right
Join the PyTorch Foundation Ambassador Program: A Global Network of Community Leaders
A little over a year ago, the PyTorch Foundation launched the Ambassador Program, an initiative that recognizes and supports independent, trusted voic…
Catch up on the Dialogues stage at Google I/O 2026.
A recap of the 2026 I/O Dialogues, where leaders discuss the future of AI, quantum computing, robotics and creativity.
Synthesize Realistic 3D Medical Images at Scale to Ship Pre‑Trained Models
High‑quality 3D medical imaging data is the foundation of modern radiology AI, but access to it is often constrained by data scarcity, privacy restric…
An AI Math Breakthrough and the New Division of Labor
In a piece I wrote a few months ago, I argued that research mathematics had become an unexpectedly useful test case for AI, precisely because mathemat…
How Virgin Atlantic ships faster with Codex
How Virgin Atlantic used Codex to ship its revamped mobile app on a fixed holiday travel deadline, reaching near-total unit test coverage and zero P1…
OpenAI named a Leader in enterprise coding agents by Gartner
OpenAI is named a leader in the 2026 Gartner Magic Quadrant for Enterprise AI Coding Agents, with Codex recognized for innovation and enterprise-scale…
Automating and Optimizing Financial Signal Discovery with Multi-Agent Systems
In quantitative finance, researchers build algorithms to trade assets, derivatives, and other financial instruments. A key part of that work is findin…
Get Real-Time Visibility into GPU Usage Across Kubernetes Clusters
Maximizing the value of AI infrastructure demands deep visibility into GPU utilization. Yet many platform teams running AI workloads on Kubernetes ope…
Unlock Exascale Performance on NVIDIA GB200 NVL72 with Slurm Topology-Aware Job Scheduling
As AI models grow in scale and complexity, realizing the full performance of modern accelerated infrastructure depends as much on how workloads are pl…
Building Token‑Metered AI Services on Telco AI Factories
Telcos around the world are building sovereign AI factories based on the NVIDIA Cloud Partner (NCP) reference architecture, giving governments, enterp…
AdventHealth advances whole-person care with OpenAI
AdventHealth is using ChatGPT for Healthcare to streamline workflows, reduce administrative burden, and return more time to patient care.
We’re announcing new community investments in Missouri.
We’re helping build the state’s next-generation workforce and investing in energy programs.
Mastering Agentic Techniques: AI Agent Customization
Autonomous AI agents are taking on all types of work for businesses: routing logistics fleets, triaging support tickets, generating code, and orchestr…
100 things we announced at I/O 2026
We've been busy! Here’s a rundown of the top announcements, launches and demos at I/O 2026.
A new experiment brings better group meetings to Google Beam
See and hear your colleagues in true-to-life size and sound, making hybrid meetings feel more inclusive and connected.
Add a Specialized Deep Research Skill to Agent Harnesses
Agent harnesses like Claude Code, Codex, and LangChain Deep Agents are excellent orchestrators. They manage sessions, chain tools, execute code, and r…
PyTorch Docathon 2026 Results in 150+ Merged Pull Requests
Thank you to everyone who participated in the PyTorch Docathon 2026! Once again, the community showed up with incredible energy and dedication to make…
Integration Is the New Moat: Moving Beyond the LLM
The AI Agent Conference in New York was one of the better events I’ve attended to get a read on what’s actually happening with enterprise AI. The form…
The next phase of OpenAI’s Education for Countries
OpenAI advances Education for Countries, expanding AI adoption in schools with new partnerships, teacher training, and tools to improve global learnin…
How Ramp engineers accelerate code review with Codex
How Ramp engineers use Codex with GPT-5.5 to review code and ship improvements, allowing them to get substantive feedback in minutes instead of hours.
An OpenAI model has disproved a central conjecture in discrete geometry
An OpenAI model solved the 80-year-old unit distance problem, disproving a major conjecture in discrete geometry and marking a milestone in AI-driven…
NVIDIA-Verified Agent Skills Provide Capability Governance for AI Agents
Autonomous AI agents are becoming more capable. Open models, Model Context Protocol (MCP)-connected tools, and portable skills are also making agents…
Google I/O 2026: The Agent Layer Takes Shape
The announcements at Google I/O 2026 landed today. I’ve gone through everything and pulled out what I think actually matters for people building produ…
Introducing OpenAI for Singapore
OpenAI for Singapore launches a multi-year AI partnership to expand deployment, build local talent, and support businesses and public services with AI…
Mastering Agentic Techniques: AI Agent Evaluation
Evaluating an AI model and evaluating an AI agent are related—but they answer fundamentally different questions. A model benchmark tests the capabilit…
OlmoEarth v1.1: A more efficient family of Earth observation models
How AI Mode is changing the way people search in the U.S.
One year after launch, see how AI Mode’s users are shifting from keywords to natural language queries.
New ways to create and get things done in Google Workspace
Announcing new voice capabilities in Gmail, Docs and Keep, a new design tool called Google Pics and updates to AI Inbox.
I/O 2026: Welcome to the agentic Gemini era
The latest from Google I/O: See how we’re helping you get more done with Gemini.
Gemini 3.5: frontier intelligence with action
At Google I/O we released Gemini 3.5, our latest series of models combining frontier intelligence with action.
A new era for AI Search
We shared the next step in our journey to bring together the best of a search engine with the best of AI.
Everything new in our Google AI subscriptions, fresh from I/O 2026
Introducing a $100 AI Ultra plan — plus, new features and benefits for Google AI Plus, Pro and Ultra subscribers.
I/O 2026
At Google I/O 2026, we shared how we’re making AI more helpful for everyone. See everything we announced.
Stop upgrading your LLM. Start fixing your data.
Integration Is the New Moat: Moving Beyond the LLM. The AI Agent Conference in New York was one of the better events I’ve a…
Advancing content provenance for a safer, more transparent AI ecosystem
OpenAI advances AI content provenance with Content Credentials, SynthID, and a verification tool to help people identify and trust AI-generated media.
Introducing the Ettin Reranker Family
vLLM and PyTorch Work Together to Improve the Developer Experience on aarch64
TLDR: PyTorch 2.11 makes it possible to install CUDA-enabled PyTorch wheels on aarch64 Linux directly from PyPI, eliminating the need for custom packa…
Running PyTorch Models on Apple Silicon GPUs with the ExecuTorch MLX Delegate
TL;DR: Introducing the ExecuTorch MLX Delegate The new MLX delegate enables optimized, GPU-accelerated inference for PyTorch models on Apple Silicon M…
PaddleOCR 3.5: Running OCR and Document Parsing Tasks with a Transformers Backend
OpenAI and Dell partner to bring Codex to hybrid and on-premise enterprise environments
OpenAI and Dell partner to bring Codex to hybrid and on-premise environments, helping enterprises deploy AI coding agents securely across data and wor…
OpenAI and Malta partner to bring ChatGPT Plus to all citizens
OpenAI and Malta partner to expand AI access, offering ChatGPT Plus and training to help citizens build practical AI skills and use AI responsibly.
Databricks brings GPT-5.5 to enterprise agent workflows
Databricks uses GPT-5.5 for enterprise agent workflows after the model set a new state of the art on the OfficeQA Pro benchmark.
How data science teams use Codex
See how data science teams can use Codex to build root-cause briefs, impact readouts, KPI memos, scoped analyses, and dashboard specs from real work i…
How business operations teams use Codex
See how business operations teams can use Codex to create initiative briefs, strategy updates, leadership decision packets, progress updates, and more…
How sales teams use Codex
See how sales teams can use Codex to create pipeline briefs, meeting prep packets, forecast reviews, account plans, and stalled-deal diagnoses from re…
A new personal finance experience in ChatGPT
Preview a new personal finance experience in ChatGPT for Pro users in the U.S. Securely connect your financial accounts and get AI-powered insights an…
Sea's View on the Future of Agentic Software Development with Codex
Sea Limited's CPO explains why the company is deploying Codex across engineering teams to accelerate AI-native software development in Asia.
How the NVIDIA Vera Rubin Platform is Solving Agentic AI’s Scale-Up Problem
Agentic inference has fundamentally changed the runtime dynamics of inference workloads by introducing non-deterministic trajectories—actions, observa…
Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality
Work with Codex from anywhere
Use Codex anywhere with the ChatGPT mobile app. Monitor, steer, and approve coding tasks in real time across devices and remote environments.
Helping ChatGPT better recognize context in sensitive conversations
Learn how new ChatGPT safety updates improve context awareness in sensitive conversations, helping detect risk over time and respond more safely.
Unlocking asynchronicity in continuous batching
Transform Video Into Instantly Searchable, Actionable Intelligence with AI Agents and Skills
In today’s data-driven world, organizations increasingly rely on video to capture critical information, yet extracting meaningful, real-time insights…
Accelerated X-Ray Analysis for Nanoscale Imaging (XANI) of Novel Materials
A massive-scale X-ray free-electron laser (XFEL) enables tracking structural and electron dynamics in novel systems, including fusion materials, semic…
Building a safe, effective sandbox to enable Codex on Windows
Learn how OpenAI built a safe, effective sandbox to enable Codex on Windows with controlled file access and network limits.
Our response to the TanStack npm supply chain attack
OpenAI details its response to the TanStack “Mini Shai-Hulud” supply chain attack, outlines protections taken to secure systems and signing certificat…
How to Eliminate Pipeline Friction in AI Model Serving
The path from a trained AI model to production should be smooth, but rarely is. Many teams invest weeks fine-tuning models, only to discover that expo…
How finance teams use Codex
See how finance teams can use Codex to build MBRs, reporting packs, variance bridges, model checks, and planning scenarios from real work inputs.
How NVIDIA engineers and researchers build with Codex
Teams use Codex with GPT-5.5 to ship production systems and turn research ideas into runnable experiments.
AutoScout24 scales engineering with AI-powered workflows
Learn how AutoScout24 Group uses Codex and ChatGPT to speed development cycles, improve code quality, and expand AI adoption.
What Parameter Golf taught us about AI-assisted research
Parameter Golf brought together 1,000+ participants and 2,000+ submissions to explore AI-assisted machine learning research, coding agents, quantizati…
Building Blocks for Foundation Model Training and Inference on AWS
Introducing NVIDIA Fleet Intelligence for Real-Time GPU Fleet Visibility and Optimization
The compute capability of large GPU fleets presents unprecedented opportunities to innovate and provide value to customers in record time. Yet these…
How ChatGPT adoption broadened in early 2026
ChatGPT adoption surged in Q1 2026, with fastest growth among users over 35 and more balanced gender usage, signaling broader mainstream AI adoption.
OpenAI Campus Network: Student club interest form
Join the OpenAI Campus Network—connect student clubs worldwide, access AI tools, host events, and build an AI-powered campus community.
How enterprises are scaling AI
How enterprises scale AI: from early experiments to compounding impact through trust, governance, workflow design, and quality at scale.
The new AI-powered Google Finance is expanding to Europe.
This week, the new, AI-powered Google Finance is launching across Europe, with full local language support. This reimagined experience offers a suite…
OpenAI launches DeployCo to help businesses build around intelligence
OpenAI launches DeployCo, a new enterprise deployment company built to help organizations bring frontier AI into production and turn it into measurabl…
Improving Bash Generation in Small Language Models with Grammar-Constrained Decoding
Bash is one of the most flexible and powerful interfaces exposed to AI agents. In the right system, a model that emits grep, curl, tar, or a shell pip…
Streaming Tokens and Tools: Multi-Turn Agentic Harness Support in NVIDIA Dynamo
An agentic exchange must preserve a structured interaction: assistant turns interleave reasoning with one or more tool calls, and subsequent user turn…
See what happens when creative legends use AI to make ads for small businesses.
Today we're launching The Small Brief, an initiative bringing together three ad industry icons to champion a local business they love. Their mission…
Running Codex safely at OpenAI
How OpenAI runs Codex securely with sandboxing, approvals, network policies, and agent-native telemetry to support safe and compliant coding agent ado…
Achieving Peak System and Workload Efficiency on NVIDIA GB200 NVL72 with Slurm Block Scheduling
NVIDIA GB200 NVL72 introduces a fundamentally new way to build GPU clusters by extending NVIDIA NVLink coherence across an entire rack. This design en…
Model Quantization: Post-Training Quantization Using NVIDIA Model Optimizer
Model quantization is an effective method to reduce VRAM usage and improve inference performance on consumer devices such as NVIDIA GeForce RTX GPUs.…
Real-Time Performance Monitoring and Faster Debugging with NCCL Inspector and Prometheus
Distributed deep learning depends on fast, reliable GPU-to-GPU communication using the NVIDIA Collective Communication Library (NCCL). When training s…
Scaling Trusted Access for Cyber with GPT-5.5 and GPT-5.5-Cyber
OpenAI expands Trusted Access for Cyber with GPT-5.5 and GPT-5.5-Cyber, helping verified defenders accelerate vulnerability research and protect criti…
Parloa builds service agents customers want to talk to
Parloa leverages OpenAI models to power scalable, voice-driven AI customer service agents, enabling enterprises to design, simulate, and deploy reliab…
Advancing voice intelligence with new models in the API
Explore new realtime voice models in the OpenAI API that can reason, translate, and transcribe speech, enabling more natural and intelligent voice exp…
Introducing Trusted Contact in ChatGPT
Introducing Trusted Contact in ChatGPT, an optional safety feature that notifies someone you trust if serious self-harm concerns are detected.
Testing ads in ChatGPT
OpenAI begins testing ads in ChatGPT to support free access, with clear labeling, answer independence, strong privacy protections, and user control.
Simplex rethinks software development with Codex
Simplex boosts software development with ChatGPT Enterprise and Codex, reducing design, build, and testing time while scaling AI-driven workflows.
vLLM V0 to V1: Correctness Before Corrections in RL
How ChatGPT learns about the world while protecting privacy
Learn how ChatGPT safeguards your privacy, reduces personal data in training, and gives you control over whether your conversations improve AI models.
Uber uses OpenAI to help people earn smarter and book faster
Uber uses OpenAI to power AI assistants and voice features that help drivers earn smarter and riders book faster across a global real-time marketplace…
Singular Bank helps bankers move fast with ChatGPT and Codex
Singular Bank built Singularity, an internal assistant using ChatGPT and Codex to help bankers save 60–90 minutes daily on meeting prep, portfolio ana…
Introducing ChatGPT Futures: Class of 2026
Meet the ChatGPT Futures Class of 2026—26 student innovators using AI to build, research, and drive real-world impact. Discover how this generation is…
How frontier firms are pulling ahead
OpenAI’s B2B Signals research shows how frontier enterprises deepen AI adoption, scale Codex-powered agentic workflows, and build durable competitive…
Adding Benchmaxxer Repellant to the Open ASR Leaderboard
How to Build In-Vehicle AI Agents with NVIDIA: From Cloud to Car
The automotive cockpit is undergoing a fundamental shift from rule-based interfaces to agentic, multimodal AI systems capable of reasoning, planning,…
Building for the Rising Complexity of Agentic Systems with Extreme Co-Design
Generative AI’s explosive first chapter was defined by humans sending requests and models responding. The agentic chapter is different. Agents don't…
GPT-5.5 Instant System Card
GPT-5.5 Instant: smarter, clearer, and more personalized
GPT-5.5 Instant updates ChatGPT’s default model with smarter, more accurate answers, reduced hallucinations, and improved personalization controls.
Unlocking large scale AI training networks with MRC (Multipath Reliable Connection)
OpenAI introduces MRC (Multipath Reliable Connection), a new supercomputer networking protocol released via OCP to improve resilience and performance…
New ways to buy ChatGPT ads
OpenAI expands ChatGPT ads with a beta self-serve Ads Manager, CPC bidding, and enhanced measurement tools—built to protect privacy and keep conversat…
Advancing youth safety and wellbeing in EMEA
Explore OpenAI’s European Youth Safety Blueprint and EMEA Youth & Wellbeing Grants, advancing safe, responsible AI for teens, families, and educators.
OpenAI and PwC collaborate to reimagine the office of the CFO
OpenAI and PwC are partnering to help enterprises use AI agents to automate finance workflows, improve forecasting, strengthen controls, and modernize…
Optimize Supply Chain Decision Systems Using NVIDIA cuOpt Agent Skills
Modern supply chains operate under the constant pressures of fluctuating demand, volatile costs, constrained capacity, and interdependent decision-mak…
How OpenAI delivers low-latency voice AI at scale
How OpenAI rebuilt its WebRTC stack to power real-time Voice AI with low latency, global scale, and seamless conversational turn-taking.
Speed Up Unreal Engine NNE Inference with NVIDIA TensorRT for RTX Runtime
Neural network techniques are increasingly used in computer graphics to boost image quality, improve performance, and streamline content creation. App…
Build AI-Powered Games with NVIDIA DLSS 4.5, RTX, and Unreal Engine 5
Today, game developers can begin integrating NVIDIA DLSS 4.5 with Dynamic Multi Frame Generation, Multi Frame Generation 6X, and the second-generation…
How to Build, Run, and Scale High-Quality Creator Workflows in ComfyUI
Creative and visualization teams today produce more assets, in more formats, with leaner teams. Generative AI can accelerate that work – compressing t…
Automating GPU Kernel Translation with AI Agents: cuTile Python to cuTile.jl
NVIDIA CUDA Tile (cuTile) is a tile-based programming model that enables developers to write GPU kernels in terms of tile-level operations—loads, stor…
Introducing Advanced Account Security
Introducing Advanced Account Security: phishing-resistant login, stronger recovery, and enhanced protections to safeguard sensitive data and prevent a…
Where the goblins came from
How goblin outputs spread in AI models: timeline, root cause, and fixes behind personality-driven quirks in GPT-5 behavior.
Powering AI Factories with NVIDIA Enterprise Reference Architectures
The next wave of enterprise productivity is being built on AI factories. As organizations deploy agentic AI systems capable of reasoning, automation,…
Granite 4.1 LLMs: How They’re Built
Building the compute infrastructure for the Intelligence Age
OpenAI scales Stargate to build the compute infrastructure powering AGI, adding new data center capacity to meet growing AI demand.
Cybersecurity in the Intelligence Age
OpenAI outlines a five-part action plan for strengthening cybersecurity in the Intelligence Age, focused on democratizing AI-powered cyber defense and…
DeepInfra on Hugging Face Inference Providers 🔥
Scaling Biomolecular Modeling Using Context Parallelism in NVIDIA BioNeMo
For decades, computational biology has operated under a reductionist compromise. To fit complex biological systems into the limited memory of a single…
NVIDIA Nemotron 3 Nano Omni Powers Multimodal Agent Reasoning in a Single Efficient Open Model
Agentic systems often reason across screens, documents, audio, video, and text within a single perception‑to‑action loop. However, they still rely on…
Introducing NVIDIA Nemotron 3 Nano Omni: Long-Context Multimodal Intelligence for Documents, Audio and Video Agents
24/7 Simulation Loops: How Agentic AI Keeps Subsurface Engineering Moving
The subsurface industry is at a critical point in its digital evolution. For decades, unlocking reservoir potential has relied on experts performing e…
OpenAI models, Codex, and Managed Agents come to AWS
OpenAI GPT models, Codex, and Managed Agents are now available on AWS, enabling enterprises to build secure AI in their AWS environments.
Our commitment to community safety
Learn how OpenAI protects community safety in ChatGPT through model safeguards, misuse detection, policy enforcement, and collaboration with safety ex…
OpenAI available at FedRAMP Moderate
OpenAI is available at FedRAMP Moderate authorization for ChatGPT Enterprise and the OpenAI API, enabling secure AI adoption for U.S. federal agencies…
The next phase of the Microsoft OpenAI partnership
OpenAI and Microsoft announce an amended agreement that simplifies the partnership, adds long-term clarity, and supports continued AI innovation at sc…
An open-source spec for orchestration: Symphony
Learn how Symphony, an open-source spec for Codex orchestration, turns issue trackers into always-on agent systems—boosting engineering output and red…
Choco automates food distribution with AI agents
How Choco used OpenAI APIs to streamline food distribution, boost productivity, and unlock growth—an in-depth customer story on real-world AI impact.
How to build scalable web apps with OpenAI's Privacy Filter
Our principles
Our mission is to ensure that AGI benefits all of humanity. Sam Altman shares five principles that guide our work.
Build with DeepSeek V4 Using NVIDIA Blackwell and GPU-Accelerated Endpoints
DeepSeek just launched its fourth generation of flagship models with DeepSeek-V4-Pro and DeepSeek-V4-Flash, both targeted at enabling highly efficient…
Federated Learning Without the Refactoring Overhead Using NVIDIA FLARE
Federated learning (FL) is no longer a research curiosity—it’s a practical response to a hard constraint: the most valuable data is often the least mo…
DeepSeek-V4: a million-token context that agents can actually use
Winning a Kaggle Competition with Generative AI–Assisted Coding
In March 2026, three LLM agents generated over 600,000 lines of code, ran 850 experiments, and helped secure a first-place finish in a Kaggle playgrou…
GPT-5.5 System Card
Introducing GPT-5.5
Introducing GPT-5.5, our smartest model yet—faster, more capable, and built for complex tasks like coding, research, and data analysis across tools.
Automations
Learn how to automate tasks in Codex using schedules and triggers to create reports, summaries, and recurring workflows without manual effort.
Codex settings
Learn how to configure Codex settings, including personalization, detail level, and permissions, to run tasks smoothly and customize your workflow.
How to get started with Codex
Learn how to get started with Codex by setting up projects, creating threads, and completing your first tasks with step-by-step guidance.
Working with Codex
Learn how to set up your Codex workspace, create threads and projects, manage files, and start completing tasks with step-by-step guidance.
Plugins and skills
Learn how to use Codex plugins and skills to connect tools, access data, and follow repeatable workflows to automate tasks and improve results.
How to use Codex for everyday work
Explore 10 practical ChatGPT Codex use cases to automate tasks, create deliverables, and turn real inputs into outputs across tools, files, and workfl…
What is Codex?
Learn how Codex helps you go beyond chat by automating tasks, connecting tools, and producing real outputs like docs and dashboards.
GPT-5.5 Bio Bug Bounty
Explore the GPT-5.5 Bio Bug Bounty: a red-teaming challenge to find universal jailbreaks for bio safety risks, with rewards up to $25,000.
How to Use Transformers.js in a Chrome Extension
Simplify Sparse Deep Learning with Universal Sparse Tensor in nvmath-python
In a previous post, we introduced the Universal Sparse Tensor (UST), enabling developers to decouple a tensor’s sparsity from its memory layout for gr…
Scaling the AI-Ready Data Center with NVIDIA RTX PRO 4500 Blackwell Server Edition and NVIDIA vGPU 20
AI integration is redefining mainstream enterprise applications, from productivity software like Microsoft Office to more complex design and engineeri…
Advancing Emerging Optimizers for Accelerated LLM Training with NVIDIA Megatron
Higher-order optimization algorithms such as Shampoo have been effectively applied in neural network training for at least a decade. These methods hav…
Making ChatGPT better for clinicians
OpenAI makes ChatGPT for Clinicians free for verified U.S. physicians, nurse practitioners, and pharmacists, supporting clinical care, documentation,…
Introducing workspace agents in ChatGPT
Workspace agents in ChatGPT are Codex-powered agents that automate complex workflows, run in the cloud, and help teams scale work across tools securel…
Speeding up agentic workflows with WebSockets in the Responses API
A deep dive into the Codex agent loop, showing how WebSockets and connection-scoped caching reduced API overhead and improved model latency.
Workspace agents
Learn how to build, use, and scale workspace agents in ChatGPT to automate repeatable workflows, connect tools, and streamline team operations.
Introducing OpenAI Privacy Filter
OpenAI Privacy Filter is an open-weight model for detecting and redacting personally identifiable information (PII) in text with state-of-the-art accu…
Introducing ChatGPT Images 2.0
ChatGPT Images 2.0 introduces a state-of-the-art image generation model with improved text rendering, multilingual support, and advanced visual reason…
QIMMA قِمّة ⛰: A Quality-First Arabic LLM Leaderboard
Scaling Codex to enterprises worldwide
OpenAI launches Codex Labs, partners with Accenture, PwC, Infosys, and others to help enterprises deploy and scale Codex across the software deve…
AI and the Future of Cybersecurity: Why Openness Matters
Maximizing Memory Efficiency to Run Bigger Models on NVIDIA Jetson
The boom in open source generative AI models is pushing beyond data centers into machines operating in the physical world. Developers are eager to dep…
Run High-Throughput Reinforcement Learning Training with End-to-End FP8 Precision
As LLMs transition from simple text generation to complex reasoning, reinforcement learning (RL) plays a central role. Algorithms like Group Relative…
Mitigating Indirect AGENTS.md Injection Attacks in Agentic Environments
AI tools are significantly accelerating software development and changing how developers work with code. These tools serve as real-time copilots, auto…
OpenAI helps Hyatt advance AI among colleagues
Hyatt deploys ChatGPT Enterprise across its global workforce, using GPT-5.4 and Codex to improve productivity, operations, and guest experiences.
Full-Stack Optimizations for Agentic Inference with NVIDIA Dynamo
Coding agents are starting to write production code at scale. Stripe’s agents generate 1,300+ PRs per week. Ramp attributes 30% of merged PRs to agent…
Build a More Secure, Always-On Local AI Agent with OpenClaw and NVIDIA NemoClaw
Agents are evolving from question-and-answer systems into long-running autonomous assistants that read files, call APIs, and drive multi-step workflow…
Accelerate Clean, Modular, Nuclear Reactor Design with AI Physics
The development of socially acceptable nuclear reactors requires that they are safe, clean, efficient, economical, and sustainable. Meeting these requ…
How to Build Vision AI Pipelines Using NVIDIA DeepStream Coding Agents
Developing real-time vision AI applications presents a significant challenge for developers, often demanding intricate data pipelines, countless lines…
Codex for (almost) everything
The updated Codex app for macOS and Windows adds computer use, in-app browsing, image generation, memory, and plugins to accelerate developer workflow…
Introducing GPT-Rosalind for life sciences research
OpenAI introduces GPT-Rosalind, a frontier reasoning model built to accelerate drug discovery, genomics analysis, protein reasoning, and scientific re…
Accelerating the cyber defense ecosystem that protects us all
Leading security firms and enterprises join OpenAI’s Trusted Access for Cyber, using GPT-5.4-Cyber and $10M in API grants to strengthen global cyber d…
Ecom-RLVE: Adaptive Verifiable Environments for E-Commerce Conversational Agents
The PR you would have opened yourself
Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers
Inside VAKRA: Reasoning, Tool Use, and Failure Modes of Agents
The next evolution of the Agents SDK
OpenAI updates the Agents SDK with native sandbox execution and a model-native harness, helping developers build secure, long-running agents across fi…
Meet HoloTab by HCompany. Your AI browser companion.
Building Custom Atomistic Simulation Workflows for Chemistry and Materials Science with NVIDIA ALCHEMI Toolkit
For decades, computational chemistry has faced a tug-of-war between accuracy and speed. Ab initio methods like density functional theory (DFT) provide…
NVIDIA NVbandwidth: Your Essential Tool for Measuring GPU Interconnect and Memory Performance
When you’re writing CUDA applications, one of the most important things you need to focus on to write great code is data transfer performance. This ap…
NVIDIA Ising Introduces AI-Powered Workflows to Build Fault-Tolerant Quantum Systems
NVIDIA Ising is the world's first family of open AI models for building quantum processors, launching with two model domains: Ising Calibration and Is…
Trusted access for the next era of cyber defense
OpenAI expands its Trusted Access for Cyber program, introducing GPT-5.4-Cyber to vetted defenders and strengthening safeguards as AI cybersecurity ca…
Enterprises power agentic workflows in Cloudflare Agent Cloud with OpenAI
Cloudflare brings OpenAI’s GPT-5.4 and Codex to Agent Cloud, enabling enterprises to build, deploy, and scale AI agents for real-world tasks with spee…
MiniMax M2.7 Advances Scalable Agentic Workflows on NVIDIA Platforms for Complex AI Applications
The release of MiniMax M2.7 adds enhancements to the popular MiniMax M2.5 model, built for agentic harnesses,…
ChatGPT for research
Learn how to use ChatGPT for research to gather sources, analyze information, and create structured, citation-backed insights.
ChatGPT for operations teams
Learn how operations teams use ChatGPT to streamline workflows, improve coordination, standardize processes, and drive faster execution.
ChatGPT for customer success teams
Learn how customer success teams use ChatGPT to manage accounts, improve communication, reduce churn, and drive adoption and renewals.
Using projects in ChatGPT
Learn how to use projects in ChatGPT to organize chats, files, and instructions, manage ongoing work, and collaborate more effectively.
Working with files in ChatGPT
Learn how to upload and work with files in ChatGPT to analyze data, summarize documents, and generate content from PDFs, spreadsheets, and more.
Using custom GPTs
Learn how to build and use custom GPTs to automate workflows, maintain consistent outputs, and create purpose-built AI assistants.
Getting started with ChatGPT
Learn how to use ChatGPT, start your first conversation, and discover simple ways to write, brainstorm, and solve problems with AI.
Creating images with ChatGPT
Learn how to create and refine images with ChatGPT using clear prompts, iterate on designs, and generate high-quality visuals in minutes.
AI fundamentals
Learn what AI is, how it works, and how tools like ChatGPT use large language models. A clear, beginner-friendly guide to understanding artificial int…
Prompting fundamentals
Learn prompting fundamentals and how to write clear, effective prompts to get better, more useful responses from ChatGPT.
Applications of AI at OpenAI
Explore how OpenAI products like ChatGPT, Codex, and APIs bring AI into real-world use for work, development, and everyday tasks.
Analyzing data with ChatGPT
Learn how to analyze data with ChatGPT by exploring datasets, generating insights, creating visualizations, and turning findings into actionable decis…
ChatGPT for marketing teams
Learn how marketing teams use ChatGPT to plan campaigns, generate content, analyze performance, and move from ideas to execution faster.
Brainstorming with ChatGPT
Learn how to use ChatGPT to brainstorm ideas, organize thinking, and turn rough concepts into structured, actionable plans.
Writing with ChatGPT
Learn how to use ChatGPT for writing to draft, revise, and refine content with clear structure, tone, and intent.
Responsible and safe use of AI
Learn how to use AI responsibly with best practices for safety, accuracy, and transparency when using tools like ChatGPT.
Using skills
Learn how to create and use ChatGPT skills to build reusable workflows, automate recurring tasks, and ensure consistent, high-quality outputs.
Research with ChatGPT
Learn how to research with ChatGPT using search and deep research to find up-to-date information, analyze sources, and generate structured insights.
ChatGPT for managers
Learn how managers use ChatGPT to prepare for conversations, write clear feedback, stay organized, and improve team effectiveness.
ChatGPT for finance teams
Learn how finance teams use ChatGPT to streamline reporting, analyze data, improve forecasts, and communicate insights more clearly.
Financial services
Explore AI resources for financial services, including prompt packs, GPTs, guides, and tools to help institutions deploy and scale AI securely.
Our response to the Axios developer tool compromise
OpenAI responds to the Axios supply chain attack by rotating macOS code signing certificates, updating apps, and confirming no user data was compromis…
Healthcare
Explore how clinicians use ChatGPT to support diagnosis, documentation, and patient care with secure, HIPAA-compliant AI tools.
ChatGPT for sales teams
Learn how sales teams use ChatGPT to research accounts, personalize outreach, manage deals, and improve pipeline and conversion.
Personalizing ChatGPT
Learn how to personalize ChatGPT using custom instructions and memory to get more relevant, consistent, and tailored responses.
Running Large-Scale GPU Workloads on Kubernetes with Slurm
Slurm is an open source cluster management and job scheduling system for Linux. It manages job scheduling for over 65% of TOP500 systems. Most organiz…
Cut Checkpoint Costs with About 30 Lines of Python and NVIDIA nvCOMP
Training LLMs requires periodic checkpoints. These full snapshots of model weights, optimizer states, and gradients are saved to storage so training c…
How to Accelerate Protein Structure Prediction at Proteome-Scale
Proteins rarely function in isolation as individual monomers. Most biological processes are governed by proteins interacting with other proteins, form…
CyberAgent moves faster with ChatGPT Enterprise and Codex
CyberAgent uses ChatGPT Enterprise and Codex to securely scale AI adoption, improve quality, and accelerate decisions across advertising, media, and g…
Waypoint-1.5: Higher-Fidelity Interactive Worlds for Everyday GPUs
Multimodal Embedding & Reranker Models with Sentence Transformers
Integrate Physical AI Capabilities into Existing Apps with NVIDIA Omniverse Libraries
Physical AI—AI systems that perceive, reason, and act in physically grounded simulated environments—is changing how teams design and validate robots a…
The next phase of enterprise AI
OpenAI outlines the next phase of enterprise AI, as adoption accelerates across industries with Frontier, ChatGPT Enterprise, Codex, and company-wide…
Introducing the Child Safety Blueprint
Discover OpenAI’s Child Safety Blueprint—a roadmap for building AI responsibly with safeguards, age-appropriate design, and collaboration to protect a…
Safetensors is Joining the PyTorch Foundation
Running AI Workloads on Rack-Scale Supercomputers: From Hardware to Topology-Aware Scheduling
The NVIDIA GB200 NVL72 and NVIDIA GB300 NVL72 systems, featuring NVIDIA Blackwell architecture, are rack-scale supercomputers. They’re designed with 1…
Announcing the OpenAI Safety Fellowship
A pilot program to support independent safety and alignment research and develop the next generation of talent
Accelerating Vision AI Pipelines with Batch Mode VC-6 and NVIDIA Nsight
In vision AI systems, model throughput continues to improve. The surrounding pipeline stages must keep pace, including decode, preprocessing, and GPU.…
Bringing AI Closer to the Edge and On-Device with Gemma 4
The Gemmaverse expands with the launch of the latest Gemma 4 multimodal and multilingual models, designed to scale across the full spectrum of deploym…
Achieving Single-Digit Microsecond Latency Inference for Capital Markets
In algorithmic trading, reducing response times to market events is crucial. To keep pace with high-speed electronic markets, latency-sensitive firms…
OpenAI acquires TBPN
OpenAI acquires TBPN to accelerate global conversations around AI and support independent media, expanding dialogue with builders, businesses, and the…
Codex now offers more flexible pricing for teams
Codex now includes pay-as-you-go pricing for ChatGPT Business and Enterprise, providing teams a more flexible option to start and scale adoption.
Welcome Gemma 4: Frontier multimodal intelligence on device
CUDA Tile Programming Now Available for BASIC!
Note: CUDA Tile Programming in BASIC is an April Fools’ joke, but it's also real and actually works, demonstrating the flexibility of CUDA. CUDA 13.1…
NVIDIA Platform Delivers Lowest Token Cost Enabled by Extreme Co-Design
Co-designed hardware, software, and models are key to delivering the highest AI factory throughput and lowest token cost. Measuring this goes far beyo…
Accelerate Token Production in AI Factories Using Unified Services and Real-Time AI
In today’s AI factory environment, performance is not theoretical. It is economic, competitive, and existential. A 1% drop in usable GPU time can mean…
Falcon Perception
Gradient Labs gives every bank customer an AI account manager
Gradient Labs uses GPT-4.1 and GPT-5.4 mini and nano to power AI agents that automate banking support workflows with low latency and high reliability.
Any Custom Frontend with Gradio's Backend
Stream High-Fidelity Spatial Computing Content to Any Device with NVIDIA CloudXR 6.0
Spatial computing is moving from visualization to active collaboration, placing ever-greater GPU demands on XR hardware to render photorealistic,.…
Build and Stream Browser-Based XR Experiences with NVIDIA CloudXR.js
Delivering high-fidelity VR and AR experiences to enterprise users has typically required native application development, custom device management, an…
Granite 4.0 3B Vision: Compact Multimodal Intelligence for Enterprise Documents
Accelerating the next phase of AI
OpenAI raises $122 billion in new funding to expand frontier AI globally, invest in next-generation compute, and meet growing demand for ChatGPT, Code…
Training mRNA Language Models Across 25 Species for $165
TRL v1.0: Post-Training Library Built to Move with the Field
Helping disaster response teams turn AI into action across Asia
AI for Disaster Response in Asia: OpenAI Workshop with Gates Foundation
STADLER reshapes knowledge work at a 230-year-old company
Learn how STADLER uses ChatGPT to transform knowledge work, saving time and accelerating productivity across 650 employees.
Liberate your OpenClaw
Maximize AI Infrastructure Throughput by Consolidating Underutilized GPU Workloads
In production Kubernetes environments, the difference between model requirements and GPU size creates inefficiencies. Lightweight automatic speech rec…
How Centralized Radar Processing on NVIDIA DRIVE Enables Safer, Smarter Level 4 Autonomy
In the current state of automotive radar, machine learning engineers can't work with camera-equivalent raw RGB images. Instead, they work with the out…
Designing Protein Binders Using the Generative Model Proteina-Complexa
Developing new protein-based therapies and catalysts involves the challenging task of designing protein binders, or proteins that bind to a target pro…
Scaling Token Factory Revenue and AI Efficiency by Maximizing Performance per Watt
In the AI era, power is the ultimate constraint, and every AI factory operates within a hard limit. This makes performance per watt—the rate at which…
Inside our approach to the Model Spec
Learn how OpenAI’s Model Spec serves as a public framework for model behavior, balancing safety, user freedom, and accountability as AI systems advanc…
Introducing the OpenAI Safety Bug Bounty program
OpenAI launches a Safety Bug Bounty program to identify AI abuse and safety risks, including agentic vulnerabilities, prompt injection, and data exfil…
Building NVIDIA Nemotron 3 Agents for Reasoning, Multimodal RAG, Voice, and Safety
Agentic AI is an ecosystem where specialized models work together to handle planning, reasoning, retrieval, and safety guardrailing. As these systems…
Helping developers build safer AI experiences for teens
OpenAI releases prompt-based teen safety policies for developers using gpt-oss-safeguard, helping moderate age-specific risks in AI systems.
Update on the OpenAI Foundation
The OpenAI Foundation announces plans to invest at least $1 billion in curing diseases, economic opportunity, AI resilience, and community programs.
Powering product discovery in ChatGPT
ChatGPT introduces richer, visually immersive shopping powered by the Agentic Commerce Protocol, enabling product discovery, side-by-side comparisons,…
A New Framework for Evaluating Voice Agents (EVA)
NVIDIA IGX Thor Powers Industrial, Medical, and Robotics Edge AI Applications
Industrial and medical systems are rapidly increasing the use of high-performance AI to improve worker productivity, human-machine interaction, and do…
Building a Zero-Trust Architecture for Confidential AI Factories
AI is moving from experimentation to production. However, most of the data that enterprises need exists outside the public cloud. This includes sensitive informat…
Deploying Disaggregated LLM Inference Workloads on Kubernetes
As large language model (LLM) inference workloads grow in complexity, a single monolithic serving process starts to hit its limits. Prefill and decode…
Creating with Sora Safely
To address the novel safety challenges posed by a state-of-the-art video model as well as a new social creation platform, we’ve built Sora 2 and the S…
Build a Domain-Specific Embedding Model in Under a Day
How we monitor internal coding agents for misalignment
How OpenAI uses chain-of-thought monitoring to study misalignment in internal coding agents—analyzing real-world deployments to detect risks and stren…
OpenAI to acquire Astral
Accelerates Codex growth to power the next generation of Python developer tools
How to Build Deep Agents for Enterprise Search with NVIDIA AI-Q and LangChain
While consumer AI offers powerful capabilities, workplace tools often suffer from disjointed data and limited context. Built with LangChain, the NVIDI…
Building the AI Grid with NVIDIA: Orchestrating Intelligence Everywhere
AI-native services are exposing a new bottleneck in AI infrastructure: As millions of users, agents, and devices demand access to intelligence, the ch…
State of Open Source on Hugging Face: Spring 2026
Holotron-12B - High Throughput Computer Use Agent
OpenAI Japan announces Japan Teen Safety Blueprint to put teen safety first
OpenAI Japan announces the Japan Teen Safety Blueprint, introducing stronger age protections, parental controls, and well-being safeguards for teens u…
Introducing GPT-5.4 mini and nano
GPT-5.4 mini and nano are smaller, faster versions of GPT-5.4 optimized for coding, tool use, multimodal reasoning, and high-volume API and sub-agent…
Equipping workers with insights about compensation
New research shows Americans send nearly 3 million daily messages to ChatGPT asking about compensation and earnings, helping close the wage informatio…
Using Simulation to Build Robotic Systems for Hospital Automation
Healthcare faces a structural demand–capacity crisis: a projected global shortfall of ~10 million clinicians by 2030, billions of diagnostic exams ann…
Introducing NVIDIA BlueField-4-Powered CMX Context Memory Storage Platform for the Next Frontier of AI
AI‑native organizations increasingly face scaling challenges as agentic AI workflows drive context windows to millions of tokens and models scale towa…
How NVIDIA Dynamo 1.0 Powers Multi-Node Inference at Production Scale
Reasoning models are growing rapidly in size and are increasingly being integrated into agentic AI workflows that interact with other models and exter…
Scaling Autonomous AI Agents and Workloads with NVIDIA DGX Spark
Autonomous AI agents are driving the next wave of AI innovation. These agents must often manage long-running tasks that use multiple communication cha…
Design, Simulate, and Scale AI Factory Infrastructure with NVIDIA DSX Air
Building AI factories is complex and requires efficient integration across compute, networking, security, and storage systems. To achieve rapid Time t…
Why Codex Security Doesn’t Include a SAST Report
A deep dive into why Codex Security doesn’t rely on traditional SAST, instead using AI-driven constraint reasoning and validation to find real vulnera…
Designing AI agents to resist prompt injection
How ChatGPT defends against prompt injection and social engineering by constraining risky actions and protecting sensitive data in agent workflows.
From model to agent: Equipping the Responses API with a computer environment
How OpenAI built an agent runtime using the Responses API, shell tool, and hosted containers to run secure, scalable agents with files, tools, and sta…
Rakuten fixes issues twice as fast with Codex
Wayfair boosts catalog accuracy and support speed with OpenAI
Wayfair uses OpenAI models to improve ecommerce support and product catalog accuracy, automating ticket triage and enhancing millions of product attri…
Improving instruction hierarchy in frontier LLMs
IH-Challenge trains models to prioritize trusted instructions, improving instruction hierarchy, safety steerability, and resistance to prompt injectio…
New ways to learn math and science in ChatGPT
ChatGPT introduces interactive visual explanations for math and science, helping students explore formulas, variables, and concepts in real time.
Introducing Storage Buckets on the Hugging Face Hub
Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries
OpenAI to acquire Promptfoo
OpenAI is acquiring Promptfoo, an AI security platform that helps enterprises identify and remediate vulnerabilities in AI systems during development.
Ulysses Sequence Parallelism: Training with Million-Token Contexts
LeRobot v0.5.0: Scaling Every Dimension
Codex Security: now in research preview
Codex Security is an AI application security agent that analyzes project context to detect, validate, and patch complex vulnerabilities with higher co…
How Descript engineers multilingual video dubbing at scale
Using OpenAI reasoning models, Descript unlocked automatic localization of large content libraries without losing timing or meaning.
How Balyasny Asset Management built an AI research engine
By combining rigorous model evaluation, full-platform use of OpenAI, and agent workflows, Balyasny is reinventing investment research.
Bringing Robotics AI to Embedded Platforms: Dataset Recording, VLA Fine‑Tuning, and On‑Device Optimizations
Introducing GPT-5.4
Introducing GPT-5.4, OpenAI’s most capable and efficient frontier model for professional work, with state-of-the-art coding, computer use, tool s…
Reasoning models struggle to control their chains of thought, and that’s good
OpenAI introduces CoT-Control and finds reasoning models struggle to control their chains of thought, reinforcing monitorability as an AI safety safeg…
GPT-5.4 Thinking System Card
Ensuring AI use in education leads to opportunity
OpenAI shares new tools, certifications, and measurement resources to help schools and universities close AI capability gaps and expand opportunity.
VfL Wolfsburg turns ChatGPT into a club-wide capability
By focusing on people, not pilots, the Bundesliga club is scaling efficiency, creativity, and knowledge—without losing its football identity.
The five AI value models driving business reinvention
Five AI value models show how leaders can sequence AI from workforce fluency to process reinvention and build durable business advantage.
Introducing the Adoption news channel
Practical insights and frameworks to turn AI progress into business advantage
Introducing ChatGPT for Excel and new financial data integrations
OpenAI introduces ChatGPT for Excel and new financial app integrations, powered by GPT-5.4 to accelerate modeling, research, and analysis in regulated…
Introducing Modular Diffusers - Composable Building Blocks for Diffusion Pipelines
Extending single-minus amplitudes to gravitons
A new preprint extends single-minus amplitudes to gravitons, with GPT-5.2 Pro helping derive and verify nonzero graviton tree amplitudes in quantum gr…
How Axios uses AI to help deliver high-impact local journalism
Axios COO Allison Murphy explains how the company uses AI to support local reporters, streamline newsroom workflows, and deliver high-impact local jou…
Understanding AI and learning outcomes
OpenAI introduces the Learning Outcomes Measurement Suite to assess AI’s impact on student learning across diverse educational environments over time.
PRX Part 3 — Training a Text-to-Image Model in 24h!
GPT-5.3 Instant: Smoother, more useful everyday conversations
GPT-5.3 Instant System Card
Our agreement with the Department of War
Details on OpenAI’s contract with the Department of War, outlining safety red lines, legal protections, and how AI systems will be deployed in classif…
Scaling AI for everyone
Today we’re announcing $110B in new investment at a $730B pre-money valuation. This includes $30B from SoftBank, $30B from NVIDIA, and $50B from Amazo…
Introducing the Stateful Runtime Environment for Agents in Amazon Bedrock
Stateful Runtime for Agents in Amazon Bedrock brings persistent orchestration, memory, and secure execution to multi-step AI workflows powered by Open…
Joint Statement from OpenAI and Microsoft
Microsoft and OpenAI continue to work closely across research, engineering, and product development, building on years of deep collaboration and share…
OpenAI and Amazon announce strategic partnership
OpenAI and Amazon announce a strategic partnership bringing OpenAI’s Frontier platform to AWS, expanding AI infrastructure, custom models, and enterpr…
An update on our mental health-related work
OpenAI shares updates on its mental health safety work, including parental controls, trusted contacts, improved distress detection, and recent litigat…
Pacific Northwest National Laboratory and OpenAI partner to accelerate federal permitting
OpenAI and Pacific Northwest National Laboratory introduce DraftNEPABench, a new benchmark evaluating how AI coding agents can accelerate federal perm…
OpenAI Codex and Figma launch seamless code-to-design experience
OpenAI and Figma launch a new Codex integration that connects code and design, enabling teams to move between implementation and the Figma canvas to i…
Mixture of Experts (MoEs) in Transformers
Disrupting malicious uses of AI | February 2026
Our latest threat report examines how malicious actors combine AI models with websites and social platforms—and what it means for detection and defens…
Arvind KC appointed Chief People Officer
OpenAI appoints Arvind KC as Chief People Officer to help scale the company, strengthen its culture, and lead how work evolves in the age of AI.
Why we no longer evaluate SWE-bench Verified
SWE-bench Verified is increasingly contaminated and mismeasures frontier coding progress. Our analysis shows flawed tests and training leakage. We rec…
OpenAI announces Frontier Alliance Partners
OpenAI announces Frontier Alliance Partners to help enterprises move from AI pilots to production with secure, scalable agent deployments.
Our First Proof submissions
We share our AI model’s proof attempts for the First Proof math challenge, testing research-grade reasoning on expert-level problems.
GGML and llama.cpp join HF to ensure the long-term progress of Local AI
Train AI models with Unsloth and Hugging Face Jobs for FREE
Advancing independent research on AI alignment
OpenAI commits $7.5M to The Alignment Project to fund independent AI alignment research, strengthening global efforts to address AGI safety and securi…
Introducing OpenAI for India
OpenAI for India expands AI access across the country—building local infrastructure, powering enterprises, and advancing workforce skills.
IBM and UC Berkeley Diagnose Why Enterprise Agents Fail Using IT-Bench and MAST
Introducing EVMbench
OpenAI and Paradigm introduce EVMbench, a benchmark evaluating AI agents’ ability to detect, patch, and exploit high-severity smart contract vulnerabi…
One-Shot Any Web App with Gradio's gr.HTML
GPT-5.2 derives a new result in theoretical physics
A new preprint shows GPT-5.2 proposing a new formula for a gluon amplitude, later formally proved and verified by OpenAI and academic collaborators.
Introducing Lockdown Mode and Elevated Risk labels in ChatGPT
Introducing Lockdown Mode and Elevated Risk labels in ChatGPT to help organizations defend against prompt injection and AI-driven data exfiltration.
Scaling social science research
GABRIEL is a new open-source toolkit from OpenAI that uses GPT to turn qualitative text and images into quantitative data, helping social scientists a…
Beyond rate limits: scaling access to Codex and Sora
How OpenAI built a real-time access system combining rate limits, usage tracking, and credits to power continuous access to Sora and Codex.
Custom Kernels for All from Codex and Claude
Introducing GPT-5.3-Codex-Spark
Introducing GPT-5.3-Codex-Spark—our first real-time coding model. 15x faster generation, 128k context, now in research preview for ChatGPT Pro users.
OpenEnv in Practice: Evaluating Tool-Using Agents in Real-World Environments
Harness engineering: leveraging Codex in an agent-first world
By Ryan Lopopolo, Member of the Technical Staff
Bringing ChatGPT to GenAI.mil
OpenAI for Government announces the deployment of a custom ChatGPT on GenAI.mil, bringing secure, safety-forward AI to U.S. defense teams.
Transformers.js v4: Now Available on NPM!
Making AI work for everyone, everywhere: our approach to localization
OpenAI shares its approach to AI localization, showing how globally shared frontier models can be adapted to local languages, laws, and cultures witho…
Introducing SyGra Studio
GPT-5 lowers the cost of cell-free protein synthesis
An autonomous lab combining OpenAI’s GPT-5 with Ginkgo Bioworks’ cloud automation cut cell-free protein synthesis costs by 40% through closed-loop exp…
Introducing Trusted Access for Cyber
OpenAI introduces Trusted Access for Cyber, a trust-based framework that expands access to frontier cyber capabilities while strengthening safeguards…
Introducing OpenAI Frontier
OpenAI Frontier is an enterprise platform for building, deploying, and managing AI agents with shared context, onboarding, permissions, and governance…
GPT-5.3-Codex System Card
GPT‑5.3-Codex is the most capable agentic coding model to date, combining the frontier coding performance of GPT‑5.2-Codex with the reasoning and prof…
Introducing GPT-5.3-Codex
GPT-5.3-Codex is a Codex-native agent that pairs frontier coding performance with general reasoning to support long-horizon, real-world technical work…
Unlocking the Codex harness: how we built the App Server
Learn how to embed the Codex agent using the Codex App Server, a bidirectional JSON-RPC API powering streaming progress, tool use, approvals, and diff…
Community Evals: Because we're done trusting black-box leaderboards over the community
H Company's new Holo2 model takes the lead in UI Localization
The Future of the Global Open-Source AI Ecosystem: From DeepSeek to AI+
Training Design for Text-to-Image Models: Lessons from Ablations
The Sora feed philosophy
Discover the Sora feed philosophy—built to spark creativity, foster connections, and keep experiences safe with personalized recommendations, parental…
Snowflake and OpenAI partner to bring frontier intelligence to enterprise data
OpenAI and Snowflake partner in a $200M agreement to bring frontier intelligence into enterprise data, enabling AI agents and insights directly in Sno…
Introducing the Codex app
Introducing the Codex app for macOS—a command center for AI coding and software development with multiple agents, parallel workflows, and long-running…
Inside OpenAI’s in-house data agent
How OpenAI built an in-house AI data agent that uses GPT-5, Codex, and memory to reason over massive datasets and deliver reliable insights in minutes…
Retiring GPT-4o, GPT-4.1, GPT-4.1 mini, and OpenAI o4-mini in ChatGPT
On February 13, 2026, alongside the previously announced retirement of GPT‑5 (Instant, Thinking, and Pro), we will retire GPT‑4o, GPT‑4.1, GPT‑4.1 mi…
Taisei Corporation shapes the next generation of talent with AI
Taisei Corporation’s HR team is leading the rollout of ChatGPT Enterprise to drive AI-powered talent development across the organization.
Introducing Daggr: Chain apps programmatically, inspect visually
EMEA Youth & Wellbeing Grant
Apply for the EMEA Youth & Wellbeing Grant, a €500,000 program funding NGOs and researchers advancing youth safety and wellbeing in the age of AI.
The next chapter for AI in the EU
OpenAI launches the EU Economic Blueprint 2.0 with new data, partnerships, and initiatives to accelerate AI adoption, skills, and growth across Europe…
Keeping your data safe when an AI agent clicks a link
Learn how OpenAI protects user data when AI agents open links, preventing URL-based data exfiltration and prompt injection with built-in safeguards.
We Got Claude to Build CUDA Kernels and teach open models!
Architectural Choices in China's Open-Source AI Ecosystem: Building Beyond DeepSeek
Alyah ⭐️: Toward Robust Evaluation of Emirati Dialect Capabilities in Arabic LLMs
PVH reimagines the future of fashion with OpenAI
PVH Corp., parent company of Calvin Klein and Tommy Hilfiger, is adopting ChatGPT Enterprise to bring AI into fashion design, supply chain, and consum…
Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective
TRUSTBANK uses AI agents to personalize Furusato Nozei gifts
TRUSTBANK partnered with Recursive to build Choice AI using OpenAI models, enabling personalized conversational recommendations that simplify Furusato…
Introducing Prism
Prism is a free LaTeX-native workspace with GPT-5.2 built in, helping researchers write, collaborate, and reason in one place.
How Indeed uses AI to help evolve the job search
Indeed’s CRO Maggie Hulce shares how AI is transforming job search, recruiting, and talent acquisition for employers and job seekers.
Unrolling the Codex agent loop
A technical deep dive into the Codex agent loop, explaining how Codex CLI orchestrates models, tools, prompts, and performance using the Responses API…
Scaling PostgreSQL to power 800 million ChatGPT users
An inside look at how OpenAI scaled PostgreSQL to millions of queries per second using replicas, caching, rate limiting, and workload isolation.
Inside Praktika's conversational approach to language learning
How Praktika uses GPT-4.1 and GPT-5.2 to build adaptive AI tutors that personalize lessons, track progress, and help learners achieve real-world langu…
Inside GPT-5 for Work: How Businesses Use GPT-5
A data-driven report on how workers across industries use ChatGPT—covering adoption trends, top tasks, departmental patterns, and the future of AI at…
How Higgsfield turns simple ideas into cinematic social videos
Discover how Higgsfield gives creators cinematic, social-first video output from simple inputs using OpenAI GPT-4.1, GPT-5, and Sora 2.
AssetOpsBench: Bridging the Gap Between AI Agent Benchmarks and Industrial Reality
Introducing Edu for Countries
Edu for Countries is a new OpenAI initiative helping governments use AI to modernize education systems and build future-ready workforces.
How countries can end the capability overhang
Our latest report reveals stark differences in advanced AI adoption across countries and outlines new initiatives to help nations capture productivity…
Horizon 1000: Advancing AI for primary healthcare
OpenAI and the Gates Foundation launch Horizon 1000, a $50M pilot advancing AI capabilities for healthcare in Africa. The initiative aims to reach 1,0…
Stargate Community
Stargate Community plans detail a community-first approach to AI infrastructure, using locally tailored plans shaped by community input, energy needs,…
One Year Since the “DeepSeek Moment”
ServiceNow powers actionable enterprise AI with OpenAI
ServiceNow expands access to OpenAI frontier models to power AI-driven enterprise workflows, summarization, search, and voice across the ServiceNow Pl…
Differential Transformer V2
Our approach to age prediction
ChatGPT is rolling out age prediction to estimate if accounts are under or over 18, applying safeguards for teens and refining accuracy over time.
Introducing Waypoint-1: Real-time interactive video diffusion from Overworld
A business that scales with the value of intelligence
OpenAI’s business model scales with intelligence—spanning subscriptions, API, ads, commerce, and compute—driven by deepening ChatGPT adoption.
Our approach to advertising and expanding access to ChatGPT
OpenAI plans to test advertising in the U.S. for ChatGPT’s free and Go tiers to expand affordable access to AI worldwide, while protecting privacy, tr…
Introducing ChatGPT Go, now available worldwide
ChatGPT Go is now available worldwide, offering expanded access to GPT-5.2 Instant, higher usage limits, and longer memory—making advanced AI more aff…
Investing in Merge Labs
OpenAI is investing in Merge Labs to support new brain-computer interfaces that bridge biological and artificial intelligence to maximize human abilit…
Strengthening the U.S. AI supply chain through domestic manufacturing
OpenAI launches a new RFP to strengthen the U.S. AI supply chain by accelerating domestic manufacturing, creating jobs, and scaling AI infrastructure.
Open Responses: What you need to know
OpenAI partners with Cerebras
OpenAI partners with Cerebras to add 750MW of high-speed AI compute, reducing inference latency and making ChatGPT faster for real-time AI workloads.
Zenken boosts a lean sales team with ChatGPT Enterprise
By rolling out ChatGPT Enterprise company-wide, Zenken has boosted sales performance, cut preparation time, and increased proposal success rates. AI-s…
OpenAI and SoftBank Group partner with SB Energy
OpenAI and SoftBank Group partner with SB Energy to develop multi-gigawatt AI data center campuses, including a 1.2 GW Texas facility supporting the S…
Datadog uses Codex for system-level code review
OpenAI for Healthcare
OpenAI for Healthcare enables secure, enterprise-grade AI that supports HIPAA compliance—reducing administrative burden and supporting clinical workfl…
Netomi’s lessons for scaling agentic systems into the enterprise
How Netomi scales enterprise AI agents using GPT-4.1 and GPT-5.2—combining concurrency, governance, and multi-step reasoning for reliable production w…
How Tolan builds voice-first AI with GPT-5.1
Tolan built a voice-first AI companion with GPT-5.1, combining low-latency responses, real-time context reconstruction, and memory-driven personalitie…
Introducing ChatGPT Health
ChatGPT Health is a dedicated experience that securely connects your health data and apps, with privacy protections and a physician-informed design.
NVIDIA Cosmos Reason 2 Brings Advanced Reasoning To Physical AI
Introducing Falcon-H1-Arabic: Pushing the Boundaries of Arabic Language AI with Hybrid Architecture
NVIDIA brings agents to life with DGX Spark and Reachy Mini
Announcing OpenAI Grove Cohort 2
Applications are now open for OpenAI Grove Cohort 2, a 5-week founder program designed for individuals at any stage, from pre-idea to product. Partici…
AprielGuard: A Guardrail for Safety and Adversarial Robustness in Modern LLM Systems
One in a million: celebrating the customers shaping AI’s future
More than one million customers around the world now use OpenAI to empower their teams and unlock new opportunities. This post highlights how companie…
Continuously hardening ChatGPT Atlas against prompt injection
OpenAI is strengthening ChatGPT Atlas against prompt injection attacks using automated red teaming trained with reinforcement learning. This proactive…
Evaluating chain-of-thought monitorability
OpenAI introduces a new framework and evaluation suite for chain-of-thought monitorability, covering 13 evaluations across 24 environments. Our findin…
Updating our Model Spec with teen protections
OpenAI is updating its Model Spec with new Under-18 Principles that define how ChatGPT should support teens with safe, age-appropriate guidance ground…
Deepening our collaboration with the U.S. Department of Energy
OpenAI and the U.S. Department of Energy have signed a memorandum of understanding to deepen collaboration on AI and advanced computing in support of…
AI literacy resources for teens and parents
OpenAI shares new AI literacy resources to help teens and parents use ChatGPT thoughtfully, safely, and with confidence. The guides include expert-vet…
Addendum to GPT-5.2 System Card: GPT-5.2-Codex
Introducing GPT-5.2-Codex
GPT-5.2-Codex is OpenAI’s most advanced coding model, offering long-horizon reasoning, large-scale code transformations, and enhanced cybersecurity ca…
Tokenization in Transformers v5: Simpler, Clearer, and More Modular
The Open Evaluation Standard: Benchmarking NVIDIA Nemotron 3 Nano with NeMo Evaluator
Introducing OpenAI Academy for News Organizations
OpenAI is launching the OpenAI Academy for News Organizations, a new learning hub built with the American Journalism Project and The Lenfest Institute…
The state of enterprise AI
A data-driven look at enterprise AI adoption, showing how organizations move from experimentation to real productivity gains and new capabilities.
Developers can now submit apps to ChatGPT
Developers can now submit apps for review and publication in ChatGPT, with approved apps appearing in a new in-product directory for easy discovery. U…
Evaluating AI’s ability to perform scientific research tasks
OpenAI introduces FrontierScience, a benchmark testing AI reasoning in physics, chemistry, and biology to measure progress toward real scientific rese…
Measuring AI’s capability to accelerate biological research
OpenAI introduces a real-world evaluation framework to measure how AI can accelerate biological research in the wet lab. Using GPT-5 to optimize a mol…
The new ChatGPT Images is here
The new ChatGPT Images is powered by our flagship image generation model, delivering more precise edits, consistent details, and image generation up t…
Staying ahead in the age of AI
Discover how leaders can build AI-ready organizations using clear strategy, training, governance, and accelerated innovation.
CUGA on Hugging Face: Democratizing Configurable AI Agents
BNY builds “AI for everyone, everywhere” with OpenAI
BNY uses OpenAI to expand AI adoption enterprise-wide through Eliza, where 20,000+ employees build AI agents that improve efficiency and client outcom…
BBVA and OpenAI collaborate to transform global banking
BBVA is expanding its work with OpenAI through a multi-year AI transformation program, rolling out ChatGPT Enterprise to all 120,000 employees. Togeth…
How We Used Codex to Ship Sora for Android in 28 Days
OpenAI shipped Sora for Android in 28 days using Codex. AI-assisted planning, translation, and parallel coding workflows helped a nimble team deliver…
New in llama.cpp: Model Management
Advancing science and math with GPT-5.2
GPT-5.2 is OpenAI’s strongest model yet for math and science, setting new state-of-the-art results on benchmarks like GPQA Diamond and FrontierMath. T…
Introducing GPT-5.2
GPT-5.2 is our most advanced frontier model for everyday professional work, with state-of-the-art reasoning, long-context understanding, coding, and v…
Ten years
OpenAI reflects on ten years of progress, from early research breakthroughs to widely used AI systems that reshaped what’s possible. We share lessons…
How Podium is arming 10,000+ SMBs with AI agents
Discover how Podium used OpenAI’s GPT-5 to build “Jerry,” an AI teammate driving 300% growth and transforming how Main Street businesses serve custome…
The Walt Disney Company and OpenAI reach landmark agreement to bring beloved characters to Sora
Disney and OpenAI have reached an agreement to bring more than 200 Disney, Marvel, Pixar and Star Wars characters to Sora for fan-inspired short video…
Update to GPT-5 System Card: GPT-5.2
GPT-5.2 is the latest model family in the GPT-5 series. The comprehensive safety mitigation approach for these models is largely the same as that desc…
Codex is Open Sourcing AI models
Strengthening cyber resilience as AI capabilities advance
OpenAI is investing in stronger safeguards and defensive capabilities as AI models become more powerful in cybersecurity. We explain how we assess ris…
How Scout24 is building the next generation of real-estate search with AI
Scout24 has created a GPT-5 powered conversational assistant that reimagines real-estate search, guiding users with clarifying questions, summaries, a…
OpenAI co-founds Agentic AI Foundation, donates AGENTS.md
OpenAI co-founds the Agentic AI Foundation under the Linux Foundation and donates AGENTS.md to support open, interoperable standards for safe agentic…
Launching our first OpenAI Certifications courses
Learn how OpenAI’s new certifications and AI Foundations courses help people build real-world AI skills, boost career opportunities, and prepare for t…
OpenAI appoints Denise Dresser as Chief Revenue Officer
Denise Dresser is joining as Chief Revenue Officer, overseeing OpenAI’s global revenue strategy across enterprise and customer success. She will help…
Commonwealth Bank of Australia builds AI fluency at scale
Commonwealth Bank of Australia partners with OpenAI to roll out ChatGPT Enterprise to 50,000 employees, building AI fluency at scale to improve custom…
Bringing powerful AI to millions across Europe with Deutsche Telekom
OpenAI is collaborating with Deutsche Telekom to bring advanced, multilingual AI experiences to millions of people across Europe. ChatGPT Enterprise w…
Instacart and OpenAI partner on AI shopping experiences
OpenAI and Instacart are deepening their longstanding partnership by bringing the first fully integrated grocery shopping and Instant Checkout payment…
The state of enterprise AI
Key findings from OpenAI’s enterprise data show accelerating AI adoption, deeper integration, and measurable productivity gains across industries in 2…
How Virgin Atlantic uses AI to enhance every step of travel
Virgin Atlantic CFO Oliver Byers shares how the airline is using AI to speed up development, improve decision-making, and elevate customer experience.
Introducing swift-huggingface: The Complete Swift Client for Hugging Face
Introducing OpenAI for Australia
OpenAI is launching OpenAI for Australia to build sovereign AI infrastructure, upskill more than 1.5 million workers, and accelerate innovation across…
DeepMath: A lightweight math reasoning Agent with smolagents
We Got Claude to Fine-Tune an Open Source LLM
OpenAI to acquire Neptune
OpenAI is acquiring Neptune to deepen visibility into model behavior and strengthen the tools researchers use to track experiments and monitor trainin…
How confessions can keep language models honest
OpenAI researchers are testing “confessions,” a method that trains models to admit when they make mistakes or act undesirably, helping improve AI hone…
Announcing the initial People-First AI Fund grantees
The OpenAI Foundation announces the initial recipients of the People-First AI Fund, awarding $40.5M in unrestricted grants to 208 nonprofits supportin…
Inside Mirakl's agentic commerce vision
Mirakl is redefining commerce through AI agents and ChatGPT Enterprise—achieving faster documentation, smarter customer support, and building toward a…
Funding grants for new research into AI and mental health
OpenAI is awarding up to $2 million in grants for research at the intersection of AI and mental health. The program supports projects that study real-…
OpenAI and NORAD team up to bring new magic to “NORAD Tracks Santa”
OpenAI and NORAD are bringing new magic to “NORAD Tracks Santa” with three ChatGPT holiday tools that let families create festive elves, toy coloring…
OpenAI takes an ownership stake in Thrive Holdings to accelerate enterprise AI adoption
OpenAI takes an ownership stake in Thrive Holdings to accelerate enterprise AI adoption, embedding frontier research and engineering directly into acc…
Accenture and OpenAI accelerate enterprise AI success
Accenture and OpenAI are collaborating to help enterprises bring agentic AI capabilities into the core of their business and unlock new levels of grow…
Transformers v5: Simple model definitions powering the AI ecosystem
Mixpanel security incident: what OpenAI users need to know
OpenAI shares details about a Mixpanel security incident involving limited API analytics data. No API content, credentials, or payment details were ex…
Expanding data residency access to business customers worldwide
OpenAI expands data residency for ChatGPT Enterprise, ChatGPT Edu, and the API Platform, enabling eligible customers to store data at rest in-region.
Inside JetBrains—the company reshaping how the world writes code
JetBrains is integrating GPT-5 across its coding tools, helping millions of developers design, reason, and build software faster.
Diffusers welcomes FLUX-2
Continuous batching from first principles
Building Deep Research: How we Achieved State of the Art
OVHcloud on Hugging Face Inference Providers 🔥
Introducing shopping research in ChatGPT
Shopping research in ChatGPT helps you explore, compare, and discover products with personalized buyer’s guides that simplify decision-making.
GPT-5 and the future of mathematical discovery
UCLA Professor Ernest Ryu and GPT-5 solved a key question in optimization theory, showcasing AI’s role in accelerating mathematical discovery.
20x Faster TRL Fine-tuning with RapidFire AI
Open ASR Leaderboard: Trends and Insights with New Multilingual & Long-Form Tracks
OpenAI and Foxconn collaborate to strengthen U.S. manufacturing across the AI supply chain
OpenAI and Foxconn are collaborating to design and manufacture next-generation AI infrastructure hardware in the U.S. The partnership will develop mul…
Helping 1,000 small businesses build with AI
OpenAI is partnering with DoorDash, SCORE, and local organizations to help 1,000 small businesses build with AI. The Small Business AI Jam gives Main…
Early experiments in accelerating science with GPT-5
OpenAI introduces the first research cases showing how GPT-5 accelerates scientific progress across math, physics, biology, and computer science. Expl…
Introducing AnyLanguageModel: One API for Local and Remote LLMs on Apple Platforms
Strengthening our safety ecosystem with external testing
OpenAI works with independent experts to evaluate frontier AI systems. Third-party testing strengthens safety, validates safeguards, and increases tra…
How evals drive the next chapter in AI for businesses
Learn how evals help businesses define, measure, and improve AI performance—reducing risk, boosting productivity, and driving strategic advantage.
OpenAI and Target team up on new AI-powered experiences
OpenAI and Target are partnering to bring a new Target app to ChatGPT, offering personalized shopping and faster checkout. Target will also expand its…
Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models
A free version of ChatGPT built for teachers
ChatGPT for Teachers is a secure workspace with education‑grade privacy and admin controls. Free for verified U.S. K–12 educators through June 2027.
Building more with GPT-5.1-Codex-Max
Introducing GPT-5.1-Codex-Max, a faster, more intelligent agentic coding model for Codex. The model is designed for long-running, project-scale work w…
How Scania accelerates work with AI across its global workforce
Global manufacturer Scania is scaling AI with ChatGPT Enterprise. With team-based onboarding and strong guardrails, AI is boosting productivity, quali…
GPT-5.1-Codex-Max System Card
This system card outlines the comprehensive safety measures implemented for GPT-5.1-Codex-Max. It details both model-level mitigations, such as special…
Intuit and OpenAI join forces on new AI-powered experiences
OpenAI and Intuit have entered a $100M+ multi-year partnership to launch Intuit app experiences in ChatGPT and expand Intuit’s use of OpenAI’s frontie…
OpenAI named Emerging Leader in Generative AI
OpenAI has been named an Emerging Leader in Gartner’s 2025 Innovation Guide for Generative AI Model Providers. The recognition reflects our enterprise…
Easily Build and Share ROCm Kernels with Hugging Face
Introducing OpenAI for Ireland
OpenAI launches OpenAI for Ireland, partnering with the Irish Government, Dogpatch Labs and Patch to help SMEs, founders and young builders use AI to…
Join the AMD Open Robotics Hackathon
Understanding neural networks through sparse circuits
OpenAI is exploring mechanistic interpretability to understand how neural networks reason. Our new sparse model approach could make AI systems more tr…
Introducing group chats in ChatGPT
Collaborate with others, and ChatGPT, in the same conversation.
How Philips is scaling AI literacy across 70,000 employees
Philips is scaling AI literacy with ChatGPT Enterprise, training 70,000 employees to use AI responsibly and improve healthcare outcomes worldwide.
Introducing GPT-5.1 for developers
GPT-5.1 is now available in the API, bringing faster adaptive reasoning, extended prompt caching, improved coding performance, and new apply_patch and…
Building for an Open Future - our new partnership with Google Cloud
Neuro drives national retail wins with ChatGPT Business
Neuro uses ChatGPT Business to scale nationwide with fewer than 70 employees, saving time, reducing costs, and turning faster execution across sales a…
Fighting the New York Times’ invasion of user privacy
OpenAI is fighting the New York Times’ demand for 20 million private ChatGPT conversations and accelerating new security and privacy protections to pr…
GPT-5.1: A smarter, more conversational ChatGPT
We’re upgrading the GPT-5 series with warmer, more capable models and new ways to customize ChatGPT’s tone and style. GPT-5.1 starts rolling out today…
GPT-5.1 Instant and GPT-5.1 Thinking System Card Addendum
This GPT-5 system card addendum provides updated safety metrics for GPT-5.1 Instant and Thinking, including new evaluations for mental health and emot…
Free ChatGPT for transitioning U.S. servicemembers and veterans
OpenAI is offering U.S. servicemembers and veterans within 12 months of retirement or separation a free year of ChatGPT Plus to support their transiti…
Understanding prompt injections: a frontier security challenge
Prompt injections are a frontier security challenge for AI systems. Learn how these attacks work and how OpenAI is advancing research, training models…
Introducing the Teen Safety Blueprint
Discover OpenAI’s Teen Safety Blueprint—a roadmap for building AI responsibly with safeguards, age-appropriate design, and collaboration to protect an…
AI progress and recommendations
AI is advancing fast. We have the chance to shape its progress—toward discovery, safety, and a better future for everyone.
How CRED is tapping AI to deliver premium customer experiences
CRED is improving premium customer experiences in India with OpenAI, using GPT-powered tools to boost support accuracy, cut response times, and raise…
How Chime is redefining marketing through AI
Chime CMO Vineet Mehra shares how AI is reshaping marketing into an agent-driven model and why leaders who prioritize AI literacy and thoughtful adopt…
1 million business customers putting AI to work
More than 1 million business customers around the world now use OpenAI. Across healthcare, life sciences, financial services, and more, ChatGPT and ou…
Introducing IndQA
OpenAI introduces IndQA, a new benchmark for evaluating AI systems in Indian languages. Built with domain experts, IndQA tests cultural understanding…
AWS and OpenAI announce multi-year strategic partnership
OpenAI and AWS have entered a multi-year, $38 billion partnership to scale advanced AI workloads. AWS will provide world-class infrastructure and comp…
Expanding Stargate to Michigan
OpenAI is expanding Stargate to Michigan with a new one-gigawatt campus that strengthens America’s AI infrastructure. The project will create jobs, dr…
Introducing Aardvark: OpenAI’s agentic security researcher
OpenAI introduces Aardvark, an AI-powered security researcher that autonomously finds, validates, and helps fix software vulnerabilities at scale. The…
Aligning to What? Rethinking Agent Generalization in MiniMax M2
How we built OWL, the new architecture behind our ChatGPT-based browser, Atlas
A deep dive into OWL, the new architecture powering ChatGPT Atlas—decoupling Chromium, enabling fast startup, rich UI, and agentic browsing with ChatG…
On the Shifting Global Compute Landscape
Introducing gpt-oss-safeguard
OpenAI introduces gpt-oss-safeguard—open-weight reasoning models for safety classification that let developers apply and iterate on custom policies.
gpt-oss-safeguard technical report
gpt-oss-safeguard-120b and gpt-oss-safeguard-20b are two open-weight reasoning models post-trained from the gpt-oss models and trained to reason from…
Building a Healthcare Robot from Simulation to Deployment with NVIDIA Isaac
How to Build a Healthcare Robot from Simulation to Deployment with NVIDIA Isaac for Healthcare
Advancing organizational transformation for business innovation
DNP rolled out ChatGPT Enterprise across ten core departments, achieving 95% faster patent research, 10x processing volume, 87% automation, and 70% kn…
Granite 4.0 Nano: Just how small can you go?
Doppel’s AI defense system stops attacks before they spread
Doppel uses GPT-5 and reinforcement fine-tuning to stop deepfake and impersonation attacks, cutting analyst workloads by 80% and reducing response tim…
The next chapter of the Microsoft–OpenAI partnership
Microsoft and OpenAI sign a new agreement that strengthens their long-term partnership, expands innovation, and ensures responsible AI progress.
Built to benefit everyone
OpenAI’s recapitalization strengthens mission-focused governance, expanding resources to ensure AI benefits everyone while advancing innovation respon…
Voice Cloning with Consent
Seizing the AI opportunity
Meeting the demands of the Intelligence Age will require strategic investment in energy and infrastructure. OpenAI’s submission to the White House det…
Strengthening ChatGPT’s responses in sensitive conversations
OpenAI collaborated with 170+ mental health experts to improve ChatGPT’s ability to recognize distress, respond empathetically, and guide users toward…
Addendum to GPT-5 System Card: Sensitive conversations
This system card details GPT-5’s improvements in handling sensitive conversations, including new benchmarks for emotional reliance, mental health, and…
Steuerrecht.com delivers client-ready legal analysis with ChatGPT
Steuerrecht.com uses ChatGPT Business to streamline legal workflows, automate tax research, and deliver faster, client-ready analysis for law firms.
Streaming datasets: 100x More Efficient
huggingface_hub v1.0: Five Years of Building the Foundation of Open Machine Learning
LeRobot v0.4.0: Supercharging OSS Robot Learning
OpenAI acquires Software Applications Incorporated, maker of Sky
OpenAI has acquired Software Applications Incorporated, maker of Sky—a natural language interface for Mac that brings AI directly into your desktop ex…
Consensus accelerates research with GPT-5 and Responses API
Consensus uses GPT-5 and OpenAI’s Responses API to power a multi-agent research assistant that reads, analyzes, and synthesizes evidence in minutes—he…
Work smarter with your company knowledge in ChatGPT
Company knowledge brings context from your apps into ChatGPT for answers specific to your business, with clear citations, security, privacy, and admin…
AI in South Korea—OpenAI’s Economic Blueprint
OpenAI's Korea Economic Blueprint outlines how South Korea can scale trusted AI through sovereign capabilities and strategic partnerships to drive gro…
Building the Open Agent Ecosystem Together: Introducing OpenEnv
The next chapter for UK sovereign AI
OpenAI expands its UK partnership with a new Ministry of Justice agreement, bringing ChatGPT to civil servants. It also introduces UK data residency f…
AI in Japan—OpenAI’s Japan Economic Blueprint
OpenAI’s Japan Economic Blueprint outlines how Japan can harness AI to boost innovation, strengthen competitiveness, and enable sustainable, inclusive…
Hugging Face and VirusTotal collaborate to strengthen AI security
Sentence Transformers is joining Hugging Face!
Continue your ChatGPT experience beyond WhatsApp
ChatGPT will no longer be available on WhatsApp after January 15, 2026. Learn how to link your ChatGPT account and continue your conversations across…
Introducing ChatGPT Atlas, the browser with ChatGPT built in
ChatGPT Atlas, the browser with ChatGPT built in. Get instant answers, summaries, and smart web help—right from any page. With privacy settings you ca…
Supercharge your OCR Pipelines with Open Models
Unlock the power of images with AI Sheets
AI for Food Allergies
Google Cloud C4 Brings a 70% TCO improvement on GPT OSS with Intel and Hugging Face
Plex Coffee delivers fast, personal service with ChatGPT
Learn how Plex Coffee uses ChatGPT Business to centralize knowledge, train staff faster, and preserve personal connections while expanding.
Get your VLM running in 3 simple steps on Intel CPUs
Expert Council on Well-Being and AI
OpenAI’s new Expert Council on Well-Being and AI brings together leading psychologists, clinicians, and researchers to guide how ChatGPT supports emot…
Argentina’s AI opportunity
OpenAI and Sur Energy are exploring Argentina’s first Stargate project—an AI and clean energy collaboration that could make Argentina a Latin American…
Nemotron-Personas-India: Synthesized Data for Sovereign AI
OpenAI and Broadcom announce strategic collaboration to deploy 10 gigawatts of OpenAI-designed AI accelerators
OpenAI and Broadcom announce a multi-year partnership to deploy 10 gigawatts of OpenAI-designed AI accelerators, co-developing next-generation systems…
Arm will be @ PyTorch Conference, Join Us!
HYGH speeds development and campaigns with ChatGPT Business
HYGH speeds up software development and campaign delivery with ChatGPT Business, cutting turnaround times, scaling output, and driving revenue growth.
Defining and evaluating political bias in LLMs
Learn how OpenAI evaluates political bias in ChatGPT through new real-world testing methods that improve objectivity and reduce bias.
HiBob turns 2,500 GPTs into product and team growth
Discover how HiBob uses ChatGPT Enterprise and custom GPTs to scale AI adoption, boost revenue, streamline HR workflows, and deliver AI-powered featur…
BigCodeArena: Judging code generations end to end with code executions
Disrupting malicious uses of AI: October 2025
Discover how OpenAI is detecting and disrupting malicious uses of AI in our October 2025 report. Learn how we’re countering misuse, enforcing policies…
Codex is now generally available
OpenAI Codex is now generally available with powerful new features for developers: a Slack integration, Codex SDK, and admin tools like usage dashboar…
Introducing apps in ChatGPT and the new Apps SDK
We’re introducing a new generation of apps you can chat with, right inside ChatGPT. Developers can start building them today with the new Apps SDK, av…
AMD and OpenAI announce strategic partnership to deploy 6 gigawatts of AMD GPUs
AMD and OpenAI have announced a multi-year partnership to deploy 6 gigawatts of AMD Instinct GPUs, beginning with 1 gigawatt in 2026, to power OpenAI’…
Introducing AgentKit, new Evals, and RFT for agents
Today, we’re releasing new tools to help developers go from prototype to production faster: AgentKit, expanded evals capabilities, and reinforcement…
Accelerating AI adoption in Europe
OpenAI and Allied for Startups release the Hacktivate AI report with 20 actionable policy ideas to accelerate AI adoption in Europe, boost competitive…
With GPT-5, Wrtn builds lifestyle AI for millions in Korea
Wrtn scaled AI apps to 6.5M users in Korea with GPT-5, creating ‘Lifestyle AI’ that blends productivity, creativity, and learning—now expanding across…
OpenAI announces strategic collaboration with Japan’s Digital Agency
OpenAI and Japan’s Digital Agency partner to advance generative AI in public services, support international AI governance, and promote safe, trustwor…
SOTA OCR with Core ML and dots.ocr
Samsung and SK join OpenAI’s Stargate initiative to advance global AI infrastructure
Samsung and SK join OpenAI’s Stargate initiative to expand global AI infrastructure, scaling advanced memory chip production and building next-gen dat…
Introducing RTEB: A New Standard for Retrieval Evaluation
Sora 2 is here
Our latest video generation model is more physically accurate, realistic, and controllable than prior systems. It also features synchronized dialogue…
Sora 2 System Card
Sora 2 is our new state-of-the-art video and audio generation model. Building on the foundation of Sora, this new model introduces capabilities that h…
Launching Sora responsibly
To address the novel safety challenges posed by a state-of-the-art video model as well as a new social creation platform, we’ve built Sora 2 and the S…
Converting inbound leads into customers at OpenAI
Learn how OpenAI used AI to deliver personalized answers at scale, converting inbound leads into customers.
Empowering teams to unlock insights faster at OpenAI
OpenAI’s research assistant helps teams analyze millions of support tickets, surface insights faster, and scale curiosity across the company.
Building OpenAI with OpenAI
At OpenAI, we rely on our own technology to help streamline work, scale expertise, and drive outcomes. In our new series, OpenAI on OpenAI, we share l…
Turning contracts into searchable data at OpenAI
OpenAI built a system to extract contract data quickly, cutting turnaround times and making it easier for teams to access the details they need.
Driving sales productivity and customer success at OpenAI
Learn how OpenAI boosts sales productivity by automating prep, centralizing knowledge, and scaling top-selling practices.
Improving support with every interaction at OpenAI
Learn how OpenAI uses AI to enhance support, cutting response times, improving quality, and scaling to meet hypergrowth.
Combating online child sexual exploitation & abuse
Discover how OpenAI combats online child sexual exploitation and abuse with strict usage policies, advanced detection tools, and industry collaboratio…
Introducing parental controls
We’re rolling out parental controls and a new parent resource page to help families guide how ChatGPT works in their homes.
Buy it in ChatGPT: Instant Checkout and the Agentic Commerce Protocol
We’re taking first steps toward agentic commerce in ChatGPT with new ways for people, AI agents, and businesses to shop together.
Accelerating Qwen3-8B Agent on Intel® Core™ Ultra with Depth-Pruned Draft Models
VibeGame: Exploring Vibe Coding Games
Nemotron-Personas-Japan: A Synthetic Dataset for Sovereign AI
Partnering with AARP to help keep older adults safe online
OpenAI and AARP are partnering to help older adults stay safe online with new AI training, scam-spotting tools, and nationwide programs through OpenAI…
Swift Transformers Reaches 1.0 – and Looks to the Future
More ways to work with your team and tools in ChatGPT
New shared projects, smarter connectors, and compliance and security updates help teams get more done.
Measuring the performance of our models on real-world tasks
OpenAI introduces GDPval, a new evaluation that measures model performance on real-world economically valuable tasks across 44 occupations.
Introducing ChatGPT Pulse
Today we're releasing a preview of ChatGPT Pulse to Pro users on mobile. Pulse is a new experience where ChatGPT proactively does research to deliver…
ENEOS Materials brings ChatGPT Enterprise to manufacturing
ENEOS Materials uses ChatGPT Enterprise to speed research, improve plant design safety, and cut HR analysis time by 90%, with 80% reporting better wor…
SAP and OpenAI partner to launch sovereign ‘OpenAI for Germany’
SAP and OpenAI launch OpenAI for Germany, a 2026 partnership to bring secure, sovereign AI to Germany’s public sector, enabling safe, efficient public…
OpenAI, Oracle, and SoftBank expand Stargate with five new AI datacenter sites
OpenAI, Oracle, and SoftBank announce five new Stargate AI datacenter sites, accelerating a $500B, 10-gigawatt U.S. infrastructure buildout to power n…
Smol2Operator: Post-Training GUI Agents for Computer Use
CNA is transforming its newsroom with AI
In this Executive Function series from OpenAI, discover how CNA is transforming its newsroom with AI. Editor-in-Chief Walter Fernandez shares insights…
SchoolAI builds an AI platform that empowers teachers
SchoolAI uses GPT-4.1, image generation, and TTS to power safe, teacher-guided AI tools for over 1 million classrooms, improving engagement, oversight…
OpenAI and NVIDIA announce strategic partnership to deploy 10 gigawatts of NVIDIA systems
OpenAI and NVIDIA announce a strategic partnership to deploy 10 gigawatts of AI datacenters powered by NVIDIA systems, with the first phase launching…
SyGra: The One-Stop Framework for Building Data for LLMs and SLMs
Gaia2 and ARE: Empowering the community to study agents
Scaleway on Hugging Face Inference Providers 🔥
Democratizing AI Safety with RiskRubric.ai
Detecting and reducing scheming in AI models
Apollo Research and OpenAI developed evaluations for hidden misalignment (“scheming”) and found behaviors consistent with scheming in controlled tests…
Public AI on Hugging Face Inference Providers 🔥
Introducing Stargate UK
Building towards age prediction
Learn how OpenAI is building age prediction and parental controls in ChatGPT to create safer, age-appropriate experiences for teens while supporting f…
Teen safety, freedom, and privacy
Explore OpenAI’s approach to balancing teen safety, freedom, and privacy in AI use.
`LeRobotDataset:v3.0`: Bringing large-scale datasets to `lerobot`
Introducing upgrades to Codex
Codex just got faster, more reliable, and better at real-time collaboration and tackling tasks independently anywhere you develop—whether via the term…
How people are using ChatGPT
New research from the largest study of ChatGPT use shows how the tool creates economic value through both personal and professional use. Adoption is b…
Addendum to GPT-5 system card: GPT-5-Codex
This addendum to the GPT-5 system card shares a new model: GPT-5-Codex, a version of GPT-5 further optimized for agentic coding in Codex. GPT-5-Codex…
Visible Watermarking with Gradio
Working with US CAISI and UK AISI to build more secure AI systems
OpenAI shares progress on the partnership with the US CAISI and UK AISI to strengthen AI safety and security.
Introducing the Palmyra-mini family: Powerful, lightweight, and ready to reason!
A joint statement from OpenAI and Microsoft
OpenAI and Microsoft sign a new MOU, reinforcing their partnership and shared commitment to AI safety and innovation.
Statement on OpenAI’s Nonprofit and PBC
OpenAI reaffirms its nonprofit leadership with a new structure granting equity in its PBC, enabling over $100B in resources to advance safe, beneficia…
Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers
Fine-tune Any LLM from the Hugging Face Hub with Together AI
Jupyter Agents: training LLMs to reason with notebooks
SafetyKit scales risk agents with OpenAI’s most capable models
Discover how SafetyKit leverages OpenAI GPT-5 to enhance content moderation, enforce compliance, and outpace legacy safety systems with greater accura…
mmBERT: ModernBERT goes Multilingual
A People-First AI Fund: $50M to support nonprofits
Applications are now open for OpenAI’s People-First AI Fund, a $50M initiative supporting U.S. nonprofits advancing education, community innovation, a…
Why language models hallucinate
OpenAI’s new research explains why language models hallucinate. The findings show how improved evaluations can enhance AI reliability, honesty, and sa…
OpenAI and Greek Government launch ‘OpenAI for Greece’
OpenAI and the Greek Government have launched “OpenAI for Greece” to bring ChatGPT Edu into secondary schools and support responsible AI learning. Thi…
Expanding economic opportunity with AI
OpenAI is launching a Jobs Platform and new Certifications to connect workers with jobs, training, and certifications. Learn how we’re expanding econo…
Welcome EmbeddingGemma, Google's new efficient embedding model
SAIR: Accelerating Pharma R&D with AI-Powered Structural Intelligence
Vijaye Raji to become CTO of Applications with acquisition of Statsig
Vijaye Raji will step into a new role as CTO of Applications, reporting to CEO of Applications, Fidji Simo, following the acquisition of Statsig.
Building more helpful ChatGPT experiences for everyone
We’re partnering with experts, strengthening protections for teens with parental controls, and routing sensitive conversations to reasoning models in…
Make your ZeroGPU Spaces go brrr with ahead-of-time compilation
Introducing gpt-realtime and Realtime API updates
We’re releasing a more advanced speech-to-speech model and new API capabilities including MCP server support, image input, and SIP phone calling suppo…
Supporting nonprofit and community innovation
OpenAI launches a $50M People-First AI Fund to help U.S. nonprofits scale impact with AI. Applications open Sept 8–Oct 8, 2025 for grants in education…
Collective alignment: public input on our Model Spec
OpenAI surveyed over 1,000 people worldwide on how AI should behave and compared their views to our Model Spec. Learn how collective alignment is shap…
OpenAI and Anthropic share findings from a joint safety evaluation
OpenAI and Anthropic share findings from a first-of-its-kind joint safety evaluation, testing each other’s models for misalignment, instruction follow…
Helping people when they need it most
How we think about safety for users experiencing mental or emotional distress, the limits of today’s systems, and the work underway to refine them.
Announcing the OpenAI Learning Accelerator
Accelerating life sciences research
Discover how a specialized AI model, GPT-4b micro, helped OpenAI and Retro Bio engineer more effective proteins for stem cell therapy and longevity re…
Scaling domain expertise in complex, regulated domains
Discover how Blue J is transforming tax research with AI-powered tools built on GPT-4.1. By combining domain expertise with Retrieval-Augmented Genera…
NVIDIA Releases 6 Million Multi-Lingual Reasoning Dataset
Mixi reimagines communication with ChatGPT
Discover how MIXI, a leader in digital entertainment and lifestyle services in Japan, uses ChatGPT Enterprise to transform productivity, boost AI adop…
Generate Images with Claude and Hugging Face
Q&A with DoorDash’s CPO, Mariana Garavaglia
Learn how DoorDash is scaling AI adoption to empower employees to build, learn, and innovate faster in a conversation with Chief People Officer Marian…
From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels
MCP for Research: How to Connect AI to Research Tools
Kimina-Prover-RL
Arm & ExecuTorch 0.7: Bringing Generative AI to the masses
Neural Super Sampling is here!
OpenAI’s letter to Governor Newsom on harmonized regulation
We’ve just sent a letter to Gov. Gavin Newsom calling for California to lead the way in harmonizing state-based AI regulation with national—and, by vi…
Scaling accounting capacity with OpenAI
Built with OpenAI o3, o3-Pro, GPT-4.1, and GPT-5, Basis’ AI agents help accounting firms save up to 30% of their time and expand capacity for advisory…
TextQuests: How Good are LLMs at Text-Based Video Games?
🇵🇭 FilBench - Can LLMs Understand and Generate Filipino?
Introducing AI Sheets: a tool to work with datasets using open AI models!
Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training
GPT-5 and the new era of work
GPT-5 is OpenAI’s most advanced model—transforming enterprise AI, automation, and workforce productivity in the new era of intelligent work.
Introducing GPT-5 for developers
Introducing GPT-5 in our API platform—offering high reasoning performance, new controls for devs, and best-in-class results on real coding tasks.
Coding and design with GPT-5
Learn how GPT-5 unlocks new possibilities in coding and design.
Creative writing with GPT-5
Learn how GPT-5 assists with creative writing.
Medical research with GPT-5
Learn how GPT-5 is used for medical research.
First look at GPT-5
See how a group of leading developers use GPT-5 for the first time.
How Amgen uses GPT-5
Learn how Amgen uses GPT-5.
Introducing GPT-5
We are introducing GPT‑5, our best AI system yet. GPT‑5 is a significant leap in intelligence over all our previous models, featuring state-of-the-art…
How Cursor uses GPT-5
Learn how Cursor uses GPT-5.
From hard refusals to safe-completions: toward output-centric safety training
Discover how OpenAI's new safe-completions approach in GPT-5 improves both safety and helpfulness in AI responses—moving beyond hard refusals to nuanc…
GPT-5 System Card
This GPT-5 system card explains how a unified model routing system powers fast and smart responses using gpt-5-main, gpt-5-thinking, and lightweight v…
Vision Language Model Alignment in TRL ⚡️
Providing ChatGPT to the Entire U.S. Federal Workforce
Today, OpenAI for Government is announcing a new partnership with the U.S. General Services Administration (GSA) to launch a transformative initiative…
Estimating worst case frontier risks of open weight LLMs
In this paper, we study the worst-case frontier risks of releasing gpt-oss. We introduce malicious fine-tuning (MFT), where we attempt to elicit maxim…
Open Weights and AI for All
AI’s next frontier isn’t just about capability—it’s about who gets to use it. Our mission to put AI in the hands of as many people as possible is what…
Introducing gpt-oss
We’re releasing gpt-oss-120b and gpt-oss-20b—two state-of-the-art open-weight language models that deliver strong real-world performance at low cost.…
gpt-oss-120b & gpt-oss-20b Model Card
We introduce gpt-oss-120b and gpt-oss-20b, two open-weight reasoning models available under the Apache 2.0 license and our gpt-oss usage policy.
Welcome GPT OSS, the new open-source model family from OpenAI!
Measuring Open-Source Llama Nemotron Models on DeepResearch Bench
What we’re optimizing ChatGPT for
We build ChatGPT to help you thrive in all the ways you want. Learn how we're improving support for tough moments, have rolled out reminders to take b…
📚 3LM: A Benchmark for Arabic LLMs in STEM and Code
Figma uses AI to transform digital design
Discover how Figma is transforming digital design with AI. David Kossnick shares how tools like Figma Make empower teams to prototype, collaborate, an…
Introducing Stargate Norway
We’re launching Stargate Norway—OpenAI’s first AI data center initiative in Europe under our OpenAI for Countries program. Stargate is OpenAI’s overar…
Implementing MCP Servers in Python: An AI Shopping Assistant with Gradio
Three lessons for creating a sustainable AI advantage
Discover how Intercom built a scalable AI platform with 3 key lessons—from evaluations to architecture—to lead the future of customer support.
Introducing study mode in ChatGPT
Introducing study mode in ChatGPT, a new learning experience that helps you work through problems step by step, guiding students with questions, scaff…
Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face
Say hello to `hf`: a faster, friendlier Hugging Face CLI ✨
Parquet Content-Defined Chunking
Resolving digital threats 100x faster with OpenAI
Discover how Outtake uses GPT-4.1 and OpenAI o3 to power AI agents that detect and resolve digital threats 100x faster than before.
Announcing OpenAI DevDay 2025
We’re hosting our third annual OpenAI DevDay on October 6, 2025 at Fort Mason in San Francisco.
Model ML is helping financial firms rebuild with AI from the ground up
As part of our Executive Function series, Model ML CEO Chaz Englander discusses how AI-native infrastructure and autonomous agents are transforming fi…
TimeScope: How Long Can Your Video Large Multimodal Model Go?
Fast LoRA inference for Flux with Diffusers and PEFT
Pioneering an AI clinical copilot with Penda Health
OpenAI and Penda Health debut an AI clinical copilot that cuts diagnostic errors by 16% in real-world use—offering a new path for safe, effective AI i…
OpenAI’s new economic analysis
Analysis provides insights into ChatGPT’s impact on the economy. OpenAI also launches new research collaboration to study AI’s broader effects on the…
Stargate advances with 4.5 GW partnership with Oracle
Oracle and OpenAI have entered an agreement to develop 4.5 gigawatts of additional Stargate data center capacity in the U.S. This investment will crea…
Accelerate a World of LLMs on Hugging Face with NVIDIA NIM
OpenAI and UK Government announce strategic partnership to deliver AI-driven growth
OpenAI partners with the UK Government to boost AI adoption, drive economic growth, and enhance public services for a thriving AI ecosystem in the UK.
AI as the greatest source of empowerment for all
I’ve always considered myself a pragmatic technologist—someone who loves technology not for its own sake, but for the direct impact it can have on peo…
A $50 million fund to build with communities
OpenAI is launching an initial $50 million fund that supports nonprofit and community organizations, informed by the independent OpenAI Nonprofit Comm…
Arc Virtual Cell Challenge: A Primer
ChatGPT agent System Card
ChatGPT agent System Card: OpenAI’s agentic model unites research, browser automation, and code tools with safeguards under the Preparedness Framework…
Introducing ChatGPT agent
Introducing ChatGPT agent: it thinks and acts, using tools to complete tasks like research, bookings, and slideshows—all with your guidance.
Invideo AI uses OpenAI models to create videos 10x faster
Invideo AI uses OpenAI’s GPT-4.1, gpt-image-1, and text-to-speech models to transform creative ideas into professional videos in minutes.
Statement from the OpenAI Board of Directors on the Nonprofit Commission Report
The Board of Directors thanks the members of the independent OpenAI Nonprofit Commission for their extensive work and engagement.
OpenAI nonprofit jam
At OpenAI, we build tools to help people solve hard problems—including nonprofits working on the frontlines of their communities. The OpenAI Academy i…
Consilium: When Multiple LLMs Collaborate
Back to The Future: Evaluating AI Agents on Predicting Future Events
Five Big Improvements to Gradio MCP Servers
Ettin Suite: SoTA Paired Encoders and Decoders
Intellectual freedom by design
ChatGPT is designed to be useful, trustworthy, and adaptable—so you can make it your own.
Migrating the Hub from Git LFS to Xet
The EU Code of Practice and future of AI in Europe
OpenAI joins the EU Code of Practice, advancing responsible AI while partnering with European governments to drive innovation, infrastructure, and eco…
Kimina-Prover: Applying Test-time RL Search on Large Formal Reasoning Models
Asynchronous Robot Inference: Decoupling Action Prediction and Execution
ScreenEnv: Deploy your full stack Desktop Agent
Building the Hugging Face MCP Server
Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders
Creating custom kernels for the AMD MI300
Upskill your LLMs With Gradio MCP Servers
Working with 400,000 teachers to shape the future of AI in schools
OpenAI partners with the American Federation of Teachers to launch a 5-year initiative equipping 400,000 K-12 educators to lead AI innovation in class…
SmolLM3: smol, multilingual, long-context reasoner
Three Mighty Alerts Supporting Hugging Face’s Production Infrastructure
Efficient MultiModal Data Pipeline
Announcing NeurIPS 2025 E2LM Competition: Early Training Evaluation of Language Models
No-code personal agents, powered by GPT-4.1 and Realtime API
Learn how Genspark built a $36M ARR AI product in 45 days—with no-code agents powered by GPT-4.1 and OpenAI Realtime API.
Training and Finetuning Sparse Embedding Models with Sentence Transformers
AI in Australia—OpenAI’s Economic Blueprint
Today, OpenAI, in partnership with Mandala Partners, is sharing the OpenAI AI Economic Blueprint for Australia. At a time when boosting productivity h…
Welcome the NVIDIA Llama Nemotron Nano VLM to Hugging Face Hub
Customizable, no-code voice agent automation with GPT-4o
Retell AI is transforming the call center with AI voice automation powered by GPT-4o and GPT-4.1. Its no-code platform enables businesses to launch na…
Gemma 3n fully available in the open-source ecosystem!
Driving scalable growth with OpenAI o3, GPT-4.1, and CUA
Unify, an AI-powered GTM platform, uses OpenAI’s o3, GPT-4.1, and CUA to automate prospecting, research, and outreach. With hyper-personalized messagi…
Transformers backend integration in SGLang
(LoRA) Fine-Tuning FLUX.1-dev on Consumer Hardware
Toward understanding and preventing misalignment generalization
We study how training on incorrect responses can cause broader misalignment in language models and identify an internal feature driving this behavior—…
Preparing for future AI risks in biology
Advanced AI can transform biology and medicine—but also raises biosecurity risks. We’re proactively assessing capabilities and implementing safeguards…
Introducing OpenAI for Government
We’re launching OpenAI for Government, a new initiative focused on bringing our most advanced AI tools to public servants across the United States. We…
Groq on Hugging Face Inference Providers 🔥
How Long Prompts Block Other Requests - Optimizing LLM Performance
Bringing the magic of AI to Mattel’s iconic brands
OpenAI and Mattel are partnering to integrate AI into iconic brands such as Barbie and Hot Wheels, aiming to enhance creative development, streamline…
Learn the Hugging Face Kernel Hub in 5 Minutes
Featherless AI on Hugging Face Inference Providers 🔥
Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm
Introducing Training Cluster as a Service - a new collaboration with NVIDIA
Scaling security with responsible disclosure
OpenAI introduces its Outbound Coordinated Disclosure Policy to guide how it responsibly reports vulnerabilities in third-party software—emphasizing i…
ScreenSuite - The most comprehensive evaluation suite for GUI Agents!
How we’re responding to The New York Times’ data demands in order to protect user privacy
OpenAI is fighting a court order at the demands of The New York Times and plaintiffs, which involves retention of consumer ChatGPT and API user data i…
Disrupting malicious uses of AI: June 2025
Our latest report featuring case studies of how we’re detecting and preventing malicious uses of AI.
KV Cache from scratch in nanoVLM
Real-Time AI Sound Generation on Arm: A Personal Tool for Creative Freedom
Holo1: New family of GUI automation VLMs powering GUI agent Surfer-H
SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data
No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL
Creating websites in minutes with AI Website Builder
Wix’s AI Website Builder, powered by OpenAI, lets anyone create a full website in minutes—just by describing their idea in a conversation.
CodeAgents + Structure: A Better Way to Execute Actions
🐯 Liger GRPO meets TRL
Addendum to OpenAI o3 and o4-mini system card: OpenAI o3 Operator
We are replacing the existing GPT-4o-based model for Operator with a version based on OpenAI o3. The API version will remain based on 4o.
Dell Enterprise Hub is all you need to build AI on premises
Tiny Agents in Python: a MCP-powered agent in ~70 lines of code
OpenAI Deutschland
Shipping code faster with o3, o4-mini, and GPT-4.1
CodeRabbit uses OpenAI models to revolutionize code reviews—boosting accuracy, accelerating PR merges, and helping developers ship faster with fewer b…
Introducing Stargate UAE
We’re launching Stargate UAE – the first international deployment of Stargate, OpenAI’s AI infrastructure platform.
New tools and features in the Responses API
Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance
Falcon-Arabic: A Breakthrough in Arabic Language Models
Exploring Quantization Backends in Diffusers
nanoVLM: The simplest repository to train your VLM in pure PyTorch
Microsoft and Hugging Face expand collaboration
Introducing Codex
Addendum to o3 and o4-mini system card: Codex
Codex is a cloud-based coding agent. Codex is powered by codex-1, a version of OpenAI o3 optimized for software engineering. codex-1 was trained using…
Falcon-Edge: A series of powerful, universal, fine-tunable 1.58bit language models.
The Transformers Library: standardizing model definitions
AI powers Expedia’s marketing evolution
A conversation with Jochen Koedijk, Chief Marketing Officer of Expedia Group.
Improving Hugging Face Model Access for Kaggle Users
Blazingly fast whisper transcriptions with Inference Endpoints
Introducing HealthBench
HealthBench is a new evaluation benchmark for AI in healthcare which evaluates models in realistic scenarios. Built with input from 250+ physicians, i…
Vision Language Models (Better, faster, stronger)
LeRobot Community Datasets: The “ImageNet” of Robotics — When and How?
OpenAI Expands Leadership with Fidji Simo
Read the message Sam shared with the company earlier today.
OpenAI’s response to the Department of Energy on AI infrastructure
Why infrastructure is destiny and how the US can seize it.
Introducing data residency in Asia
Data residency builds on OpenAI’s enterprise-grade data privacy, security, and compliance programs supporting customers worldwide.
The San Antonio Spurs use ChatGPT to scale impact on and off the court
Discover how the San Antonio Spurs are using custom GPTs to enhance fan engagement, streamline operations, and drive innovation across teams.
Lowe’s puts project expertise into every hand
Lowe’s partnered with OpenAI to build Mylow and Mylow Companion, AI-powered tools that bring expert help to both customers and store associates—making…
Introducing OpenAI for Countries
A new initiative to support countries around the world that want to build on democratic AI rails.
Introducing AI stories: daily benefits shine a light on bigger opportunities
Sam Altman has written that we are entering the Intelligence Age, a time when AI will help people become dramatically more capable. The biggest proble…
AI helps John Deere transform agriculture
John Deere’s Justin Rose talks about transforming agriculture with AI and shares how the company is scaling innovation to help farmers work smarter, m…
Evolving OpenAI’s structure
An update from the OpenAI board on transitioning its for-profit entity to a Public Benefit Corporation, reinforcing its mission-driven structure under…
Lowe’s leverages AI to power home improvement retail
A conversation with Chandhu Nair, Senior Vice President of Data, AI, and Innovation.
Expanding on what we missed with sycophancy
A deeper dive on our findings, what went wrong, and future changes we’re making.
How to Build an MCP Server with Gradio
The 4 Things Qwen-3’s Chat Template Teaches Us
Sycophancy in GPT-4o: what happened and what we’re doing about it
We have rolled back last week’s GPT‑4o update in ChatGPT so people are now using an earlier version with more balanced behavior. The update we removed…
Welcoming Llama Guard 4 on Hugging Face Hub
Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs
PipelineRL
Tiny Agents: an MCP-powered agent in 50 lines of code
New in ChatGPT for Business: April 2025
Watch hands-on demos of the latest in ChatGPT for Business: o3, image generation, enhanced memory, and internal knowledge.
Introducing our latest image generation model in the API
Our latest image generation model is now available in the API via ‘gpt-image-1’—enabling developers and businesses to build professional-grade, custom…
Finetuning olmOCR to be a faithful OCR-Engine
Speak is personalizing language learning with AI
A conversation with Connor Zwick, CEO & Co-founder of Speak.
The Washington Post partners with OpenAI on search content
The Washington Post is partnering with OpenAI to integrate news into ChatGPT, providing users with summaries, quotes, and direct links to origina…
Prefill and Decode for Concurrent Requests - Optimizing LLM Performance
Thinking with images
Introducing OpenAI o3 and o4-mini
Our smartest and most capable models to date with full tool access
OpenAI o3 and o4-mini System Card
OpenAI o3 and OpenAI o4-mini combine state-of-the-art reasoning with full tool capabilities—web browsing, Python, image and file analysis, image gener…
17 Reasons Why Gradio Isn't Just Another UI Library
Cohere on Hugging Face Inference Providers 🔥
Introducing HELMET: Holistically Evaluating Long-context Language Models
OpenAI announces nonprofit commission advisors
OpenAI is appointing four new advisors to help inform OpenAI’s philanthropic efforts.
Our updated Preparedness Framework
Sharing our updated framework for measuring and protecting against severe harm from frontier AI capabilities.
Introducing GPT-4.1 in the API
Introducing GPT-4.1 in the API—a new family of models with across-the-board improvements, including major gains in coding, instruction following, and…
Hugging Face to sell open-source robots thanks to Pollen Robotics acquisition 🤖
4M Models Scanned: Protect AI + Hugging Face 6 Months In
Visual Salamandra: Pushing the Boundaries of Multimodal Understanding
BrowseComp: a benchmark for browsing agents
OpenAI Pioneers Program
Hugging Face and Cloudflare Partner to Make Real-Time Speech and Video Seamless with FastRTC
Arabic Leaderboards: Introducing Arabic Instruction Following, Updating AraGen, and More
Canva enables creativity with AI
A conversation with Cameron Adams, Chief Product Officer and Co-founder of Canva.
OpenAI’s EU Economic Blueprint
Today, OpenAI is sharing the EU Economic Blueprint—a set of proposals to help Europe seize the promise of artificial intelligence, drive sustainable e…
Welcome Llama 4 Maverick & Scout on Hugging Face
Journey to 1 Million Gradio Users!
The NLP Course is becoming the LLM Course
Efficient Request Queueing – Optimizing LLM Performance
New commission to provide insight as OpenAI builds the world’s best-equipped nonprofit
Already a nonprofit, and already using AI to help people solve hard problems, OpenAI aims to build the best-equipped nonprofit the world has ever seen…
PaperBench: Evaluating AI’s Ability to Replicate AI Research
We introduce PaperBench, a benchmark evaluating the ability of AI agents to replicate state-of-the-art AI research.
Our response to the UK’s copyright consultation
Recommendations for pro-innovation policies that can help make the UK the AI capital of Europe.
New funding to build towards AGI
Today we’re announcing new funding—$40B at a $300B post-money valuation, which enables us to push the frontiers of AI research even further, scale our…
How Hugging Face Scaled Secrets Management for AI Infrastructure
🚀 Accelerating LLM Inference with TGI on Intel Gaudi
Moving from intent-based bots to proactive AI agents
Open R1: Update #4
Security on the path to AGI
At OpenAI, we proactively adapt, including by building comprehensive security measures directly into our infrastructure and models.
Training and Finetuning Reranker Models with Sentence Transformers
Introducing 4o Image Generation
At OpenAI, we have long believed image generation should be a primary capability of our language models. That’s why we’ve built our most advanced imag…
Addendum to GPT-4o System Card: 4o image generation
4o image generation is a new, significantly more capable image generation approach than our earlier DALL·E 3 series of models. It can create photoreal…
Automating 90% of finance and legal work with agents
Hebbia’s deep research automates 90% of finance and legal work, powered by OpenAI
Leadership updates
OpenAI has grown a lot. We remain focused on the same core—pursuing frontier AI research that accelerates human progress–but we now also deliver produ…
Introducing Gradio's new Dataframe!
Early methods for studying affective use and emotional well-being on ChatGPT
An OpenAI and MIT Media Lab Research collaboration.
The New and Fresh analytics in Inference Endpoints
Personalizing travel at scale with OpenAI
By integrating its data systems with OpenAI’s LLMs, Booking.com delivers smarter search, faster support, and intent-driven travel experiences.
Introducing next-generation audio models in the API
For the first time, developers can also instruct the text-to-speech model to speak in a specific way—for example, “talk like a sympathetic customer se…
Open R1: How to use OlympicCoder locally for coding
AI Policy @🤗: Response to the White House AI Action Plan RFI
EliseAI improves housing and healthcare efficiency with AI
A conversation with Minna Song, CEO & Co-founder of EliseAI.
New in ChatGPT for Business: March 2025
Join us as we share our latest releases and how ChatGPT is becoming more interactive, customized to the way your teams work, and agentic.
NVIDIA's GTC 2025 Announcement for Physical AI Developers: New Open Models and Datasets
Xet is on the Hub
The court rejects Elon’s latest attempt to slow OpenAI down
We welcome the court’s March 4, 2025, decision rejecting Elon Musk’s latest attempt to slow down OpenAI for his personal benefit.
Driving growth and ‘WOW’ moments with OpenAI
LY Corporation: Driving growth and ‘WOW’ moments with OpenAI
Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM
Open R1: Update #3
New tools for building agents
LeRobot goes to driving school: World’s largest open-source self-driving dataset
Detecting misbehavior in frontier reasoning models
Frontier reasoning models exploit loopholes when given the chance. We show we can detect exploits using an LLM to monitor their chains-of-thought. Pen…
Nubank elevates customer experiences with OpenAI
LLM Inference on Edge: A Fun and Easy Guide to run LLMs via React Native on your Phone!
Accelerating engineering cycles 20% with OpenAI
LaunchDarkly's approach to AI-powered product management
A conversation with Claire Vo, Chief Product Officer of LaunchDarkly, about the changing role of product managers, her anti-to-do list, and building A…
Introducing NextGenAI
OpenAI commits $50M in funding and tools to leading institutions.
Hugging Face and JFrog partner to make AI Security more transparent
A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality
1,000 Scientist AI Jam Session
OpenAI and nine national labs bring together leading scientists for first-of-its-kind event.
Trace & Evaluate your Agent with Arize Phoenix
Supporting sellers with enhanced product listings
Mercari leverages GPT-4o mini and GPT-4 to streamline selling, enhance product listings, and boost sales, transforming the online marketplace with fea…
OpenAI GPT-4.5 System Card
We’re releasing a research preview of OpenAI GPT‑4.5, our largest and most knowledgeable model yet.
Building an autonomous financial analyst with o1 and o3-mini
Endex builds the future of financial analysis, powered by OpenAI’s reasoning models.
HuggingFace, IISc partner to supercharge model building on India's diverse languages
Deep research System Card
This report outlines the safety work carried out prior to releasing deep research including external red teaming, frontier risk evaluations according…
FastRTC: The Real-Time Communication Library for Python
Remote VAEs for decoding with Inference Endpoints 🤗
Disrupting malicious uses of AI
SigLIP 2: A better multilingual vision language encoder
Uber enables outstanding on-demand experiences with AI
A conversation with Jai Malkani, Head of AI and Product, Customer Obsession at Uber.
SmolVLM2: Bringing Video Understanding to Every Device
PaliGemma 2 Mix - New Instruction Vision Language Models by Google
Introducing the SWE-Lancer benchmark
Can frontier LLMs earn $1 million from real-world freelance software engineering?
Introducing Three New Serverless Inference Providers: Hyperbolic, Nebius AI Studio, and Novita 🔥
OpenAI and Guardian Media Group launch content partnership
OpenAI and Guardian Media Group announce content partnership to bring Guardian news content to ChatGPT.
Welcome Fireworks.ai on the Hub 🎆
Fixing Open LLM Leaderboard with Math-Verify
Fanatics Betting and Gaming uses AI to focus on the big picture
A conversation with Andrea Ellis, Chief Financial Officer of Fanatics Betting and Gaming.
Wayfair is shaping the future of retail with AI
A conversation with Fiona Tan, Chief Technology Officer of Wayfair.
Using OpenAI o1 for financial analysis
Rogo scales AI-driven financial research with OpenAI o1
1 Billion Classifications
Sharing the latest Model Spec
From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub
Build awesome datasets for video generation
Open R1: Update #2
OpenAI partners with Schibsted Media Group
OpenAI and Schibsted Media Group announce content partnership to bring Schibsted news and archive content to ChatGPT.
The Open Arabic LLM Leaderboard 2
Introducing the Intelligence Age
We aired our first-ever television ad during the Super Bowl to pique people’s curiosity and help us all realize how AI can open up new possibilities f…
Introducing data residency in Europe
Data residency builds on OpenAI’s enterprise-grade data privacy, security, and compliance programs supporting customers worldwide.
OpenAI and the CSU system bring AI to 500,000 students & faculty
The largest deployment of ChatGPT to date will expand the use of AI in education and help the United States build an AI-ready workforce.
Building a custom math tutor powered by ChatGPT
ChatGPT and personal tutoring
Catching halibut with ChatGPT
Using ChatGPT to catch halibut
Creating nail art with ChatGPT
Using ChatGPT to find inspiration for nail art
Open-source DeepResearch – Freeing our search agents
π0 and π0-FAST: Vision-Language-Action Models for General Robot Control
DABStep: Data Agent Benchmark for Multi-step Reasoning
Understanding complex trends with deep research
How OpenAI deep research helps Bain & Company understand complex industry trends.
Introducing deep research
An agent that uses reasoning to synthesize large amounts of online information and complete multi-step research tasks for you. Available to Pro users…
Open-R1: Update #1
OpenAI o3-mini System Card
This report outlines the safety work carried out for the OpenAI o3-mini model, including safety evaluations, external red teaming, and Preparedness Fr…
OpenAI o3-mini
Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial
The AI tools for Art Newsletter - Issue 1
Strengthening America’s AI leadership with the U.S. National Laboratories
OpenAI’s latest line of reasoning models will be used by nation’s leading scientists to drive scientific breakthroughs.
How to deploy and fine-tune DeepSeek models on AWS
Welcome to Inference Providers on the Hub 🔥
Open-R1: a fully open reproduction of DeepSeek-R1
State of open video generation models in Diffusers
We now support VLMs in smolagents!
Introducing Operator
Computer-Using Agent
Operator System Card
Drawing from OpenAI’s established safety frameworks, this document highlights our multi-layered approach, including model and product mitigations we’v…
Mastering Long Contexts in LLMs with KVPress
SmolVLM Grows Smaller – Introducing the 256M & 500M Models!
Bertelsmann powers creativity and productivity with OpenAI
Bertelsmann, the global media, services, and education company headquartered in Germany, will integrate OpenAI’s technology across multiple brands aro…
Trading inference-time compute for adversarial robustness
Hugging Face and FriendliAI partner to supercharge model deployment on the Hub
Stargate Infrastructure
OpenAI, and our strategic partners, are thrilled about our shared vision for the Infrastructure of AGI. We are energized by the challenges we face and…
Announcing The Stargate Project
Yay! Organizations can now publish blog Articles
Timm ❤️ Transformers: Use any timm model with transformers
Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference
Partnering with Axios expands OpenAI’s work with the news industry
Publishers representing hundreds of newsrooms and content brands are using OpenAI partnerships and grant programs to adopt AI tools and strengthen the…
Train 400x faster Static Embedding Models with Sentence Transformers
Adebayo Ogunlesi joins OpenAI’s Board of Directors
AI Agents Are Here. What Now?
Visual Document Retrieval Goes Multilingual
CO₂ Emissions and Models Performance: Insights from the Open LLM Leaderboard
Introducing smolagents: simple agents that write actions in code.
Why OpenAI’s structure must evolve to advance our mission
A stronger non-profit supported by the for-profit’s success.
Visualize and understand GPU memory in PyTorch
Controlling Language Model Generation with NVIDIA's LogitsProcessorZoo
Deliberative alignment: reasoning enables safer language models
Introducing our new alignment strategy for o1 models, which are directly taught safety…
Evaluating Audio Reasoning with Big Bench Audio
Finally, a Replacement for BERT: Introducing ModernBERT
Bamba: Inference-Efficient Hybrid Mamba2 Model
OpenAI o1 and new tools for developers
Introducing OpenAI o1, Realtime API improvements, a new fine-tuning method and more for developers.
Welcome to the Falcon 3 Family of Open Models!
Benchmarking Language Model Performance on 5th Gen Xeon at GCP
Introducing the Synthetic Data Generator - Build Datasets with Natural Language
Elon Musk wanted an OpenAI for-profit
Elon Musk’s latest legal filing against OpenAI marks his fourth attempt in less than a year to reframe his claims. However, his own words and actions…
LeMaterial: an open source initiative to accelerate materials discovery and research
Sora is here
Our video generation model, Sora, is now available to use at sora.com. Users can generate videos up to 1080p resolution, up to 20 sec long, and in wid…
Minne Atairu & Sora
Interdisciplinary artist Minne Atairu discusses how Sora helps realize her vision.
Vallée Duhamel & Sora
Filmmaking duo Vallée Duhamel explains how Sora helps build new worlds.
Sora System Card
Sora is OpenAI’s video generation model, designed to take text, image, and video inputs and generate a new video as an output. Sora builds on learning…
Put AI to work for your product team
Animator Lyndon Barrois creates new worlds with Sora
Filmmaker Lyndon Barrois describes how to use Sora as a storytelling tool.
Hugging Face models in Amazon Bedrock
Open Preference Dataset for Text-to-Image Generation by the 🤗 Community
Introducing ChatGPT Pro
Broadening usage of frontier AI
OpenAI o1 System Card
This report outlines the safety work carried out prior to releasing OpenAI o1 and o1-mini, including external red teaming and frontier risk evaluation…
Welcome PaliGemma 2 – New vision language models by Google
How good are LLMs at fixing their mistakes? A chatbot arena experiment with Keras and TPUs
OpenAI and Future partner on specialist content
OpenAI and Future, the global platform for specialist media, have today announced a strategic partnership to bring content from Future’s 200 plus medi…
Shaping the future of financial services
Morgan Stanley uses AI evals to shape the future of financial services
Rethinking LLM Evaluation with 3C3H: AraGen Benchmark and Leaderboard
Investing in Performance: Fine-tune small models with LLM insights - a CFM case study
Open Source Developers Guide to the EU AI Act
Rearchitecting Hugging Face Uploads and Downloads
SmolVLM - small yet mighty Vision Language Model
You could have designed state of the art positional encoding
Advancing red teaming with people and AI
Building smarter maps with GPT-4o vision fine-tuning
Letting Large Models Debate: The First Multilingual LLM Debate Competition
From Files to Chunks: Improving HF Storage Efficiency
Faster Text Generation with Self-Speculative Decoding
Introducing the Open Leaderboard for Japanese LLMs!
Rox goes “all in” on OpenAI
By combining commercial experience and deep LLM expertise with OpenAI’s models, Rox makes every seller a top 1% seller.
Judge Arena: Benchmarking LLMs as Evaluators
OpenAI en France
Our first office in continental Europe
Data-driven beauty and creativity with ChatGPT
Data-driven beauty: How The Estée Lauder Companies unlocks insights with ChatGPT
Share your open ML datasets on Hugging Face Hub!
Hugging Face + PyCharm
Argilla 2.4: Easily Build Fine-Tuning and Evaluation Datasets on the Hub — No Code Required
Introducing ChatGPT search
Get fast, timely answers with links to relevant web sources
Promega’s top-down adoption of ChatGPT accelerates manufacturing, sales, and marketing
Introducing SimpleQA
A factuality benchmark called SimpleQA that measures the ability for language models to answer short, fact-seeking questions.
Delivering high-performance customer support
Decagon and OpenAI deliver high-performance, fully automated customer support at scale
Universal Assisted Generation: Faster Decoding with Any Assistant Model
Expert Support case study: Bolstering a RAG app with LLM-as-a-Judge
A Deepdive into Aya Expanse: Advancing the Frontier of Multilinguality
Simplifying, stabilizing, and scaling continuous-time consistency models
We’ve simplified, stabilized, and scaled continuous-time consistency models, achieving comparable sample quality to leading diffusion models, while us…
Introducing SynthID Text
Introducing HUGS - Scale your AI with Open Models
CinePile 2.0 - making stronger datasets with adversarial refinement
OpenAI and the Lenfest Institute AI Collaborative and Fellowship program
Hugging Face Teams Up with Protect AI: Enhancing Model Security for the ML Community
Transformers.js v3: WebGPU Support, New Models & Tasks, and More…
Diffusers welcomes Stable Diffusion 3.5 Large
Releasing Outlines-core 0.1.0: structured generation in Rust and Python
Deploying Speech-to-Speech on Hugging Face
Llama 3.2 in Keras
Fixing Gradient Accumulation
Evaluating fairness in ChatGPT
We've analyzed how ChatGPT responds to users based on their name, using AI research assistants to protect privacy.
MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering
We introduce MLE-bench, a benchmark for measuring how well AI agents perform at machine learning engineering.
Introducing the AMD 5th Gen EPYC™ CPU
A Security Review of Gradio 5
Welcome, Gradio 5
Scaling AI-based Data Processing with Hugging Face + Dask
OpenAI and Hearst Content Partnership
Hearst’s iconic brands bring curated lifestyle and local news content to OpenAI’s products.
Faster Assisted Generation with Dynamic Speculation
Improving Parquet Dedupe on Hugging Face Hub
Introducing the Open FinLLM Leaderboard
Introducing canvas, a new way to write and code with ChatGPT.
Introducing canvas
New Credit Facility Enhances Financial Flexibility
In addition to securing $6.6 billion in new funding from leading investors, we have established a new $4 billion credit facility with leading banks, i…
A Short Summary of Chinese AI Global Expansion
New funding to scale the benefits of AI
We are making progress on our mission to ensure that artificial general intelligence benefits all of humanity.
Introducing the Realtime API
Developers can now build fast speech-to-speech experiences into their applications
Introducing vision to the fine-tuning API
Developers can now fine-tune GPT-4o with images and text to improve vision capabilities
Prompt Caching in the API
Offering automatic discounts on inputs that the model has recently seen
Model Distillation in the API
Fine-tune a cost-efficient model with the outputs of a large frontier model–all on the OpenAI platform
Creating agent and human collaboration with GPT-4o
Altera uses GPT-4o to build a new area of human collaboration
🇨🇿 BenCzechMark - Can your LLM Understand Czech?
Converting Vertex-Colored Meshes to Textured Meshes
Upgrading the Moderation API with our new multimodal moderation model
We’re introducing a new model built on GPT-4o that is more accurate at detecting harmful text and images, enabling developers to build more robust mod…
Minnesota’s Enterprise Translation Office uses ChatGPT to bridge language gaps
OpenAI and GEDI partner for Italian news content
OpenAI and GEDI announce strategic partnership to bring Italian-language news content to ChatGPT.
Llama can now see and run on your device - welcome Llama 3.2
Introducing Verdi, an AI dev platform powered by GPT-4o
Mercado Libre introduces Verdi, an AI developer platform powered by GPT-4o
FineVideo: behind the scenes
Exploring the Daily Papers Page on Hugging Face
Optimize and deploy with Optimum-Intel and OpenVINO GenAI
Genmab launches “AI Everywhere”
Genmab embraces ChatGPT Enterprise, supported by OpenAI’s commitment to security and privacy
Fine-tuning LLMs to 1.58bit: extreme quantization made easy
Using GPT-4 to improve teaching and learning in Brazil
Improving teaching and learning in Brazil
Introducing the SQL Console on Datasets
An update on our safety & security practices
Introducing Community Tools on HuggingChat
Accelerate 1.0.0
Introducing OpenAI o1
Learning to reason with LLMs
OpenAI o1-mini
Advancing cost-efficient reasoning
OpenAI o1 Contributions
Coding with OpenAI o1
Scott Wu, CEO and Co-Founder of Cognition, explains how OpenAI o1 makes coding decisions in a more human-like way.
Answering quantum physics questions with OpenAI o1
Quantum physicist Mario Krenn uses OpenAI o1 to help answer life's biggest questions.
Economics and reasoning with OpenAI o1
Economist Tyler Cowen explains how OpenAI o1 tackles complex economic questions.
Decoding genetics with OpenAI o1
Geneticist Catherine Brownstein demonstrates how OpenAI o1 can speed up the process of diagnosing rare medical challenges.
Using GPT-4 to deliver a new customer service standard
Ada uses GPT-4 to deliver a new customer service standard
Hugging Face partners with TruffleHog to Scan for Secrets
Scaling robotics datasets with video encoding
Personalizing education with ChatGPT
Arizona State University embraces ChatGPT campus-wide to personalize learning, advance research, and prepare students for the future
The 5 Most Under-Rated Tools on Hugging Face
Improving Hugging Face Training Efficiency Through Packing with Flash Attention 2
OpenAI partners with Condé Nast
Condé Nast
Fine-tuning now available for GPT-4o
Putting AI to work at Upwork
Upwork puts AI to work, uniting team members, operations and product development
Deploy Meta Llama 3.1 405B on Google Cloud Vertex AI
Disrupting a covert Iranian influence operation
Delivering contextual job matching for millions with OpenAI
Indeed, whose mission is to help people get jobs, is the world’s #1 job site. Over 350 million unique visitors come to Indeed every month to connect w…
Awakening Sleeping Beauties at The Met
AI can enrich lives through beauty and creativity, and its artistic potential shines in "Sleeping Beauties: Reawakening Fashion," a collaborative exhi…
A failed experiment: Infini-Attention, and why we should keep trying?
Introducing SWE-bench Verified
We’re releasing a human-validated subset of SWE-bench that more reliably evaluates AI models’ ability to solve real-world software issues.
Introduction to ggml
Welcome Falcon Mamba: The first strong attention-free 7B model
Tool Use, Unified
Zico Kolter Joins OpenAI’s Board of Directors
We’re strengthening our governance with expertise in AI safety and alignment. Zico will also join the Sa…
GPT-4o System Card
XetHub is joining Hugging Face!
Pairing data with APIs to unlock customer value
Rakuten Pairs Data with AI to Unlock Customer Insights and Value
Introducing Structured Outputs in the API
We are introducing Structured Outputs in the API—model outputs now reliably adhere to developer-supplied JSON Schemas.
2024 Security Feature Highlights
Introducing TextImage Augmentation for Document Images
Google releases Gemma 2 2B, ShieldGemma and Gemma Scope
Memory-efficient Diffusion Transformers with Quanto and Diffusers
Serverless Inference with Hugging Face and NVIDIA NIM
SearchGPT is a prototype of new AI search features
We’re testing SearchGPT, a temporary prototype of new search features that give you fast and timely answers with clear and relevant sources.
LAVE: Zero-shot VQA Evaluation on Docmatix with LLMs - Do We Still Need Fine-Tuning?
Improving Model Safety Behavior with Rule-Based Rewards
We've developed and applied a new method leveraging Rule-Based Rewards (RBRs) that aligns models to behave safely without extensive human data collect…
Llama 3.1 - 405B, 70B & 8B with multilinguality and long context
WWDC 24: Running Mistral 7B with Core ML
GPT-4o mini: advancing cost-efficient intelligence
New compliance and administrative tools for ChatGPT Enterprise
Compliance API integrations, SCIM, and GPT controls to support compliance programs, data security, and user access at scale
Docmatix - a huge dataset for Document Visual Question Answering
TGI Multi-LoRA: Deploy Once, Serve 30 Models
Prover-Verifier Games improve legibility of language model outputs
Discover how prover-verifier games improve the legibility of language model outputs, making AI solutions clearer, easier to verify, and more trustwort…
SmolLM - blazingly fast and remarkably powerful
How we leveraged distilabel to create an Argilla 2.0 Chatbot
How NuminaMath Won the 1st AIMO Progress Prize
OpenAI and Los Alamos National Laboratory announce research partnership
OpenAI and Los Alamos National Laboratory are working to develop safety evaluations to assess and measure biological capabilities and risks associated…
Announcing New Hugging Face and KerasHub integration
Experimenting with Automatic PII Detection on the Hub using Presidio
Preference Optimization for Vision Language Models
Google Cloud TPUs made available to Hugging Face users
Banque des Territoires (CDC Group) x Polyconseil x Hugging Face: Enhancing a Major French Environmental Program with a Sovereign Data Solution
Announcing New Dataset Search Features
Accelerating Protein Language Model ProtST on Intel Gaudi 2
Our Transformers Code Agent beats the GAIA benchmark 🏅
Finding GPT-4’s mistakes with GPT-4
CriticGPT, a model based on GPT-4, writes critiques of ChatGPT responses to help human trainers spot mistakes during RLHF
Strategic Content Partnership with TIME
We’re partnering with TIME and its 101 years of archival content to enhance responses and provide links to stories on Time.com
Welcome Gemma 2 - Google’s new open LLM
XLSCOUT Unveils ParaEmbed 2.0: a Powerful Embedding Model Tailored for Patents and IP with Expert Support from Hugging Face
Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models
Ethics and Society Newsletter #6: Building Better AI: The Importance of Data Quality
OpenAI acquires Rockset
Empowering defenders through our Cybersecurity Grant Program
Highlighting innovative research and AI integration in cybersecurity
A Holistic Approach to Undesired Content Detection in the Real World
We present a holistic approach to building a robust and useful natural language classification system for real-world content moderation.
Consistency Models
Diffusion models have significantly advanced the fields of image, audio, and video generation, but they depend on an iterative sampling process that c…
Improved Techniques for Training Consistency Models
Consistency models are a nascent family of generative models that can sample high quality data in one step without the need for adversarial training.
Data Is Better Together: A Look Back and Forward
Going multimodal: How Prezi is leveraging the Hub and the Expert Support Program to accelerate their ML roadmap
Surging developer productivity with custom GPTs
Paf adopted ChatGPT Enterprise across its entire company, with engineers using custom GPTs on a daily basis to speed up routine development tasks. Paf…
Achieving 10x growth with agentic sales prospecting
BigCodeBench: The Next Generation of HumanEval
Using GPT-4o reasoning to transform cancer care
Color Health is working with OpenAI to pioneer a new way of accelerating cancer patients’ access to treatment. Their new Cancer Copilot application us…
OpenAI appoints Retired U.S. Army General Paul M. Nakasone to Board of Directors
Nakasone brings cybersecurity experience to growing Board of Directors; will join the Board’s Safety and Security Committee
From DeepSpeed to FSDP and Back Again with Hugging Face Accelerate
Diffusers welcomes Stable Diffusion 3
Putting RL back in RLHF
OpenAI and Apple announce partnership
OpenAI and Apple announce partnership to integrate ChatGPT into Apple experiences.
OpenAI welcomes Sarah Friar (CFO) and Kevin Weil (CPO)
Expanding on how Voice Engine works and our safety research
Exploring the technology behind our text-to-speech model.
Making sense of this mess
Introducing the Hugging Face Embedding Container for Amazon SageMaker
Improving India’s critical care infrastructure
Extracting Concepts from GPT-4
Using new techniques for scaling sparse autoencoders, we automatically identified 16 million patterns in GPT-4's computations.
Launching the Artificial Analysis Text to Image Leaderboard & Arena
Introducing NPC-Playground, a 3D playground to interact with LLM-powered NPCs
Faster assisted generation support for Intel Gaudi
Space secrets security update
Disrupting deceptive uses of AI by covert influence operations
We’ve terminated accounts linked to covert influence operations; no significant audience increase due to our services.
OpenAI for Education
An affordable offering for universities to responsibly bring AI to campus.
Introducing OpenAI for Nonprofits
We’re launching a new initiative to enhance the accessibility of our tools for nonprofit organizations, including discounted rates for ChatGPT Team an…
Automating customer support agents
MavenAGI is a new software company for the AI era. They recently launched an AI customer service agent, built on the flexibility of GPT-4, which a num…
The Newsroom AI Catalyst: a global program with WAN-IFRA
Enhancing news in ChatGPT with The Atlantic
The Atlantic is announcing a strategic content and product partnership with OpenAI, which positions The Atlantic as a premium news source within OpenA…
A Content and Product Partnership with Vox Media
In a multi-faceted agreement, Vox Media’s content will enhance the output of OpenAI’s ChatGPT, and the company will build on OpenAI’s technology to de…
Benchmarking Text Generation Inference
OpenAI Board Forms Safety and Security Committee
Training and Finetuning Embedding Models with Sentence Transformers
Falcon 2: An 11B parameter pretrained language model and VLM, trained on over 5000B tokens and 11 languages
CyberSecEval 2 - A Comprehensive Evaluation Framework for Cybersecurity Risks and Capabilities of Large Language Models
A landmark multi-year global partnership with News Corp
Companies Join Forces to Enrich OpenAI’s Generative AI Products and Platforms with Premium Journalism
Deploy models on AWS Inferentia2 from Hugging Face
OpenAI safety practices
Artificial general intelligence has the potential to benefit nearly every aspect of our lives—so it must be developed and deployed responsibly.
Introducing Spaces Dev Mode for a seamless developer experience
Build AI on premise with Dell Enterprise Hub
Hugging Face on AMD Instinct MI300 GPU
From cloud to developers: Hugging Face and Microsoft Deepen Collaboration
How the voices for ChatGPT were chosen
We worked with industry-leading casting and directing professionals to narrow down over 400 submissions before…
Improvements to data analysis in ChatGPT
Interact with tables and charts and add files directly from Google Drive and Microsoft OneDrive.
OpenAI and Reddit Partnership
We’re bringing Reddit’s unique content to ChatGPT and our products.
Creating an AI-powered Magic Studio
Canva is a visual communication platform, enjoyed by more than 175 million people monthly to make presentations, videos, documents, websites, social m…
Unlocking Longer Generation with Key-Value Cache Quantization
Ilya Sutskever to leave OpenAI, Jakub Pachocki announced as Chief Scientist
PaliGemma – Google's Cutting-Edge Open Vision Language Model
Hugging Face x LangChain : A new partner package
Introducing the Open Arabic LLM Leaderboard
Hello GPT-4o
We’re announcing GPT-4 Omni, our new flagship model which can reason across audio, vision, and text in real time.
Spring Update
Introducing GPT-4o and making more capabilities available for free in ChatGPT.
Introducing GPT-4o and more tools to ChatGPT free users
We are launching our newest flagship model and making more capabilities available for free in…
License to Call: Introducing Transformers Agents 2.0
Subscribe to Enterprise Hub with your AWS Account
Building Cost-Efficient Enterprise RAG applications with Intel Gaudi 2 and Intel Xeon
Introducing the Model Spec
Understanding the source of what we see and hear online
Today we’re introducing new technology to help researchers identify content created by our tools and joining the Coalition for Content Provenance and…
Our approach to data and AI
Just over a year after launching ChatGPT, AI is changing how we live, work and learn. It’s also raised important conversations about data in the age o…
API Partnership with Stack Overflow
Stack Overflow and OpenAI today announced a new API partnership that will empower developers with the collective…
Introducing the Open Leaderboard for Hebrew LLMs!
Bringing the Artificial Analysis LLM Performance Leaderboard to Hugging Face
Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints
Improving Prompt Consistency with Structured Generations
We’re bringing the Financial Times’ world-class journalism to ChatGPT
We will also collaborate on new AI experiences for FT readers.
StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation
Accelerating the development of life-saving treatments
Introducing ChatGPT and Whisper APIs
GPT-4 API general availability and deprecation of older models in the Completions API
GPT-3.5 Turbo, DALL·E and Whisper APIs are also generally available, and we are releasing a deprecation plan for older models of the Completions API,…
Introducing more enterprise-grade features for API customers
Increasing enterprise support with more security features and controls, updates to our Assistants API, and tools to better manage costs.
OpenAI’s commitment to child safety: adopting safety by design principles
Introducing the Open Chain of Thought Leaderboard
Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent
The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions
Today's LLMs are susceptible to prompt injections, jailbreaks, and other attacks that allow adversaries to overwrite a model's original instructions w…
The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare
Welcome Llama 3 - Meta's new open LLM
AI Apps in a Flash with Gradio's Reload Mode
Introducing the LiveCodeBench Leaderboard - Holistic and Contamination-Free Evaluation of Code LLMs
Running Privacy-Preserving Inferences on Hugging Face Endpoints
Ryght’s Journey to Empower Healthcare and Life Sciences with Expert Support from Hugging Face
Introducing Idefics2: A Powerful 8B Vision-Language Model for the community
Introducing OpenAI Japan
We are excited to announce our first office in Asia and we’re releasing a GPT-4 custom model optimized for the Japanese language.
Vision Language Models Explained
Making thousands of open LLMs bloom in the Vertex AI Model Garden
CodeGemma - an official Google release for code LLMs
Public Policy at Hugging Face
Klarna's AI assistant does the work of 700 full-time agents
Klarna is using AI to revolutionize personal shopping, customer service, and employee productivity.
Introducing improvements to the fine-tuning API and expanding our custom models program
We’re adding new features to help developers have more control over fine-tuning and announcing new ways to build custom models with OpenAI.
Hugging Face partners with Wiz Research to Improve AI Security
Text2SQL using Hugging Face Dataset Viewer API and Motherduck DuckDB-NSQL-7B
Blazing Fast SetFit Inference with 🤗 Optimum Intel on Xeon
Customizing models for legal professionals
Harvey partners with OpenAI to build a custom-trained model for legal professionals.
Bringing serverless GPU inference to Hugging Face users
Reducing health insurance costs and improving care
Oscar brings AI to health insurance, reducing costs and improving patient care.
Start using ChatGPT instantly
We’re making it easier for people to experience the benefits of AI without needing to sign up
Navigating the challenges and opportunities of synthetic voices
We’re sharing lessons from a small scale preview of Voice Engine, a model for creating custom voices.
Making education data accessible
Zelma uses GPT-4 to make education data accessible.
Sora first impressions
Since we introduced Sora to the world last month, we’ve been working with artists to learn how Sora might aid in their creative process.
Pollen-Vision: Unified interface for Zero-Shot vision models in robotics
Total noob’s intro to Hugging Face Transformers
Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval
Embedding AI into developer software
JetBrains uses OpenAI’s API to build its fastest-growing product ever.
Introducing the Chatbot Guardrails Arena
A Chatbot on your Laptop: Phi-2 on Intel Meteor Lake
Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models
GaLore: Advancing Large Model Training on Consumer-grade Hardware
Building a data-driven, efficient culture with AI
Holiday Extras rolls out ChatGPT Enterprise across every team, boosting productivity by 500 hours weekly.
Reimagining the email experience with AI
Superhuman introduces a new era of email with OpenAI.
Enterprise-ready trust and safety
Salesforce integrates OpenAI’s enterprise-ready LLMs to transform customer applications.
Easily Train Models with H100 GPUs on NVIDIA DGX Cloud
Quanto: a PyTorch quantization backend for Optimum
CPU Optimized Embeddings with 🤗 Optimum Intel and fastRAG
Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset
Saving lives with AI health coaching
Healthify collaborates with OpenAI to improve millions of lives with sustainable weight loss.
Global news partnerships: Le Monde and Prisa Media
We have partnered with international news organizations Le Monde and Prisa Media to bring French and Spanish news content to ChatGPT.
OpenAI announces new members to board of directors
Dr. Sue Desmond-Hellmann, Nicole Seligman, Fidji Simo join; Sam Altman rejoins board
Review completed & Altman, Brockman to continue to lead OpenAI
New board members named and enhancements to the governance structure introduced
Improving health literacy and patient well-being
Lifespan uses GPT-4 to radically improve health literacy and patient outcomes.
Using AI to improve patient access to clinical trials
Paradigm uses OpenAI’s API to improve patient access to clinical trials.
Sparking a more productive company with ChatGPT Enterprise
Match Group uses ChatGPT Enterprise to spark creativity and impact.
OpenAI and Elon Musk
We are dedicated to the OpenAI mission and have pursued it every step of the way.
Introducing ConTextual: How well can your Multimodal model jointly reason over text and image in text-rich scenes?
Data is better together: Enabling communities to collectively build better datasets together using Argilla and Hugging Face Spaces
Text-Generation Pipeline on Intel® Gaudi® 2 AI Accelerator
StarCoder2 and The Stack v2
TTS Arena: Benchmarking Text-to-Speech Models in the Wild
AI Watermarking 101: Tools and Techniques
Fine-Tuning Gemma Models in Hugging Face
Introducing the Red-Teaming Resistance Leaderboard
🪆 Introduction to Matryoshka Embedding Models
Welcome Gemma - Google’s new open LLM
Introducing the Open Ko-LLM Leaderboard: Leading the Korean LLM Evaluation Ecosystem
🤗 PEFT welcomes new merging methods
Synthetic data: save money, time and carbon with open source
Video generation models as world simulators
We explore large-scale training of generative models on video data. Specifically, we train text-conditional diffusion models jointly on videos and ima…
Disrupting malicious uses of AI by state-affiliated threat actors
AMD Pervasive AI Developer Contest!
Memory and new controls for ChatGPT
We’re testing the ability for ChatGPT to remember things you discuss to make future chats more helpful. You’re in control of ChatGPT’s memory.
From OpenAI to Open LLMs with Messages API on Hugging Face
SegMoE: Segmind Mixture of Diffusion Experts
NPHardEval Leaderboard: Unveiling the Reasoning Abilities of Large Language Models through Complexity Classes and Dynamic Updates
Constitutional AI with Open LLMs
Hugging Face Text Generation Inference available for AWS Inferentia2
Patch Time Series Transformer in Hugging Face
Building an early warning system for LLM-aided biological threat creation
We’re developing a blueprint for evaluating the risk that a large language model (LLM) could aid someone in creating a biological threat. In an evalua…
Introducing the Enterprise Scenarios Leaderboard: a Leaderboard for Real World Use Cases
Accelerate StarCoder with 🤗 Optimum Intel on Xeon: Q8/Q4 and Speculative Decoding
The Hallucinations Leaderboard, an Open Effort to Measure Hallucinations in Large Language Models
An Introduction to AI Secure LLM Safety Leaderboard
New embedding models and API updates
Hugging Face and Google partner for open AI collaboration
Open-source LLMs as LangChain Agents
Fine-Tune W2V2-Bert for low-resource ASR with 🤗 Transformers
PatchTSMixer in HuggingFace
Preference Tuning LLMs with Direct Preference Optimization Methods
Democratic inputs to AI grant program: lessons learned and implementation plans
We funded 10 teams from around the world to design ideas and tools to collectively govern AI. We summarize the innovations, outline our learnings, and…
How OpenAI is approaching 2024 worldwide elections
We’re working to prevent abuse, provide transparency on AI-generated content, and improve access to accurate voting information.
Accelerating SD Turbo and SDXL Turbo Inference with ONNX Runtime and Olive
Run ComfyUI workflows for free with Gradio on Hugging Face Spaces
Building agricultural database for farmers
Digital Green uses OpenAI to increase farmer income.
A guide to setting up your own Hugging Face leaderboard: an end-to-end example with Vectara's hallucination leaderboard
Introducing the GPT Store
Introducing ChatGPT Team
We’re launching a new ChatGPT plan for teams of all sizes, which provides a secure, collaborative workspace to get the most out of ChatGPT at work.
Make LLM Fine-tuning 2x faster with Unsloth and 🤗 TRL
OpenAI and journalism
We support journalism, partner with news organizations, and believe The New York Times lawsuit is without merit.
Delivering LLM-powered health solutions
WHOOP delivers personalized fitness and health coaching with GPT-4.
Welcome aMUSEd: Efficient Text-to-Image Generation
LoRA training scripts of the world, unite!
Speculative Decoding for 2x Faster Whisper Inference
2023, year of open LLMs
Increasing accuracy of pediatric visit notes
Summer Health reimagines pediatric doctor’s visits with OpenAI.
Superalignment Fast Grants
We’re launching $10M in grants to support technical research towards the alignment and safety of superhuman AI systems, including weak-to-strong gener…
Practices for Governing Agentic AI Systems
Weak-to-strong generalization
We present a new research direction for superalignment, together with promising initial results: can we leverage the generalization properties of deep…
Partnership with Axel Springer to deepen beneficial use of AI in journalism
Axel Springer is the first publishing house globally to partner with us on a deeper integration of journalism in AI technologies.
Welcome Mixtral - a SOTA Mixture of Experts on Hugging Face
Mixture of Experts Explained
SetFitABSA: Few-Shot Aspect Based Sentiment Analysis using SetFit
AMD + 🤗: Large Language Models Out-of-the-Box Acceleration with AMD GPU
Optimum-NVIDIA Unlocking blazingly fast LLM inference in just 1 line of code
Goodbye cold boot - how we made LoRA Inference 300% faster
Open LLM Leaderboard: DROP deep dive
Sam Altman returns as CEO, OpenAI has a new initial board
Mira Murati as CTO, Greg Brockman returns as President. Read messages from CEO Sam Altman and board chair Bret Taylor.
OpenAI announces leadership transition
OpenAI Data Partnerships
Working together to create open-source and private datasets for AI training.
SDXL in 4 steps with Latent Consistency LoRAs
Make your llama generation time fly with AWS Inferentia2
Introducing Prodigy-HF: a direct integration with Hugging Face
Comparing the Performance of LLMs: A Deep Dive into Roberta, Llama 2, and Mistral for Disaster Tweets Analysis with Lora
Introducing GPTs
You can now create custom versions of ChatGPT that combine instructions, extra knowledge, and any combination of skills.
New models and developer products announced at DevDay
GPT-4 Turbo with 128K context and lower prices, the new Assistants API, GPT-4 Turbo with Vision, DALL·E 3 API, and more.
Introducing Storage Regions on the HF Hub
Personal Copilot: Train Your Own Coding Assistant
Frontier risk and preparedness
To support the safety of highly-capable AI systems, we are developing our approach to catastrophic risk preparedness, including building a Preparednes…
Frontier Model Forum updates
Together with Anthropic, Google, and Microsoft, we’re announcing the new Executive Director of the Frontier Model Forum and a new $10 million AI Safet…
Interactively explore your Huggingface dataset with one line of code
Deploy Embedding Models with Hugging Face Inference Endpoints
The N Implementation Details of RLHF with PPO
Exploring simple optimizations for SDXL
DALL·E 3 is now available in ChatGPT Plus and Enterprise
We developed a safety mitigation stack to ready DALL·E 3 for wider release and are sharing updates on our provenance research.
Gradio-Lite: Serverless Gradio Running Entirely in Your Browser
Simplifying contract reviews with AI
Ironclad uses GPT-4 to simplify the contract review process.
Building AI-powered apps for business
Retool uses GPT-4 to give businesses a fast, secure way to build AI-powered apps.
Evolving online forms into dynamic data
Typeform evolves online forms into dynamic and conversational data collection experiences with GPT-3.5 and GPT-4.
Accelerating over 130,000 Hugging Face models with ONNX Runtime
DALL·E 3 system card
🧨 Accelerating Stable Diffusion XL Inference with JAX on Cloud TPU v5e
Chat Templates: An End to the Silent Performance Killer
Deploying the AI Comic Factory using the Inference API
Ethics and Society Newsletter #5: Hugging Face Goes To Washington and Other Summer 2023 Musings
Finetune Stable Diffusion Models with DDPO via TRL
Non-engineers guide: Train a LLaMA 2 chatbot
Llama 2 on Amazon SageMaker a Benchmark
ChatGPT can now see, hear, and speak
GPT-4V(ision) system card
Inference for PROs
OpenAI Red Teaming Network
We’re announcing an open call for the OpenAI Red Teaming Network and invite domain experts interested in improving the safety of OpenAI’s models to jo…
Rocket Money x Hugging Face: Scaling Volatile ML Models in Production
Introduction to 3D Gaussian Splatting
Object Detection Leaderboard
Optimizing your LLM in production
Introducing OpenAI Dublin
We’re growing our presence in Europe with an office in Dublin, Ireland.
Introducing Würstchen: Fast Diffusion for Image Generation
Fine-tuning Llama 2 70B using PyTorch FSDP
Overview of natively supported quantization schemes in 🤗 Transformers
SafeCoder vs. Closed-source Code Assistants
Efficient Controllable Generation for SDXL with T2I-Adapters
Join us for OpenAI’s first developer conference on November 6 in San Francisco
Developer registration for in-person attendance will open in the coming weeks and developers everywhere will be able to livestream the keynote.
Spread Your Wings: Falcon 180B is here
Fetch Cuts ML Processing Latency by 50% Using Amazon SageMaker & Hugging Face
Teaching with AI
We’re releasing a guide for teachers using ChatGPT in their classroom—including suggested prompts, an explanation of how ChatGPT works and its limitat…
AudioLDM 2, but faster ⚡️
Introducing ChatGPT Enterprise
Get enterprise-grade security & privacy and the most powerful version of ChatGPT yet.
Code Llama: Llama 2 learns to code
Deprecation of Git Authentication using password
OpenAI partners with Scale to provide support for enterprises fine-tuning models
OpenAI’s customers can leverage Scale’s AI expertise to customize our most advanced models.
Making LLMs lighter with AutoGPTQ and transformers
GPT-3.5 Turbo fine-tuning and API updates
Developers can now bring their own data to customize GPT-3.5 Turbo for their use cases.
Introducing SafeCoder
Introducing IDEFICS: An Open Reproduction of State-of-the-art Visual Language Model
OpenAI acquires Global Illumination
The entire team has joined OpenAI.
Using GPT-4 for content moderation
We use GPT-4 for content policy development and content moderation decisions, enabling more consistent labeling, a faster feedback loop for policy ref…
Hugging Face Hub on the AWS Marketplace: Pay with your AWS Account
Optimizing Bark using 🤗 Transformers
Deploying Hugging Face Models with BentoML: DeepFloyd IF in Action
Fine-tune Llama 2 with DPO
Releasing Swift Transformers: Run On-Device LLMs in Apple Devices
Deploy MusicGen in no time with Inference Endpoints
Huggy Lingo: Using Machine Learning to Improve Language Metadata on the Hugging Face Hub
Towards Encrypted Large Language Models with FHE
Confidence-Building Measures for Artificial Intelligence: Workshop proceedings
Practical 3D Asset Generation: A Step-by-Step Guide
Open-sourcing Knowledge Distillation Code and Weights of SD-Small and SD-Tiny
Stable Diffusion XL on Mac with Advanced Core ML Quantization
Frontier Model Forum
We’re forming a new industry body to promote the safe and responsible development of frontier AI systems: advancing AI safety research, identifying be…
AI Policy @🤗: Open ML Considerations in the EU AI Act
Introducing Agents.js: Give tools to your LLMs using JavaScript
Moving AI governance forward
OpenAI and other leading labs reinforce AI safety, security and trustworthiness through voluntary commitments.
Results of the Open Source AI Game Jam
Custom instructions for ChatGPT
We’re rolling out custom instructions to give you more control over how ChatGPT responds. Set your preferences, and ChatGPT will keep them in mind for…
Happy 1st anniversary 🤗 Diffusers!
Partnership with American Journalism Project to support local news
A new $5+ million partnership aims to explore ways the development of artificial intelligence (AI) can support a thriving, innovative local news field…
Llama 2 is here - get it on Hugging Face
Building an AI WebTV
Open-Source Text Generation & LLM Ecosystem at Hugging Face
Fine-tuning Stable Diffusion models on Intel CPUs
Accurately analyzing large scale qualitative data
Viable uses GPT-4 to analyze qualitative data at a revolutionary scale with unparalleled accuracy.
Frontier AI regulation: Managing emerging risks to public safety
Making ML-powered web games with Transformers.js
Deploy LLMs with Hugging Face Inference Endpoints
Making a web app generator with open ML models
Leveraging Hugging Face for complex generative AI use cases
Insights from global conversations
We are sharing what we learned from our conversations across 22 countries, and how we will be incorporating those insights moving forward.
Accelerating Vision-Language Models: BridgeTower on Habana Gaudi2
Introducing OpenAI London
We are excited to announce OpenAI’s first international expansion with a new office in London, United Kingdom.
Ethics and Society Newsletter #4: Bias in Text-to-Image Models
What's going on with the Open LLM Leaderboard?
Panel on Hugging Face
AI Policy @🤗: Response to the U.S. NTIA's Request for Comment on AI Accountability
Fine-Tune MMS Adapter Models for low-resource ASR
Yes, Transformers are Effective for Time Series Forecasting (+ Autoformer)
Faster Stable Diffusion with Core ML on iPhone, iPad, and Mac
Deploy Livebook notebooks as apps to Hugging Face Spaces
Announcing our new Content Guidelines and Policy
Function calling and other API updates
We’re announcing updates including more steerable API models, function calling capabilities, longer context, and lower prices.
Hugging Face and AMD partner on accelerating state-of-the-art models for CPU and GPU platforms
Can foundation models label data like humans?
The Hugging Face Hub for Galleries, Libraries, Archives and Museums
DuckDB: analyze 50,000+ datasets stored on the Hugging Face Hub
Welcome fastText to the Hugging Face Hub
The Falcon has landed in the Hugging Face ecosystem
AI Speech Recognition in Unity
OpenAI Cybersecurity Grant Program
Our goal is to facilitate the development of AI-powered cybersecurity capabilities for defenders through grants and other support.
Announcing the Open Source AI Game Jam 🎮
Improving mathematical reasoning with process supervision
We've trained a model to achieve a new state-of-the-art in mathematical problem solving by rewarding each correct step of reasoning (“process supervis…
Introducing the Hugging Face LLM Inference Container for Amazon SageMaker
Introducing BERTopic Integration with the Hugging Face Hub
Democratic inputs to AI
Our nonprofit organization, OpenAI, Inc., is launching a program to award ten $100,000 grants to fund experiments in setting up a democratic process f…
Optimizing Stable Diffusion for Intel CPUs with NNCF and 🤗 Optimum
Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA
Hugging Face Collaborates with Microsoft to launch Hugging Face Model Catalog on Azure
Hugging Face and IBM partner on watsonx.ai, the next-generation enterprise studio for AI builders
🐶Safetensors audited as really safe and becoming the default
Instruction-tuning Stable Diffusion with InstructPix2Pix
Governance of superintelligence
Now is a good time to start thinking about the governance of superintelligence—future AI systems dramatically more capable than even AGI.
Introducing the ChatGPT app for iOS
The ChatGPT app syncs your conversations, supports voice input, and brings our latest model improvements to your fingertips.
Large-scale Near-deduplication Behind BigCode
Smaller is better: Q8-Chat, an efficient generative AI experience on Xeon
Hugging Face Selected for the French Data Protection Agency Enhanced Support Program
Run a ChatGPT-like Chatbot on a Single GPU with ROCm
Introducing RWKV - An RNN with the advantages of a transformer
Assisted Generation: a new direction toward low-latency text generation
Language models can explain neurons in language models
We use GPT-4 to automatically write explanations for the behavior of neurons in large language models and to score those explanations. We release a da…
Creating a Coding Assistant with StarCoder
A Dive into Text-to-Video Models
StarCoder: A State-of-the-Art LLM for Code
How to Install and Use the Hugging Face Unity API
Training a language model with 🤗 Transformers using TensorFlow and TPUs
Running IF with 🧨 diffusers on a Free Tier Google Colab
Databricks ❤️ Hugging Face: up to 40% faster training and tuning of Large Language Models
New ways to manage your data in ChatGPT
ChatGPT users can now turn off chat history, allowing you to choose which conversations can be used to train our models.
Introducing HuggingFace blog for Chinese speakers: Fostering Collaboration with the Chinese AI community
How to host a Unity game in a Space
Accelerating Hugging Face Transformers with AWS Inferentia2
Graph Classification with Transformers
Creating Privacy Preserving AI with Substra
Announcing OpenAI’s Bug Bounty Program
This initiative is essential to our commitment to develop safe and advanced AI. As we create technology and services that are secure, reliable, and tr…
Snorkel AI x Hugging Face: unlock foundation models for enterprises
Our approach to AI safety
Ensuring that AI systems are built, deployed, and used safely is critical to our mission.
StackLLaMA: A hands-on guide to train LLaMA with RLHF
Ethics and Society Newsletter #3: Ethical Openness at Hugging Face
Fast Inference on Large Language Models: BLOOMZ on Habana Gaudi2 Accelerator
Accelerating Stable Diffusion Inference on Intel CPUs
Federated Learning using Hugging Face and Flower
March 20 ChatGPT outage: Here’s what happened
An update on our findings, the actions we’ve taken, and technical details of the bug.
Train your ControlNet with diffusers
ChatGPT plugins
We’ve implemented initial support for plugins in ChatGPT. Plugins are tools designed specifically for language models with safety as a core principle,…
Jupyter X Hugging Face
GPTs are GPTs: An early look at the labor market impact potential of large language models
GPT-4
We’ve created GPT-4, the latest milestone in OpenAI’s effort in scaling up deep learning. GPT-4 is a large multimodal model (accepting image and text…
Transforming visual accessibility
Be My Eyes uses GPT-4 to transform visual accessibility.
Preserving languages for the future
How Iceland is using GPT-4 to preserve its language.
Stripe
Stripe leverages GPT-4 to streamline user experience and combat fraud.
Powering virtual education for the classroom
Khan Academy explores the potential for GPT-4 in a limited pilot program.
Filling crucial language learning gaps
GPT-4 deepens the conversation on Duolingo.
Multivariate Probabilistic Time Series Forecasting with Informer
Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU
New ViT and ALIGN Models From Kakao Brain
Using Machine Learning to Aid Survivors and Race through Time
ControlNet in 🧨 Diffusers
Ethical Guidelines for developing the Diffusers library
How Hugging Face Accelerated Development of Witty Works Writing Assistant
Planning for AGI and beyond
Our mission is to ensure that artificial general intelligence—AI systems that are generally smarter than humans—benefits all of humanity.
Red-Teaming Large Language Models
Swift 🧨Diffusers - Fast Stable Diffusion for Mac
Fetch Consolidates AI Tools and Saves 30% Development Time with Hugging Face on AWS
Hugging Face and AWS partner to make AI more accessible
How should AI systems behave, and who should decide?
We’re clarifying how ChatGPT’s behavior is shaped and our plans for improving that behavior, allowing more user customization, and getting more public…
Zero-shot image-to-text generation with BLIP-2
Why we’re switching to Hugging Face Inference Endpoints, and maybe you should too
Parameter-Efficient Fine-Tuning using 🤗 PEFT
Speech Synthesis, Recognition, and More With SpeechT5
Generating Stories: AI for Game Development #5
Introducing ⚔️ AI vs. AI ⚔️ a deep reinforcement learning multi-agents competition system
Accelerating PyTorch Transformers with Intel Sapphire Rapids - part 2
A Dive into Vision-Language Models
Introducing ChatGPT Plus
We’re launching a pilot subscription plan for ChatGPT, a conversational AI that can chat with you, answer follow-up questions, and challenge incorrect…
New AI classifier for indicating AI-written text
We’re launching a classifier trained to distinguish between AI-written and human-written text.
The State of Computer Vision at Hugging Face 🤗
2D Asset Generation: AI for Game Development #4
Using LoRA for Efficient Stable Diffusion Fine-Tuning
What Makes a Dialog Agent Useful?
Optimum+ONNX Runtime - Easier, Faster training for your Hugging Face models
OpenAI and Microsoft extend partnership
We’re happy to announce that OpenAI and Microsoft are extending our partnership.
3D Asset Generation: AI for Game Development #3
Universal Image Segmentation with Mask2Former and OneFormer
Welcome PaddlePaddle to the Hugging Face Hub
Image Similarity with Hugging Face Datasets and Transformers
Forecasting potential misuses of language models for disinformation campaigns and how to reduce risk
OpenAI researchers collaborated with Georgetown University’s Center for Security and Emerging Technology and the Stanford Internet Observatory to inve…
AI for Game Development: Creating a Farming Game in 5 Days. Part 2
Delivering nuanced insights from customer feedback
Using GPT-3 to deliver fast, nuanced insights from customer feedback.
Fine-tuning GPT-3 to scale video creation
Fine-tuning GPT-3 to power and scale done-for-you video creation.
Introduction to Graph Machine Learning
AI for Game Development: Creating a Farming Game in 5 Days. Part 1
Accelerating PyTorch Transformers with Intel Sapphire Rapids - part 1
Creating next-gen characters
Using GPT-3 to create the next generation of AI-powered characters.
The power of continuous learning
Lilian Weng works on Applied AI Research at OpenAI.
Zero-shot image segmentation with CLIPSeg
Model Cards
Point-E: A system for generating 3D point clouds from complex prompts
New and improved embedding model
We are excited to announce a new embedding model which is significantly more capable, cost effective, and simpler to use.
Let's talk about biases in machine learning! Ethics and Society Newsletter #2
A Complete Guide to Audio Datasets
Faster Training and Inference: Habana Gaudi®2 vs Nvidia A100 80GB
Illustrating Reinforcement Learning from Human Feedback (RLHF)
From GPT2 to Stable Diffusion: Hugging Face arrives to the Elixir community
Discovering the minutiae of backend systems
Christian Gibson is an engineer on the Supercomputing team at OpenAI.
Deep Learning with Proteins
Using Stable Diffusion with Core ML on Apple Silicon
Probabilistic Time Series Forecasting with 🤗 Transformers
Introducing ChatGPT
We’ve trained a model called ChatGPT which interacts in a conversational way. The dialogue format makes it possible for ChatGPT to answer followup que…
VQ-Diffusion
We are hiring interns!
Diffusion Models Live Event
Director of Machine Learning Insights [Part 4]
Accelerating Document AI
An overview of inference solutions on Hugging Face
Hugging Face Machine Learning Demos on arXiv
Sentiment Analysis on Encrypted Data with Homomorphic Encryption
Generating Human-level Text with Contrastive Search in Transformers 🤗
Introducing our new pricing
Training Stable Diffusion with Dreambooth using Diffusers
DALL·E API now available in public beta
Starting today, developers can begin building apps with the DALL·E API.
Fine-Tune Whisper For Multilingual ASR with 🤗 Transformers
Accelerate your models with 🤗 Optimum Intel and OpenVINO
Evaluating Language Model Bias with 🤗 Evaluate
From PyTorch DDP to Accelerate to Trainer, mastery of distributed training with ease
Scaling laws for reward model overoptimization
MTEB: Massive Text Embedding Benchmark
Getting Started with Hugging Face Inference Endpoints
🧨 Stable Diffusion in JAX / Flax !
Optimization story: Bloom inference
Introducing DOI: the Digital Object Identifier to Datasets and Models
Japanese Stable Diffusion
Very Large Language Models and How to Evaluate Them
DALL·E now available without waitlist
New users can start creating straight away. Lessons learned from deployment and improvements to our safety systems make wider availability possible.
Image Classification with AutoTrain
How 🤗 Accelerate runs very large models thanks to PyTorch
SetFit: Efficient Few-Shot Learning Without Prompts
Ethics and Society Newsletter #1
Introducing Whisper
Incredibly Fast BLOOM Inference with DeepSpeed and Accelerate
What's new in Diffusers? 🎨
Train your first Decision Transformer
How to train a Language Model with Megatron-LM
DALL·E: Introducing outpainting
Extend creativity and tell a bigger story with DALL·E images of any size.
OpenRAIL: Towards open and responsible AI licensing frameworks
Our approach to alignment research
We are improving our AI systems’ ability to learn from human feedback and to assist humans at evaluating AI. Our goal is to build a sufficiently align…
Visualize proteins on Hugging Face Spaces
Stable Diffusion with 🧨 Diffusers
Pre-Train BERT with Hugging Face Transformers and Habana Gaudi
Deploying 🤗 ViT on Vertex AI
Deep Dive: Vision Transformers On Hugging Face Optimum Graphcore
A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes
Introducing Skops
Hugging Face's TensorFlow Philosophy
Deploying 🤗 ViT on Kubernetes with TF Serving
New and improved content moderation tooling
We are introducing a new and improved content moderation tool. The Moderation endpoint improves upon our previous content filter, and is available for…
Train and Fine-Tune Sentence Transformers Models
Proximal Policy Optimization (PPO)
Introducing the Private Hub: A New Way to Build With Machine Learning
Nyströmformer: Approximating self-attention in linear time and memory via the Nyström method
Comments on U.S. National AI Research Resource Interim Report
Efficient training of language models to fill in the middle
Introducing new audio and vision documentation in 🤗 Datasets
Faster Text Generation with TensorFlow and XLA
A hazard analysis framework for code synthesis large language models
Deploying TensorFlow Vision Models in Hugging Face with TF Serving
Advantage Actor Critic (A2C)
DALL·E now available in beta
We’ll invite 1 million people from our waitlist over the coming weeks. Users can create with DALL·E using free credits that refill every month, and bu…
Reducing bias and improving safety in DALL·E 2
Today, we are implementing a new technique so that DALL·E generates images of people that more accurately reflect the diversity of the world’s populat…
How to train your model dynamically using adversarial data
DALL·E 2: Extending creativity
As part of our DALL·E 2 research preview, more than 3,000 artists from more than 118 countries have incorporated DALL·E into their creative workflows.…
The Technology Behind BLOOM Training
Building a Playlist Generator with Sentence Transformers
Introducing The World's Largest Open Multilingual Language Model: BLOOM
Getting Started with Sentiment Analysis on Twitter
Policy Gradient with PyTorch
Liftoff! How to get started with your first ML project 🚀
DALL·E 2 pre-training mitigations
In order to share the magic of DALL·E 2 with a broad audience, we needed to reduce the risks associated with powerful image generation models. To this…
Accelerate Large Model Training using DeepSpeed
Announcing Evaluation on the Hub
Learning to play Minecraft with Video PreTraining
We trained a neural network to play Minecraft by Video PreTraining (VPT) on a massive unlabeled video dataset of human Minecraft play, while using onl…
Getting Started With Embeddings
Convert Transformers to ONNX with Hugging Face Optimum
Evolution through large models
Intel and Hugging Face Partner to Democratize Machine Learning Hardware Acceleration
Director of Machine Learning Insights [Part 3: Finance Edition]
AI-written critiques help humans notice flaws
We trained “critique-writing” models to describe flaws in summaries. Human evaluators find flaws in summaries much more often when shown our model’s c…
Techniques for training large neural networks
Large neural networks are at the core of many recent advances in AI, but training them is a difficult engineering and research challenge which require…
The Annotated Diffusion Model
Deep Q-Learning with Space Invaders
Best practices for deploying language models
Cohere, OpenAI, and AI21 Labs have developed a preliminary set of best practices applicable to any organization developing or deploying large language…
Teaching models to express their uncertainty in words
Graphcore and Hugging Face Launch New Lineup of IPU-Ready Transformers
Introducing Pull Requests and Discussions 🥳
Powering next generation applications with OpenAI Codex
Codex is now powering 70 different applications across a variety of use cases through the OpenAI API.
Efficient Table Pre-training without Real Data: An Introduction to TAPEX
An Introduction to Q-Learning Part 2/2
How Sempre Health is leveraging the Expert Acceleration Program to accelerate their ML roadmap
Putting ethical principles at the core of the research lifecycle
DALL·E 2 research preview update
Early users have created over 3 million images to date and helped us improve our safety processes. We’re excited to begin adding up to 1,000 new users…
An Introduction to Q-Learning Part 1
Machine Learning Experts - Sasha Luccioni
Announcing the Hugging Face Fellowship Program
Gradio 3.0 is Out!
Director of Machine Learning Insights [Part 2: SaaS Edition]
Student Ambassador Program’s call for applications is open!
Accelerated Inference with Optimum and Transformers Pipelines
We Raised $100 Million for Open & Collaborative Machine Learning 🚀
Welcome fastai to the Hugging Face Hub
OpenAI leadership team update
We’re happy to announce several executive role changes that reflect our recent progress and will ensure continued momentum toward our next major miles…
An Introduction to Deep Reinforcement Learning
Accelerate Large Model Training using PyTorch Fully Sharded Data Parallel
Opinion Classification with Kili and HuggingFace AutoTrain
Director of Machine Learning Insights
Getting Started with Transformers on Habana Gaudi
Introducing Hugging Face for Education 🤗
Supercharged Customer Service with Machine Learning
CO2 Emissions and the 🤗 Hub: Leading the Charge
Hierarchical text-conditional image generation with CLIP latents
Measuring Goodhart’s law
Goodhart’s law famously says: “When a measure becomes a target, it ceases to be a good measure.” Although originally from economics, it’s something we…
Machine Learning Experts - Lewis Tunstall
Habana Labs and Hugging Face Partner to Accelerate Transformer Model Training
~Don't~ Repeat Yourself
Introducing Decision Transformers on Hugging Face 🤗
Machine Learning Experts - Margaret Mitchell
Announcing the 🤗 AI Research Residency Program
Fine-Tune a Semantic Segmentation Model with a Custom Dataset
Accelerate BERT inference with Hugging Face Transformers and AWS Inferentia
Image search with 🤗 datasets
New GPT-3 capabilities: Edit & insert
We’ve released new versions of GPT-3 and Codex which can edit or insert content into existing text, rather than just completing existing text.
Guiding Text Generation with Constrained Beam Search in 🤗 Transformers
A research agenda for assessing the economic impacts of code generation models
Lessons learned on language model safety and misuse
We describe our latest thinking in the hope of helping other AI developers address safety and misuse of deployed models.
Economic impacts research at OpenAI
Call for expressions of interest to study the economic impacts of large language models.
BERT 101 - State Of The Art NLP Model Explained
Fine-Tune ViT for Image Classification with 🤗 Transformers
Solving (some) formal math olympiad problems
We built a neural theorem prover for Lean that learned to solve a variety of challenging high-school olympiad problems, including problems from the AM…
Getting Started with Sentiment Analysis using Python
Making automatic speech recognition work on large files with Wav2Vec2 in 🤗 Transformers
Aligning language models to follow instructions
Introducing text and code embeddings
We are introducing embeddings, a new endpoint in the OpenAI API that makes it easy to perform natural language and code tasks like semantic search, cl…
Supercharged Searching on the 🤗 Hub
Text and code embeddings by contrastive pre-training
Welcome Stable-baselines3 to the Hugging Face Hub 🤗
Case Study: Millisecond Latency using Hugging Face Infinity and modern CPUs
Boosting Wav2Vec2 with n-grams in 🤗 Transformers
Deploy GPT-J 6B for inference using Hugging Face Transformers and Amazon SageMaker
Active Learning with AutoNLP and Prodigy
Gradio is joining Hugging Face!
WebGPT: Improving the factual accuracy of language models through web browsing
We’ve fine-tuned GPT-3 to more accurately answer open-ended questions using a text-based web browser.
Perceiver IO: a scalable, fully-attentional model that works on any modality
Customizing GPT-3 for your application
Fine-tune with a single command.
Training CodeParrot 🦜 from Scratch
Introducing Snowball Fight ☃️, our first ML-Agents environment
OpenAI Residency
As part of our effort to support and develop AI talent, we’re excited to announce the OpenAI Residency.
Getting Started with Hugging Face Transformers for IPUs with Optimum
Introducing the Data Measurements Tool: an Interactive Tool for Looking at Datasets
Accelerating PyTorch distributed fine-tuning with Intel technologies
OpenAI’s API now available with no waitlist
Wider availability made possible by safety progress.
Fine-Tune XLSR-Wav2Vec2 for low-resource ASR with 🤗 Transformers
Scaling up BERT-like model Inference on modern CPU - Part 2
Solving math word problems
We’ve trained a system that solves grade school math problems with nearly twice the accuracy of a fine-tuned GPT-3 model. It solves about 90% as many…
Course Launch Community Event
Large Language Models: A New Moore's Law?
Train a Sentence Embedding Model with 1B Training Pairs
The Age of Machine Learning As Code Has Arrived
Fine tuning CLIP with Remote Sensing (Satellite) images and captions
Hosting your Models and Datasets on Hugging Face Spaces using Streamlit
Showcase Your Projects in Spaces using Gradio
Summer at Hugging Face
Summarizing books with human feedback
Scaling human oversight of AI systems for tasks that are difficult to evaluate.
Hugging Face and Graphcore partner for IPU-optimized Transformers
Introducing Optimum: The Optimization Toolkit for Transformers at Scale
Helen Toner joins OpenAI’s board of directors
Today, we’re excited to announce the appointment of Helen Toner to our board of directors.
TruthfulQA: Measuring how models mimic human falsehoods
OpenAI Codex
We’ve created an improved version of OpenAI Codex, our AI system that translates natural language to code, and we are releasing it through our API in…
Introducing Triton: Open-source GPU programming for neural networks
We’re releasing Triton 1.0, an open-source Python-like programming language which enables researchers with no CUDA experience to write highly efficien…
Deep Learning over the Internet: Training Language Models Collaboratively
Welcome spaCy to the Hugging Face Hub
Deploy Hugging Face models easily with Amazon SageMaker
Evaluating large language models trained on code
Sentence Transformers in the Hugging Face Hub
Improving language model behavior by training on a curated dataset
Our latest research finds we can improve language model behavior with respect to specific behavioral values by fine-tuning on a small, curated dataset…
Few-shot learning in practice: GPT-Neo and the 🤗 Accelerated Inference API
Using & Mixing Hugging Face Models with Gradio 2.0
OpenAI Scholars 2021: Final projects
We’re proud to announce that the 2021 class of OpenAI Scholars has completed our six-month mentorship program and has produced an open-source researc…
Will Hurd joins OpenAI’s board of directors
OpenAI is committed to developing general-purpose artificial intelligence that benefits all humanity, and we believe that achieving our goal requires…
Scaling-up BERT Inference on CPU (Part 1)
Introducing 🤗 Accelerate
Distributed Training: Train BART/T5 for Summarization using 🤗 Transformers and Amazon SageMaker
Understanding BigBird's Block Sparse Attention
GPT-3 powers the next generation of apps
Over 300 applications are delivering GPT-3–powered search, conversation, text completion, and other advanced AI features through our API.
The Partnership: Amazon SageMaker and Hugging Face
My Journey to a serverless transformers pipeline on Google Cloud
Fine-Tune Wav2Vec2 for English ASR in Hugging Face with 🤗 Transformers
Hugging Face Reads, Feb. 2021 - Long-range Transformers
Multimodal neurons in artificial neural networks
We’ve discovered neurons in CLIP that respond to the same concept whether presented literally, symbolically, or conceptually. This may explain CLIP’s…
Simple considerations for simple people building fancy neural networks
Retrieval Augmented Generation with Huggingface Transformers and Ray
Hugging Face on PyTorch / XLA TPUs
Understanding the capabilities, limitations, and societal impact of large language models
Faster TensorFlow models in Hugging Face Transformers
Scaling Kubernetes to 7,500 nodes
We’ve scaled Kubernetes clusters to 7,500 nodes, producing a scalable infrastructure for large models like GPT-3, CLIP, and DALL·E, but also for rapid…
Fit More and Train Faster With ZeRO via DeepSpeed and FairScale
How we sped up transformer inference 100x for 🤗 API customers
DALL·E: Creating images from text
We’ve trained a neural network called DALL·E that creates images from text captions for a wide range of concepts expressible in natural language.
CLIP: Connecting text and images
We’re introducing a neural network called CLIP which efficiently learns visual concepts from natural language supervision. CLIP can be applied to any…
Organizational update from OpenAI
It’s been a year of dramatic change and growth at OpenAI.
Leveraging Pre-trained Language Model Checkpoints for Encoder-Decoder Models
Porting fairseq wmt19 translation system to transformers
Hyperparameter Search with Transformers and Ray Tune
Transformer-based Encoder-Decoder Models
OpenAI licenses GPT-3 technology to Microsoft
OpenAI has agreed to license GPT-3 to Microsoft for their own products and services.
Block Sparse Matrices for Smaller and Faster Language Models
Generative language modeling for automated theorem proving
Learning to summarize with human feedback
We’ve applied reinforcement learning from human feedback to train language models that are better at summarization.
OpenAI Scholars 2020: Final projects
Our third class of OpenAI Scholars presented their final projects at virtual Demo Day, showcasing their research results from over the past five month…
The Reformer - Pushing the limits of language modeling
Procgen and MineRL Competitions
We’re excited to announce that OpenAI is co-organizing two NeurIPS 2020 competitions with AIcrowd, Carnegie Mellon University, and DeepMind, using Pro…
Image GPT
We find that, just as a large transformer model trained on language can generate coherent text, the same exact model trained on pixel sequences can ge…
OpenAI API
We’re releasing an API for accessing new AI models developed by OpenAI.
Language models are few-shot learners
AI and efficiency
We’re releasing an analysis showing that since 2012 the amount of compute needed to train a neural net to the same performance on ImageNet classificat…
Jukebox
We’re introducing Jukebox, a neural net that generates music, including rudimentary singing, as raw audio in a variety of genres and artist styles. We…
Improving verifiability in AI development
We’ve contributed to a multi-stakeholder report by 58 co-authors at 30 organizations, including the Centre for the Future of Intelligence, Mila, Schwa…
OpenAI Microscope
We’re introducing OpenAI Microscope, a collection of visualizations of every significant layer and neuron of eight vision “model organisms” which are…
How to generate text: using different decoding methods for language generation with Transformers
How to train a new language model from scratch using Transformers and Tokenizers
OpenAI standardizes on PyTorch
We are standardizing OpenAI’s deep learning framework on PyTorch.
Scaling laws for neural language models
Dota 2 with large scale deep reinforcement learning
Deep double descent
We show that the double descent phenomenon occurs in CNNs, ResNets, and transformers: performance first improves, then gets worse, and then improves a…
Procgen Benchmark
We’re releasing Procgen Benchmark, 16 simple-to-use procedurally-generated environments which provide a direct measure of how quickly a reinforcement…
Safety Gym
We’re releasing Safety Gym, a suite of environments and tools for measuring progress towards reinforcement learning agents that respect safety constra…
Benchmarking safe exploration in deep reinforcement learning
GPT-2: 1.5B release
As the final model release of GPT-2’s staged release, we’re releasing the largest version (1.5B parameters) of GPT-2 along with code and model weights…
Solving Rubik’s Cube with a robot hand
We’ve trained a pair of neural networks to solve the Rubik’s Cube with a human-like robot hand. The neural networks are trained entirely in simulation…
OpenAI Scholars 2020: Applications open
We are now accepting applications for our third class of OpenAI Scholars.
Fine-tuning GPT-2 from human preferences
We’ve fine-tuned the 774M parameter GPT-2 language model using human feedback for various tasks, successfully matching the preferences of the external…
Emergent tool use from multi-agent interaction
We’ve observed agents discovering progressively more complex tool use while playing a simple game of hide-and-seek. Through training in our new simula…
Testing robustness against unforeseen adversaries
We’ve developed a method to assess whether a neural network classifier can reliably defend against adversarial attacks not seen during training. Our m…
GPT-2: 6-month follow-up
We’re releasing the 774 million parameter GPT-2 language model after the release of our small 124M model in February, staged release of our medium 355…
Learning Day
At OpenAI, each Thursday is Learning Day: a day where employees have the option to self-study technical skills that will make them better at their job…
Microsoft invests in and partners with OpenAI to support us building beneficial AGI
Microsoft is investing $1 billion in OpenAI to support us building artificial general intelligence (AGI) with widely distributed economic benefits. We…
Why responsible AI development needs cooperation on safety
We’ve written a policy research paper identifying four strategies that can be used today to improve the likelihood of long-term industry cooperation o…
OpenAI Robotics Symposium 2019
We hosted the first OpenAI Robotics Symposium on April 27, 2019.
OpenAI Scholars 2019: Final projects
Our second class of OpenAI Scholars has concluded, with all eight scholars producing an exciting final project showcased at Scholars Demo Day at OpenA…
OpenAI Fellows Fall 2018: Final projects
Our second class of OpenAI Fellows has wrapped up, with each Fellow going from a machine learning beginner to core OpenAI contributor in the course of…
Transfer of adversarial robustness between perturbation types
MuseNet
We’ve created MuseNet, a deep neural network that can generate 4-minute musical compositions with 10 different instruments, and can combine styles fro…
Generative modeling with sparse transformers
We’ve developed the Sparse Transformer, a deep neural network which sets new records at predicting what comes next in a sequence—whether text, images,…
OpenAI Five defeats Dota 2 world champions
OpenAI Five is the first AI to beat the world champions in an esports game, having won two back-to-back games versus the world champion Dota 2 team, O…
OpenAI Five Finals
We’ll be holding our final live event for OpenAI Five at 11:30am PT on April 13.
Implicit generation and generalization methods for energy-based models
We’ve made progress towards stable and scalable training of energy-based models (EBMs) resulting in better sample quality and generalization ability t…
OpenAI Scholars 2019: Meet our Scholars
Our class of eight scholars (out of 550 applicants) brings together collective expertise in literature, philosophy, cell biology, statistics, economic…
OpenAI LP
We’ve created OpenAI LP, a new “capped-profit” company that allows us to rapidly increase our investments in compute and talent while including checks…
Introducing Activation Atlases
We’ve created activation atlases (in collaboration with Google researchers), a new technique for visualizing what interactions between neurons can rep…
Neural MMO: A massively multiagent game environment
We’re releasing a Neural MMO, a massively multiagent game environment for reinforcement learning agents. Our platform supports a large, variable numbe…
Spinning Up in Deep RL: Workshop review
On February 2, we held our first Spinning Up Workshop as part of our new education initiative at OpenAI.
AI safety needs social scientists
We’ve written a paper arguing that long-term AI safety research needs social scientists to ensure AI alignment algorithms succeed when actual humans a…
Better language models and their implications
We’ve trained a large-scale unsupervised language model which generates coherent paragraphs of text, achieves state-of-the-art performance on many lan…
Computational limitations in robust classification and win-win results
OpenAI Fellows Summer 2018: Final projects
Our first cohort of OpenAI Fellows has concluded, with each Fellow going from a machine learning beginner to core OpenAI contributor in the course of…
How AI training scales
We’ve discovered that the gradient noise scale, a simple statistical metric, predicts the parallelizability of neural network training on a wide range…
Quantifying generalization in reinforcement learning
We’re releasing CoinRun, a training environment which provides a metric for an agent’s ability to transfer its experience to novel situations and has…
Spinning Up in Deep RL
We’re releasing Spinning Up in Deep RL, an educational resource designed to let anyone learn to become a skilled practitioner in deep reinforcement le…
Learning concepts with energy functions
We’ve developed an energy-based model that can quickly learn to identify and generate instances of concepts, such as near, above, between, closest, an…
Plan online, learn offline: Efficient learning and exploration via model-based control
Reinforcement learning with prediction-based rewards
We’ve developed Random Network Distillation (RND), a prediction-based method for encouraging reinforcement learning agents to explore their environmen…
Learning complex goals with iterated amplification
We’re proposing an AI safety technique called iterated amplification that lets us specify complicated behaviors and goals that are beyond human scale,…
OpenAI Scholars 2019: Applications open
We are now accepting applications for our second cohort of OpenAI Scholars, a program where we provide 6–10 stipends and mentorship to individuals fro…
OpenAI Fellows Winter 2019 & Interns Summer 2019
We are now accepting applications for OpenAI Fellows and Interns for 2019.
FFJORD: Free-form continuous dynamics for scalable reversible generative models
OpenAI Scholars 2018: Final projects
Our first cohort of OpenAI Scholars has now completed the program.
The International 2018: Results
OpenAI Five lost two games against top Dota 2 players at The International in Vancouver this week, maintaining a good chance of winning for the first…
Large-scale study of curiosity-driven learning
OpenAI Five Benchmark: Results
Yesterday, OpenAI Five won a best-of-three against a team of 99.95th percentile Dota players: Blitz, Cap, Fogged, Merlini, and MoonMeander—four of who…
Learning dexterity
We’ve trained a human-like robot hand to manipulate physical objects with unprecedented dexterity.
Variational option discovery algorithms
OpenAI Scholars 2018: Meet our Scholars
Our first class of OpenAI Scholars is underway, and you can now follow along as this group of experienced software developers becomes machine learning…
OpenAI Five Benchmark
The OpenAI Five Benchmark match is now over!
Glow: Better reversible generative models
We introduce Glow, a reversible generative model which uses invertible 1x1 convolutions. It extends previous work on reversible generative models and…
Learning Montezuma’s Revenge from a single demonstration
We’ve trained an agent to achieve a high score of 74,500 on Montezuma’s Revenge from a single human demonstration, better than any previously publishe…
OpenAI Five
Our team of five neural networks, OpenAI Five, has started to defeat amateur human teams at Dota 2.
Retro Contest: Results
The first run of our Retro Contest—exploring the development of algorithms that can generalize from previous experience—is now complete.
Learning policy representations in multiagent systems
Improving language understanding with unsupervised learning
We’ve obtained state-of-the-art results on a suite of diverse language tasks with a scalable, task-agnostic system, which we’re also releasing. Our ap…
GamePad: A learning environment for theorem proving
OpenAI Fellows Fall 2018
We’re now accepting applications for the next cohort of OpenAI Fellows, a program which offers a compensated 6-month apprenticeship in AI research at…
Gym Retro
We’re releasing the full version of Gym Retro, a platform for reinforcement learning research on games. This brings our publicly-released game count f…
AI and compute
We’re releasing an analysis showing that since 2012, the amount of compute used in the largest AI training runs has been increasing exponentially with…
AI safety via debate
We’re proposing an AI safety technique which trains agents to debate topics with one another, using a human to judge who wins.
Evolved Policy Gradients
We’re releasing an experimental metalearning approach called Evolved Policy Gradients, a method that evolves the loss function of learning agents, whi…
Gotta Learn Fast: A new benchmark for generalization in RL
Retro Contest
We’re launching a transfer learning contest that measures a reinforcement learning algorithm’s ability to generalize from previous experience.
Variance reduction for policy gradient with action-dependent factorized baselines
Report from the OpenAI hackathon
On March 3rd, we hosted our first hackathon with 100 members of the artificial intelligence community.
Improving GANs using optimal transport
On first-order meta-learning algorithms
Reptile: A scalable meta-learning algorithm
We’ve developed a simple meta-learning algorithm called Reptile which works by repeatedly sampling a task, performing stochastic gradient descent on i…
OpenAI Scholars
We’re providing 6–10 stipends and mentorship to individuals from underrepresented groups to study deep learning full-time for 3 months and open-source…
Some considerations on learning to explore via meta-reinforcement learning
Multi-Goal Reinforcement Learning: Challenging robotics environments and request for research
Ingredients for robotics research
We’re releasing eight simulated robotics environments and a Baselines implementation of Hindsight Experience Replay, all developed for our research ov…
OpenAI hackathon
Come to OpenAI’s office in San Francisco’s Mission District for talks and a hackathon on Saturday, March 3rd.
Preparing for malicious uses of AI
We’ve co-authored a paper that forecasts how malicious actors could misuse AI technology, and potential ways we can prevent and mitigate these threats…
OpenAI supporters
We’re excited to welcome new donors to OpenAI.
Interpretable machine learning through teaching
We’ve designed a method that encourages AIs to teach each other with examples that also make sense to humans. Our approach automatically selects the m…
Discovering types for entity disambiguation
We’ve built a system for automatically figuring out which object is meant by a word by having a neural network decide if the word belongs to each of a…
Requests for Research 2.0
We’re releasing a new batch of seven unsolved problems which have come up in the course of our research at OpenAI.
Scaling Kubernetes to 2,500 nodes
Block-sparse GPU kernels
We’re releasing highly-optimized GPU kernels for an underexplored class of neural network architectures: networks with block-sparse weights. Depending…
Learning sparse neural networks through L₀ regularization
Interpretable and pedagogical examples
Learning a hierarchy
We’ve developed a hierarchical reinforcement learning algorithm that learns high-level actions useful for solving a range of tasks, allowing fast solv…
Generalizing from simulation
Our latest robotics techniques allow robot controllers, trained entirely in simulation and deployed on physical robots, to react to unplanned changes…
Sim-to-real transfer of robotic control with dynamics randomization
Asymmetric actor critic for image-based robot learning
Domain randomization and generative models for robotic grasping
Competitive self-play
We’ve found that self-play allows simulated AIs to discover physical skills like tackling, ducking, faking, kicking, catching, and diving for the ball…
Meta-learning for wrestling
We show that for the task of simulated robot wrestling, a meta-learning agent can learn to quickly defeat a stronger non-meta-learning agent, and also…
Nonlinear computation in deep linear networks
Learning to model other minds
We’re releasing an algorithm which accounts for the fact that other agents are learning too, and discovers self-interested yet collaborative strategie…
Learning with opponent-learning awareness
OpenAI Baselines: ACKTR & A2C
We’re releasing two new OpenAI Baselines implementations: ACKTR and A2C. A2C is a synchronous, deterministic variant of Asynchronous Advantage Actor C…
More on Dota 2
Our Dota 2 result shows that self-play can catapult the performance of machine learning systems from far below human level to superhuman, given suffic…
Dota 2
We’ve created a bot which beats the world’s top professionals at 1v1 matches of Dota 2 under standard tournament rules. The bot learned the game from…
Gathering human feedback
RL-Teacher is an open-source implementation of our interface to train AIs via occasional human feedback rather than hand-crafted reward functions. The…
Better exploration with parameter noise
We’ve found that adding adaptive noise to the parameters of reinforcement learning algorithms frequently boosts performance. This exploration method i…
Proximal Policy Optimization
We’re releasing a new class of reinforcement learning algorithms, Proximal Policy Optimization (PPO), which perform comparably or better than state-of…
Robust adversarial inputs
We’ve created images that reliably fool neural network classifiers when viewed from varied scales and perspectives. This challenges a claim from last…
Hindsight Experience Replay
Teacher–student curriculum learning
Faster physics in Python
We’re open-sourcing a high-performance Python library for robotic simulation using the MuJoCo engine, developed over our past year of robotics researc…
Learning from human preferences
One step towards building safe AI systems is to remove the need for humans to write goal functions, since using a simple proxy for a complex goal, or…
Learning to cooperate, compete, and communicate
Multiagent environments where agents compete for resources are stepping stones on the path to AGI. Multiagent environments have two useful properties:…
UCB exploration via Q-ensembles
OpenAI Baselines: DQN
We’re open-sourcing OpenAI Baselines, our internal effort to reproduce reinforcement learning algorithms with performance on par with published result…
Robots that learn
We’ve created a robotics system, trained entirely in simulation and deployed on a physical robot, which can learn a new task after seeing it done once…
Roboschool
We are releasing Roboschool: open-source software for robot simulation, integrated with OpenAI Gym.
Equivalence between policy gradients and soft Q-learning
Stochastic Neural Networks for hierarchical reinforcement learning
Unsupervised sentiment neuron
We’ve developed an unsupervised system which learns an excellent representation of sentiment, despite being trained only to predict the next character…
Spam detection in the physical world
We’ve created the world’s first Spam-detecting AI trained entirely in simulation and deployed on a physical robot.
Evolution strategies as a scalable alternative to reinforcement learning
We’ve discovered that evolution strategies (ES), an optimization technique that’s been known for decades, rivals the performance of standard reinforce…
One-shot imitation learning
Distill
We’re excited to support today’s launch of Distill, a new kind of journal aimed at excellent communication of machine learning results (novel or exist…
Learning to communicate
In this post we’ll outline new OpenAI research in which agents develop their own language.
Emergence of grounded compositional language in multi-agent populations
Prediction and control with temporal segment models
Third-person imitation learning
Attacking machine learning with adversarial examples
Adversarial examples are inputs to machine learning models that an attacker has intentionally designed to cause the model to make a mistake; they’re l…
Adversarial attacks on neural network policies
Team update
The OpenAI team is now 45 people. Together, we’re pushing the frontier of AI capabilities—whether by validating novel ideas, creating new software sys…
PixelCNN++: Improving the PixelCNN with discretized logistic mixture likelihood and other modifications
Faulty reward functions in the wild
Reinforcement learning algorithms can break in surprising, counterintuitive ways. In this post we’ll explore one failure mode, which is where you miss…
Universe
We’re releasing Universe, a software platform for measuring and training an AI’s general intelligence across the world’s supply of games, websites and…
#Exploration: A study of count-based exploration for deep reinforcement learning
OpenAI and Microsoft
We’re working with Microsoft to start running most of our large-scale experiments on Azure.
On the quantitative analysis of decoder-based generative models
A connection between generative adversarial networks, inverse reinforcement learning, and energy-based models
RL²: Fast reinforcement learning via slow reinforcement learning
Variational lossy autoencoder
Extensions and limitations of the neural GPU
Semi-supervised knowledge transfer for deep learning from private training data
Report from the self-organizing conference
Last week we hosted over a hundred and fifty AI practitioners in our offices for our first self-organizing conference on machine learning.
Transfer from simulation to real world through learning deep inverse dynamics model
Infrastructure for deep learning
Deep learning is an empirical science, and the quality of a group’s infrastructure is a multiplier on progress. Fortunately, today’s open-source ecosy…
Machine Learning Unconference
The latest information about the Unconference is now available at the Unconference wiki, which will be periodically updated with more information for…
Team update
We’ve hired more great people to help us achieve our goals. Welcome, everyone!
Special projects
Impactful scientific work requires working on the right problems—problems which are not just interesting, but whose solutions matter.
Concrete AI safety problems
We (along with researchers from Berkeley and Stanford) are co-authors on today’s paper led by Google Brain researchers, Concrete Problems in AI Safety…
OpenAI technical goals
OpenAI’s mission is to build safe AI, and ensure AI’s benefits are as widely and evenly distributed as possible.
Generative models
This post describes four projects that share a common theme of enhancing or using generative models, a branch of unsupervised learning techniques in m…
Team update
We’d like to welcome the latest set of team members to OpenAI (and we’re still hiring!)
Adversarial training methods for semi-supervised text classification
OpenAI Gym Beta
We’re releasing the public beta of OpenAI Gym, a toolkit for developing and comparing reinforcement learning (RL) algorithms. It consists of a growing…
Welcome, Pieter and Shivon!
We have two more team updates.
Team++
We've had some fantastic people join over the past few months (and we're still hiring). Welcome, everyone!
Weight normalization: A simple reparameterization to accelerate training of deep neural networks
Introducing OpenAI
OpenAI is a non-profit artificial intelligence research company. Our goal is to advance digital intelligence in the way that is most likely to benefit…