HOW TO ADD OPENAI GPT-5 MODEL IN AZURE AI FOUNDRY

August 13, 2025 / by Marco / Categories: Business

Looking to try the new GPT‑5 model inside Azure AI Foundry? In this walkthrough, I’ll take you through the exact journey shown in the video—from signing in, requesting access (because the model is gated), all the way to deploying the base model and grabbing the endpoint you’ll use in your apps. If you’ve never deployed a gated Azure AI model before, don’t stress. It’s a straightforward process once you know where everything lives in the portal.

We’ll cover:

  • How to find the deployment area in Azure AI Foundry
  • What to do if GPT‑5 shows a lock icon (access request)
  • The email sequence you’ll receive and what to click
  • Deploying GPT‑5 as a base model and choosing a region
  • Setting a sensible rate limit (tokens per minute)
  • Where to find the endpoint details to start coding

Along the way, I’ll call out some practical tips about quotas, regions, and rate limits, so you don’t get stuck waiting in queues or wondering why a region is unavailable. The screenshots referenced in the video are included below—when you see them, they’ll help you visualise the exact screens you need to use.

As you can see in the image below, this is the starting point for the process inside Azure AI Foundry.

[Screenshot: How to add OpenAI GPT-5 in Azure AI Foundry, 0:00]

Before you start: what you’ll need

Before diving into the deployment steps, check you’ve got:

  • An active Azure subscription with permission to create resources (Owner or Contributor on the subscription or resource group).
  • Access to Azure AI Foundry (formerly Azure AI Studio).
  • Your Azure Subscription ID handy for any access request forms.
  • Awareness of your organisation’s region and compliance requirements (because not every region will have quota for new models straight away).

If you’ve deployed other Azure AI models before, this will feel familiar. The only extra step is the access request, because GPT‑5 appears as a gated model in the catalogue.

Step 1: Sign in and open the Deployments area

Start by signing in to your Azure account. In Azure AI Foundry, look in the left-hand navigation and find the Deployments area. This is where you control which models are deployed into your workspace.

Click Deploy model to kick off the process. You’ll be presented with a list of models you can deploy. If this is your first time here, it’s simply a catalogue—you still need to choose and configure the model you want.

Step 2: Find GPT‑5 in the model list and request access

In the model list, locate GPT‑5. You’ll notice there’s a lock icon next to it. That lock indicates the model is gated. To proceed, you need to submit a request to Microsoft for access.

As you can see in the image below, the GPT‑5 entry displays with a lock until your request is approved, which is normal for new or high‑demand models.

[Screenshot: How to add OpenAI GPT-5 in Azure AI Foundry, 0:25]

Click the option to request access. You’ll be taken to a form where you’ll provide:

  • Your contact details
  • Your Azure Subscription ID
  • Basic usage context (if requested), describing how you plan to use the model

It’s all fairly straightforward. Fill out the fields accurately and submit. You’ll receive an acknowledgement email shortly after, usually from Cognitive Services Gating Support. Keep an eye on your inbox for the follow‑up verification and approval messages.

Step 3: Verify your email

After submitting the access request, you’ll receive an email prompting you to verify your email address. Click the blue verification button in that message to confirm it’s really you. This is important—your request won’t progress until you complete verification.

As you can see in the image below, the verification email includes a clear call‑to‑action button. Click it, and you’ll be on your way.

[Screenshot: How to add OpenAI GPT-5 in Azure AI Foundry, 0:52]

Once verified, all you need to do is wait for the approval email. This can be quick, but timing can vary depending on demand and your account.

Step 4: Watch for the approval/onboarding email

When your request has been approved, you’ll receive an onboarding or approval email stating your access has been granted. At that point, head back into Azure AI Foundry and return to the Deploy model area. Choose Deploy base model to open the deployment wizard for GPT‑5.

You should now see GPT‑5 listed and selectable. You may also notice newer variants in the list, such as Nano, Chat, or Pro editions. These are handy if you’re experimenting with different capabilities or cost/latency profiles, but for this guide, we’ll stick with deploying GPT‑5.

Once the request is approved, GPT‑5 appears in the deployable list along with other models, and you’ll note an indicative capacity figure in the UI.

At this point, the wizard may show a default or suggested capacity in tokens per minute (TPM). Treat this as a starting point; you can customise the rate limit in a later step, within your quota.

Step 5: Choose configuration, region and quotas

Next, you’ll be prompted to configure the deployment: naming, pricing tier (for example, Global Standard), and region. The region is a key decision because availability and quota can vary. In the video example, some regions displayed no quota at all for GPT‑5, which meant they couldn’t be selected. That’s common when a model is newly available or in high demand.

As you can see in the image below, when a region has no quota, you simply can’t pick it. In the walkthrough, the only available option was East US 2, so that’s what was selected.

[Screenshot: How to add OpenAI GPT-5 in Azure AI Foundry, 1:54]

If you have a preference for a closer region due to latency or data residency, you can either wait for quota to open up or submit a quota request through your Azure support channel. For many teams, selecting an available region now and migrating later is the quickest way to get hands‑on.

Step 6: Set a sensible rate limit (tokens per minute)

During deployment, you’ll have the option to set a tokens‑per‑minute (TPM) rate limit for the deployment. Think of this as your safety valve—it protects your budget and ensures your application won’t exceed a throughput you’re comfortable with. The video demonstrates adjusting the TPM value, with commentary that they wouldn’t need more than roughly 1,000 TPM for their use case.

Here are a few tips for picking a TPM value:

  • Start conservatively. If you’re testing or building a prototype, a lower TPM helps control costs whilst you tune prompts and usage patterns.
  • Align with expected traffic. Estimate how many requests per minute you’ll receive, multiply by average tokens per request/response, and set TPM accordingly.
  • Watch for throttling. If you see 429 or rate limit errors in your application logs, you may need to nudge your TPM higher (assuming your quota allows).

Remember, the TPM you set must fit within your assigned quota for the region and model. If you need more headroom, you’ll have to request a quota increase or choose a region with more available capacity.
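To put rough numbers on that, here’s a minimal sketch in Python (the helper name is illustrative) for turning expected traffic into a TPM figure using the estimate-and-multiply approach from the tips above:

```python
def estimate_tpm(requests_per_minute: float, avg_tokens_per_request: float,
                 headroom: float = 1.0) -> int:
    """Rough TPM estimate: expected throughput, optionally padded with headroom."""
    return int(requests_per_minute * avg_tokens_per_request * headroom)

# Example: 20 requests/min at ~500 tokens (prompt + response) each.
print(estimate_tpm(20, 500))                 # 10000
print(estimate_tpm(20, 500, headroom=1.5))   # 15000, with a 50% buffer for spikes
```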

Step 7: Create the resource and deploy

With the region and TPM set, click Create resource and deploy. Azure will begin provisioning the deployment. This can take a little while—especially when a model is new and a lot of people are trying to get access at once—so don’t be surprised if there’s a short wait.

As you can see in the image below, the provisioning screen shows progress while Azure spins up the deployment in the background.

[Screenshot: How to add OpenAI GPT-5 in Azure AI Foundry, 2:47]

Once it flips to a success state, you’re ready for the final step: grabbing the endpoint and connecting your app.

Step 8: Grab your endpoint and start building

After the deployment is created, open it from the Deployments area. There you’ll find the endpoint URL, the deployment name, and the other bits you need to connect. Depending on your workspace and security setup, your keys or tokens will be in the usual place for Azure AI services.

With that information, you can integrate GPT‑5 into your app or workflow. Whether you’re using the REST API or an SDK, you’ll specify the endpoint, include your deployment name, and authenticate using your key or Microsoft Entra ID (formerly Azure AD). From there, it’s just a matter of sending prompts and handling responses.
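As a concrete starting point, here’s a minimal sketch using the official OpenAI Python SDK’s Azure client. The endpoint, key, API version, and deployment name (gpt5-prod-chat) are placeholders; substitute the values from your own deployment:

```python
import os
from openai import AzureOpenAI  # pip install openai

# Endpoint, key, and deployment name come from your deployment's details page.
client = AzureOpenAI(
    azure_endpoint=os.environ["AZURE_OPENAI_ENDPOINT"],  # e.g. https://<resource>.openai.azure.com
    api_key=os.environ["AZURE_OPENAI_API_KEY"],
    api_version="2024-10-21",  # placeholder; use the version shown in your portal
)

response = client.chat.completions.create(
    model="gpt5-prod-chat",  # your deployment name, not the model family name
    messages=[{"role": "user", "content": "Say hello from GPT-5 on Azure."}],
)
print(response.choices[0].message.content)
```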

That’s it—you’ve added GPT‑5 in Azure AI Foundry and you’re good to go.

Key takeaways from the video

  • GPT‑5 appears as a locked model by default—submit an access request via the portal.
  • You’ll receive an acknowledgement email, then a verification email with a blue button to confirm your address, followed by an onboarding/approval email.
  • Once approved, go to Deploy model → Deploy base model, select GPT‑5, and continue through the wizard.
  • Regions vary in availability; in the example, East US 2 had capacity while others didn’t.
  • Set a tokens‑per‑minute rate limit that suits your use case and budget.
  • Provisioning may take a little while due to demand—be patient.
  • After deployment, copy the endpoint details to integrate GPT‑5 with your application.

FAQ: common issues and quick fixes

I can’t see GPT‑5 in the list or it still shows a lock. What now?

If the model still shows a lock, your access request is probably still pending. Make sure you completed the email verification (click the blue button in the verification email), and double‑check that the approval email has arrived. If it’s still pending after a reasonable wait, raise a support ticket with Microsoft or check your organisation’s admin settings to ensure you’re allowed to request gated models.

Why does my preferred region show “no quota”?

New models often roll out incrementally. Some regions have capacity early, others follow. If your preferred region has no quota, pick an available region (as in the video example with East US 2) so you can get moving. You can also request a quota increase or wait for capacity to open. Keep an eye on your cost and latency needs—sometimes using a nearby region is perfectly fine for prototypes and internal tools.

How do tokens per minute relate to my costs?

TPM doesn’t directly set your cost; it’s a throttle. Your actual costs are driven by total tokens processed (input + output) multiplied by the model’s pricing. Setting TPM helps prevent runaway usage in case of traffic spikes or code loops. Start lower, monitor, and adjust as needed.
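For illustration, a quick sketch of that cost calculation; the prices here are made-up placeholders, so check the Azure pricing page for actual GPT‑5 rates:

```python
def estimate_cost(input_tokens: int, output_tokens: int,
                  price_in_per_1k: float, price_out_per_1k: float) -> float:
    """Cost is tokens processed x unit price; TPM only caps the rate, not the bill."""
    return (input_tokens / 1000) * price_in_per_1k + (output_tokens / 1000) * price_out_per_1k

# Hypothetical month: 2M input tokens, 500K output tokens at placeholder prices.
print(f"${estimate_cost(2_000_000, 500_000, 0.005, 0.015):,.2f}")  # $17.50
```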

My deployment is stuck or taking ages. Should I cancel?

Provisioning can take a bit under load. Unless it’s clearly failed with an error, give it time. If it doesn’t complete, check Azure Service Health for any known incidents, verify your subscription limits, and try again or choose another region with capacity.

Where do I find the endpoint and credentials?

Open your deployment in Azure AI Foundry and look for the endpoint details section. You’ll see the base URL and deployment name you’ll use in your API calls. Authentication is usually via a key or Microsoft Entra ID; ensure your team’s security policies are followed for storing and rotating secrets.

Can I deploy multiple variants (e.g., GPT‑5 and Nano/Pro) side‑by‑side?

Yes. Many teams run a few variants: a heavier model for premium features, and a lighter one for quick or low‑cost tasks. Just keep an eye on quotas and make sure you have sufficient capacity. Use different deployment names so you can route requests as needed.
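If you go down that route, the routing logic can be as simple as mapping task types to deployment names. A sketch with hypothetical names:

```python
# Hypothetical deployment names -- use whatever convention your team adopts.
DEPLOYMENTS = {"premium": "gpt5-prod-chat", "light": "gpt5-nano-prod"}

def pick_deployment(task_type: str) -> str:
    """Route business-critical work to GPT-5, routine work to a lighter variant."""
    return DEPLOYMENTS["premium"] if task_type in {"chat", "analysis"} else DEPLOYMENTS["light"]

print(pick_deployment("chat"))           # gpt5-prod-chat
print(pick_deployment("summarise_log"))  # gpt5-nano-prod
```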

Practical tips for a smooth deployment

1) Name your deployments clearly

Adopt a naming convention that includes the model, environment, and purpose—something like “gpt5‑prod‑chat” or “gpt5‑dev‑experiments”. It makes it easier to track usage and direct requests appropriately in code.

2) Separate environments (dev/test/prod)

Use separate deployments (and even separate workspaces or subscriptions if your governance requires it) for development and production. That way, you can tune prompts without risking production stability or budget.

3) Decide on your token policy upfront

Estimate average prompt and response sizes, then set TPM so you’re within your budget even under peak. For example, if you expect 20 requests per minute at ~500 tokens each, a 10,000 TPM limit might be reasonable. If you’re experimenting, dial it lower and only increase when you see throttling and you’re comfortable with the spend.

4) Monitor usage and errors from day one

Plug your application logs into a dashboard and keep an eye on 429 (rate limit), 5xx (service) and 4xx (request) errors. Combine that with cost monitoring in Azure Cost Management so you can correlate behaviour with spend.

5) Plan for retries and fallbacks

Rate limits and transient errors happen. Implement exponential backoff on retries, and consider a lightweight fallback model for non‑critical requests. For user‑facing features, graceful degradation beats hard failures.
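A minimal backoff sketch; in real code, catch your SDK’s specific rate-limit and service-error types rather than the bare Exception used here:

```python
import random
import time

def call_with_backoff(send, max_retries: int = 5):
    """Retry a callable on transient failures with jittered exponential backoff."""
    for attempt in range(max_retries):
        try:
            return send()
        except Exception:  # narrow this to RateLimitError / 5xx errors in practice
            if attempt == max_retries - 1:
                raise
            time.sleep((2 ** attempt) + random.uniform(0, 1))  # 1s, 2s, 4s... plus jitter
```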

6) Keep security tight

Use Azure Key Vault or your organisation’s secret manager for API keys. Apply role‑based access control to limit who can deploy or change rate limits. If you’re working with sensitive data, review data handling policies for the region you selected.
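For example, pulling the key from Key Vault at startup with the Azure SDK (the vault URL and secret name below are placeholders):

```python
from azure.identity import DefaultAzureCredential  # pip install azure-identity
from azure.keyvault.secrets import SecretClient    # pip install azure-keyvault-secrets

# Placeholder vault URL and secret name -- substitute your own.
credential = DefaultAzureCredential()
secrets = SecretClient(vault_url="https://my-vault.vault.azure.net", credential=credential)
api_key = secrets.get_secret("gpt5-api-key").value
```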

7) Consider prompt caching and batching

Caching frequent prompts (where appropriate) or batching multiple small requests can reduce costs and smooth out spikes. Just make sure you preserve privacy and comply with your data retention policies.
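A bare-bones caching sketch, only appropriate where identical prompts should get identical answers and no personal data is involved:

```python
import hashlib

_cache: dict[str, str] = {}

def cached_completion(prompt: str, send) -> str:
    """Return a cached response for a previously seen prompt, else call the model."""
    key = hashlib.sha256(prompt.encode("utf-8")).hexdigest()
    if key not in _cache:
        _cache[key] = send(prompt)  # send() wraps your actual model call
    return _cache[key]
```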

What you’ll see at each stage (mapped to the screenshots)

Here’s a quick refresher on what each screenshot in the video represents, so you know you’re on the right path as you follow along.

  • Initial sign‑in and navigation to the deployment area—this is your starting point in Azure AI Foundry. You click Deploy model from the left‑hand menu items to get going.
  • Model list with GPT‑5 showing a lock icon—indicates the model is gated and requires an access request. You’ll click through to submit your details, including your Azure Subscription ID.
  • Email verification step—look for the blue button in your inbox from Cognitive Services Gating Support. Clicking it confirms your email address so your request can be processed.
  • Post‑approval, GPT‑5 is visible in the Deploy base model list—you might also see other variants like Nano or Pro. The UI may show an indicative capacity figure such as tokens per minute.
  • Region selection and quota visibility—some regions show no quota and are unavailable. In the example, East US 2 had capacity and was selected. You can adjust TPM within your quota.
  • Provisioning in progress—the deployment may take a little while, especially under heavy demand. Once it completes, you’ll see the endpoint details to use in your integrations.

Testing your deployment

Once you’ve got the endpoint and deployment name, it’s time to test. You can do this in a few ways:

  • Use the built‑in testing tools inside Azure AI Foundry (if available for your workspace). These often let you send a prompt and view the response without leaving the portal.
  • Spin up a small test script in your favourite language using the Azure SDK or REST API (there’s a minimal sketch after this list). Hard‑code a prompt, call the endpoint, and print the response to confirm everything’s working.
  • Hook it into a Postman collection or an API client you trust. It’s a quick way to iterate whilst you fine‑tune your headers, auth, and body.
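If you go the script route, here’s a minimal REST sketch with Python’s requests library. The API version is a placeholder, and the deployment name should match whatever you chose in the wizard:

```python
import os
import requests  # pip install requests

endpoint = os.environ["AZURE_OPENAI_ENDPOINT"]  # e.g. https://<resource>.openai.azure.com
deployment = "gpt5-prod-chat"                   # your deployment name from the portal

resp = requests.post(
    f"{endpoint}/openai/deployments/{deployment}/chat/completions",
    params={"api-version": "2024-10-21"},       # placeholder; use the version your portal shows
    headers={"api-key": os.environ["AZURE_OPENAI_API_KEY"]},
    json={"messages": [{"role": "user", "content": "ping"}]},
    timeout=30,
)
resp.raise_for_status()
print(resp.json()["choices"][0]["message"]["content"])
```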

If you see rate limit errors out of the gate, double‑check your TPM limit and make sure you’re not unintentionally sending multiple requests in a loop. If you see authentication errors, confirm you’re using the right key and that the key hasn’t expired or been rotated.

Cost and performance guardrails

New models like GPT‑5 can deliver advanced capabilities, but it’s smart to put a few guardrails in place from day one:

  • Set budgets and alerts in Azure Cost Management. It’s easy to do, and it gives you early warnings if something unexpected happens.
  • Log token usage per request. That visibility is gold when you’re trying to reduce prompt sizes or tune response lengths.
  • Enforce sensible maximums on input and output tokens in your code. Users have a habit of pasting entire documents—best to put a cap in place (see the sketch after this list).
  • Introduce caching where safe. If your application frequently sends the same context, cache and reuse results to reduce spend and latency.
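On the token-cap point, even a crude character-based guard helps. This sketch assumes roughly four characters per token, which is only an approximation:

```python
MAX_INPUT_TOKENS = 4_000    # illustrative caps -- tune to your budget and use case
MAX_OUTPUT_TOKENS = 1_000
CHARS_PER_TOKEN = 4         # rough rule of thumb; use a real tokeniser for accuracy

def cap_input(prompt: str) -> str:
    """Truncate oversized input before it ever reaches the model."""
    return prompt[: MAX_INPUT_TOKENS * CHARS_PER_TOKEN]

# Also pass an output ceiling (e.g. max_tokens / max_completion_tokens,
# depending on API version) of MAX_OUTPUT_TOKENS on each request.
```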

Scaling and future‑proofing

As your usage grows, you’ll likely need to revisit region selection, quotas, and rate limits. A few pointers:

  • Watch service health and regional announcements—capacity shifts over time as Microsoft adds more hardware and new regions come online.
  • Batch non‑urgent work. For scheduled tasks, pick off‑peak windows to reduce contention and the risk of throttling.
  • Consider a multi‑deployment strategy. For example, keep a “premium” GPT‑5 deployment for business‑critical tasks and a lighter model for routine work.
  • Document your configuration. Capture the deployment name, region, TPM, and quota history so future team members can understand your setup quickly.

Summary of the step‑by‑step process

  1. Sign in to Azure AI Foundry and open Deployments.
  2. Click Deploy model and find GPT‑5 in the model list.
  3. If it’s locked, submit the access request with your contact details and Azure Subscription ID.
  4. Verify your email by clicking the blue button in the verification email.
  5. Wait for the onboarding/approval email confirming your access.
  6. Return to Deploy model → Deploy base model and select GPT‑5.
  7. Choose your region (select one with available quota, e.g., East US 2 if that’s what you see).
  8. Set your tokens‑per‑minute limit to match your needs and budget.
  9. Click Create resource and deploy; wait for provisioning to complete.
  10. Open the deployment to retrieve the endpoint and deployment name, then integrate with your app.

Wrapping up

That’s the full tour for adding GPT‑5 into Azure AI Foundry the same way it’s shown in the video. The only slightly tricky bit is the initial access request—once you’ve clicked the verification link and received approval, the rest follows the usual Azure AI deployment flow. Regions and quotas will ebb and flow, so if your first choice isn’t available, pick a region that is and get your prototype running now.

If this guide helped you get set up, do share it with a mate or your team—it’ll save them a bit of back‑and‑forth hunting for the right buttons. And if you’re keen for more tips on practical AI deployments, consider subscribing to the channel from the video and keeping an eye on updates as new model variants land.

Happy building, and enjoy exploring what GPT‑5 can do in your projects.
