Technical SEO

International SEO & hreflang: The Complete Guide (2026)

67% of websites using hreflang have it broken — and a single missing return tag can erase your entire international setup. This is the 2026 guide that gets language, region and hreflang right, with real Gulf and Egypt examples.

You did everything right. You translated every page into flawless Arabic, launched in Saudi Arabia, Egypt and the UAE, and waited for the traffic. Instead, Google showed your English page to a shopper in Riyadh, served your Egyptian prices to a buyer in Dubai, and quietly buried half your work as “duplicate content.” Months of effort, and the search engine seems determined to show the wrong version to everyone.

Here is the uncomfortable truth almost nobody says out loud: 67% of websites using hreflang have it broken, and a single missing return tag can make Google ignore your entire international setup. International SEO is not hard because the rules are obscure — it is hard because the rules are precise, unforgiving, and constantly misexplained by people who treat hreflang as a magic ranking lever it was never designed to be. This is the guide that gets it right.

What international SEO really is (and what it is not)

International SEO is the practice of optimizing your website so search engines serve the right language and country version of your content to the right user. When a shopper in Jeddah searches, she should land on your Saudi page — Arabic, with the right currency, shipping and dialect. When a buyer in Cairo searches the same product, he should land on your Egyptian edition. And when someone in a market you have not specifically targeted searches, they should get a sensible default rather than a confused mix.

That sounds simple. In practice it is a coordination problem across four moving parts that must agree with each other:

  • Site structure — whether your markets live on country domains (example.sa), subdirectories (example.com/ar-sa/), or subdomains (sa.example.com).
  • Hreflang annotations — the signals that tell Google which page is the alternate for which language and region.
  • Localized content and keyword research — genuinely adapted pages, not just translated strings, targeting the terms each market actually searches.
  • Geotargeting and crawl health — the technical SEO layer that makes every version crawlable, indexable and free of self-inflicted duplicate-content problems.

When all four agree, Google can confidently match user to page. When even one disagrees — say, your hreflang points to Saudi but your canonical tag points everything back to the US page — Google resolves the conflict in ways you did not intend, and your localized work disappears.

The hreflang truth nobody tells you

Almost every article you have read about hreflang gets the central point wrong, and it costs businesses real money. So let me state it plainly, in Google’s own words.

Google doesn’t use hreflang or the HTML lang attribute to detect the language of a page; instead, we use algorithms to determine the language.

Google Search Central — Localized Versions of your Pages Official Google documentation

Read that again. Google does not use your hreflang tags to figure out what language a page is in. It reads the actual content and decides for itself. Hreflang does exactly one job: it tells Google, “when you are about to show this page to a user, here are the alternate versions you could swap in instead, depending on the user’s language and country.” It is a hint about which alternate to display — not a command, not a ranking factor, and not a language declaration.

This is the single most misunderstood point in the entire topic, and it is the number-one content gap among top-ranking competitors who frame hreflang as a lever that pushes pages up the rankings. It is not. Backlinko says it cleanly.

Hreflangs do not guarantee higher rankings. Use them as part of a broader strategy.

Backlinko — International SEO: How to Dominate Global Rankings Backlinko team

Why does this matter so much in practice? Because several other signals can quietly override your intended version no matter how perfect your hreflang is:

  • Canonical tags. If your Egyptian page canonicalizes to your US page, you have told Google the Egyptian page is a duplicate that should not be indexed in its own right — and hreflang cannot save it.
  • Which pages are actually indexed. Hreflang can only swap in a version Google has crawled and indexed. An alternate stuck in “Discovered — currently not indexed” will never appear.
  • Content similarity and site structure. If your “Saudi” and “UAE” pages are byte-for-byte identical, Google may collapse them; if your structure sends contradictory geo signals, Google trusts the structure over the tag.

The mental model to hold: hreflang is a polite suggestion at the swap-in moment, and almost everything else is louder. Get the foundations right first — indexation, canonicals, genuinely distinct localized content — and then hreflang does its narrow, valuable job well.

That last point deserves emphasis because it is where most setups quietly die. Helen Pollitt, Head of SEO at Getty Images, names it as the most common failure she sees.

A common issue that can be seen with hreflang tags is that they are not formatted to reference the other pages that are, in turn, referencing them.

Search Engine Journal — Ask An SEO: Common Hreflang Mistakes Helen Pollitt, Head of SEO at Getty Images

ccTLD vs subdirectory vs subdomain: choosing your foundation

Before you write a single hreflang tag, you choose how your markets are organized on the web. This decision is hard to reverse, so it deserves real thought. There are three valid patterns, and each sends a different geotargeting signal while carrying different costs.

A country-code top-level domain (ccTLD) is a separate domain per country — example.sa for Saudi Arabia, example.ae for the UAE, example.com.eg for Egypt. It is the strongest possible geo signal: Google reads the domain itself as “this is for Saudi Arabia.” The cost is steep. Each domain starts its authority from zero, so you fragment your link equity across many weak sites instead of building one strong one, and the legal, hosting and maintenance overhead multiplies.

A subdirectory keeps everything on one domain with a folder per market — example.com/ar-sa/, example.com/ar-eg/, example.com/en/. All your authority and backlinks consolidate under a single domain, it is the easiest pattern to scale, and you set geotargeting per folder where needed. For the large majority of businesses, this is the right answer.

A subdomain uses a prefix — sa.example.com. Google tends to treat subdomains as semi-separate sites, so authority does not flow as cleanly as with subdirectories. Subdomains earn their place mainly when technical or platform constraints force them.

StructureExampleGeo signalAuthorityBest for
ccTLDexample.sa, example.aeStrongestFragmented across domainsBig brands with budget and a long-term per-country commitment
Subdirectoryexample.com/ar-sa/Set via Search Console / hreflangConsolidated under one domainMost businesses — best balance of power and cost
Subdomainsa.example.comSet per subdomainPartly separatedPlatform or hosting constraints force it

One practical note on geotargeting: with subdirectories or subdomains you can no longer use the old “International Targeting” country setting the way ccTLDs imply it automatically — you lean on hreflang plus genuinely localized content and signals. Set up separate Google Search Console properties per market so you can monitor each edition’s performance and catch problems early. This monitoring discipline is the same one I apply on every e-commerce SEO build, because international stores break in market-specific ways that a single combined property hides.

Three ways to implement hreflang

Hreflang can live in three places. All three are valid; you pick based on your platform and scale. Whatever you choose, use only one method — duplicating the same annotations across two methods invites conflicts.

You place link tags in the head of every page, one per alternate, including a self-referencing tag and an x-default. This is the most common method and the easiest to read, but it adds weight to every page and is painful to maintain by hand at scale. Conceptually, the German page for a three-market site lists: the German URL (self), the English URL, the Arabic-Saudi URL, and an x-default — and every one of those pages lists the same full set back.

Method 2 — HTTP headers

For non-HTML files like PDFs, you cannot add a link tag to a head that does not exist. Instead you return hreflang relationships in the HTTP response headers. This is the right tool for documents, downloadable catalogs and similar assets, and it works for HTML too — but it is harder to inspect and debug.

Method 3 — XML sitemap

You declare every page’s alternates inside your XML sitemap using the xhtml:link markup. This keeps your page head clean, centralizes all annotations in one auditable file, and is by far the best method at scale — think a catalog with thousands of products across several markets. The tradeoff is that the sitemap markup is verbose and must be generated programmatically; doing it by hand invites exactly the reciprocity errors that void clusters.

MethodWhere it livesBest forWatch out for
HTML head link tagsThe head of each pageSmall to mid-size sitesPage weight; manual maintenance errors
HTTP headerServer responsePDFs and non-HTML filesHarder to inspect and debug
XML sitemapYour sitemap fileLarge sites and stores at scaleVerbose; must be generated, not hand-written

Language and region codes: get them exactly right

Hreflang values are built from two parts: a language code and an optional region code. The format is language or language-REGION — for example en, en-GB, ar, ar-SA. Google is strict about which codes it accepts, and it fails silently when you get them wrong, which is what makes this so dangerous.

  • Language must be an ISO 639-1 two-letter code: en, ar, fr, de, es.
  • Region must be an ISO 3166-1 Alpha-2 two-letter code: SA (Saudi Arabia), EG (Egypt), AE (UAE), GB (United Kingdom), US (United States).

Two rules trip people up constantly. First, you cannot target a region without a language — there is no such thing as a region-only hreflang value. Second, made-up or non-standard codes are silently ignored. Google is explicit:

Only language codes listed in ISO 639-1 and region codes listed in ISO 3166-1 Alpha 2 are supported; other codes that aren’t listed in those standards, such as es-419, aren’t supported.

Google Search Central — Localized Versions of your Pages Official Google documentation

The most infamous fatal typo is using UK for the United Kingdom. The country exists, the intuition is reasonable — and it is wrong. The correct ISO 3166-1 code is GB. Write en-UK and Google ignores it. The same trap catches anyone who invents regional groupings like ar-419, ar-MEA or ar-GULF: those are not valid region codes, so the entire annotation is discarded without warning.

Intended targetCorrect hreflangCommon wrong versionWhy it fails
United Kingdom (English)en-GBen-UKUK is not an ISO 3166-1 code; GB is
Saudi Arabia (Arabic)ar-SAar-SAU / ar-KSARegion must be exactly two letters
Egypt (Arabic)ar-EGar-EGYRegion must be exactly two letters
Gulf region (Arabic)one tag per country (ar-SA, ar-AE, …)ar-GULF / ar-MEANo such region code exists; silently ignored
Latin America (Spanish)per-country codeses-419419 is not ISO 3166-1 Alpha-2
Global fallbackx-defaultomitting it56.3% of sites skip it; untargeted users get the wrong page

Arabic, the Gulf and the RTL reality

This is where international SEO gets specific for the markets Eman serves every day — Egypt, Saudi Arabia and the wider Gulf — and where the generic guides fall apart. Roughly three quarters of Arabic-world e-commerce and SaaS spend is concentrated in the GCC countries plus Egypt, which is exactly the audience for country-level Arabic targeting. Getting Arabic hreflang right is not a nicety here; it is the difference between owning a market and being invisible in it.

When to use ar versus ar-SA, ar-EG, ar-AE

Use the bare ar only if you serve one identical Arabic edition to every Arabic-speaking user. The moment your editions differ — by pricing, currency, dialect, shipping, or legal terms — you need country-level codes:

  • ar-SA — Saudi Arabia
  • ar-EG — Egypt
  • ar-AE — United Arab Emirates

A Saudi shopper and an Egyptian shopper genuinely need different pages: different prices, different payment methods, different delivery promises, sometimes different phrasing entirely. Country-level hreflang lets Google show each the right one.

RTL needs the language and direction set at the HTML level

Arabic is read right-to-left, and Google needs that signaled cleanly. Set the language and direction on the html element itself — the lang and dir attributes — not only in CSS. The dir="rtl" attribute at the document root keeps your content parsing, indexing and rendering correctly; relying on CSS alone leaves the structural signal missing. Clean RTL markup is a small detail that compounds across thousands of pages.

Do not ship raw machine translation

The fastest way to sabotage an Arabic launch is to run pages through Google Translate and publish the output. Google’s quality systems detect unedited machine translation, and thin auto-translated pages can suppress rankings across all your language versions, not just the Arabic one.

Ready to outrank your competition?Get a free consultation and a custom growth plan within 24 hours.
Get your free consultation

The 7 mistakes that break everything

Hreflang is, by a wide margin, the most error-prone area of technical SEO. The largest study ever conducted on it — Ahrefs’ analysis of 374,756 domains, roughly ten times bigger than any prior study — found that more than two thirds of domains using hreflang have at least one issue. An earlier Search Engine Land study put the figure at 31%. The range itself tells you something: even the experts disagree on exactly how bad it is, and everyone agrees it is bad.

67%+of hreflang domains have an issue
56.3%missing x-default
16.9%point to broken/redirected pages
15.3%missing reciprocal return tags

Google’s John Mueller summed up the whole problem in one sentence.

Hreflang is complex and hard to get right. It can break in so many different ways.

Ahrefs — Over 67% of Domains Using Hreflang Have Issues quoting John Mueller, Google

Here are the seven that account for nearly all of it:

  1. Missing reciprocal return tags. If page A points to page B but B does not point back, Google ignores the relationship. 15.3% of domains have this. It is the silent cluster-killer.
  2. Missing self-referencing tags. Every page must include a tag pointing to itself. 18% of domains skip it, weakening the whole annotation set.
  3. Pointing to broken or redirected URLs. 16.9% of domains point hreflang at pages that 404 or redirect. Always annotate the final, live, indexable URL.
  4. Missing x-default. The most common single error at 56.3%. Without it, users in untargeted markets get whatever Google guesses — often the wrong page.
  5. Invalid codes. en-UK instead of en-GB, invented codes like ar-GULF, region without language. Silently ignored.
  6. Canonical and hreflang fighting each other. A page that canonicalizes to a different-language version is telling Google “ignore me” while hreflang says “swap me in.” Canonical wins, and your localized page loses.
  7. Mixing implementation methods. Declaring hreflang in both the head and the sitemap with any inconsistency creates conflicts. Choose one method and keep it the single source of truth.

A do-this-today workflow

You do not boil the ocean. The teams that get international SEO right launch deliberately, one market at a time, and validate relentlessly. Backlinko’s advice matches exactly how I run these projects.

Start with one region. Focus on getting it right. Fine-tune your approach and workflows.

Backlinko — International SEO: How to Dominate Global Rankings Backlinko team

Here is the sequence:

  1. Decide your structure first. ccTLD, subdirectory or subdomain — for most businesses, subdirectories. Lock it in before building anything, because it is expensive to reverse.
  2. Do localized keyword research per market. Do not assume Saudi and Egyptian searchers use the same terms or dialect. Research each market natively; this is the heart of real SEO services.
  3. Build genuinely distinct localized pages. Human-edited content, correct currency, local payment and shipping, RTL lang and dir for Arabic. Distinct enough that Google does not collapse them.
  4. Implement hreflang with one method. Head tags for smaller sites, XML sitemap at scale. Include self-referencing tags and x-default on every cluster.
  5. Align canonicals. Each localized page canonicalizes to itself, never to another language version.
  6. Validate before and after launch. Crawl with Screaming Frog, audit with Ahrefs Site Audit or Semrush, and review the Google Search Console international and indexing reports. Confirm reciprocity, codes, x-default and live targets.
  7. Set up per-market GSC properties and recurring crawls. Hreflang breaks silently during template, URL and migration changes. Schedule crawls and alerts so you catch regressions before they cost rankings.
ToolWhat it checksWhen to use it
Screaming FrogCrawl-level hreflang, reciprocity, broken targetsPre-launch and recurring crawls
Ahrefs Site Audithreflang issues at scale, missing x-defaultOngoing monitoring
SemrushInternational issues alongside the full auditOngoing monitoring
Google Search ConsoleInternational / indexing reports per marketContinuous, with per-market properties

What actually gets your international pages ranking

Here is the part the hreflang-obsessed guides bury: a flawless hreflang setup over weak content still loses. Hreflang decides which already-ranking version gets shown — it does not make pages rank. What makes them rank is genuinely localized content, topical authority and links, on a technically sound foundation. And the 2026 algorithm landscape has made that more true than ever.

The March 2025 Core Update rolled out globally from March 13 to 27 across all regions and languages, weighting topical authority and punishing thin or outdated content. Google reiterated through 2025 that hreflang remains a hint even when present. AI Overviews and the rise of regional and AI-assisted search add a new layer on top, but classic hreflang still governs which localized page surfaces in the traditional SERPs that dominate Gulf search volume — and those SERPs still run almost entirely on Google.

89.85%Google's global search share (Mar 2026)
Mar 13–27 2025global Core Update window
~75%of Arabic-world e-com/SaaS spend in GCC + Egypt
374,756domains in the largest hreflang study

What this looks like when the foundation and the localized content work together is the difference between invisibility and market leadership. A few results from my own work, every number independently verifiable in Google Search Console:

  • Roseberry (Saudi Arabia) climbed from roughly 25 impressions a day to 51.5M impressions and 545K clicks over 16 months — the payoff of a sound technical and content foundation built for the Saudi market specifically.
  • A niche store in Saudi Arabia, crushed by technical debt and missing localization, was rebuilt and driven to #1 in Saudi search in 166 days — proof that genuine localization, not translated strings, is what wins Gulf markets.
  • Conscent grew from 61K to 1.2M impressions once the technical structure and localized content were in place.
  • Oxford (Egypt) is an example of the same playbook applied to the Egyptian market, where local keyword research and country-level targeting do the heavy lifting.

In each case, hreflang and structure were the enabling layer — the thing that made sure the right user saw the right page — while the rankings themselves were earned by content built for that specific market. That is the real model of international SEO: structure and hreflang remove the confusion, and localized excellence does the winning.

If you are expanding into Egypt, Saudi Arabia or the Gulf and you want the structure, the hreflang and the localized content to actually agree with each other — the way they must for Google to serve the right page to the right person — that is exactly the work I do.

Ready to outrank your competition?Get a free consultation and a custom growth plan within 24 hours.
Get your free consultation

Frequently asked questions

These are the questions clients ask me most often about international SEO and hreflang. The full set is also indexed for search at the top of this page.

What is international SEO? It is optimizing your site so search engines serve the right language and country version to each user — combining site structure, hreflang, localized content and keyword research, and geotargeting, so a Saudi user sees your ar-SA page and an Egyptian user sees ar-EG.

Does hreflang improve my rankings? No, not directly. It influences which version of an already-ranking page is shown to each user. Strong localized content, topical authority and links are what move you up.

ccTLD, subdirectory or subdomain? ccTLDs give the strongest geo signal but fragment authority and cost the most; subdirectories consolidate authority and scale best — the right answer for most; subdomains behave like separate sites. Pick one and stay consistent.

Which Arabic code — ar, ar-SA or ar-EG? Bare ar only if you serve one identical Arabic edition. Use ar-SA, ar-EG, ar-AE when editions differ by price, dialect, shipping or legal terms. There is no valid ar-GULF or ar-MEA.

What is x-default and do I need it? It is the fallback for users in markets you have not targeted. Yes — and 56.3% of sites skip it, which is why untargeted users so often see the wrong page.

Can I just use Google Translate? No. Google detects unedited machine translation, and thin auto-translated pages can suppress all your language versions. Localize with human editing, especially in Arabic.

It is really important to get this right — as Helen Pollitt notes, hreflang errors leave search engines “confused over which version of a page to show to users in the SERPs.” Confusion is the enemy of international SEO, and precision is the whole game.

Ranking #1 in Egypt & Saudi Arabia

Ready to rank #1 on Google?

Get a free consultation and a custom growth plan for your project within 24 hours.