Link Extractor

Paste HTML to extract all internal and external links with anchor text, href, and rel attributes. Perfect for SEO link audits and content analysis.


Paste HTML above to extract links.
No external links found.
No internal links found.

Frequently Asked Questions

In your browser, right-click the page and select "View Page Source" (Ctrl+U on Windows, Cmd+U on Mac). You can then Select All (Ctrl+A), copy, and paste the full HTML into this tool. Alternatively, use browser DevTools → Network tab → click the page request → Response tab.

A rel="nofollow" link tells search engines not to pass PageRank (link equity) through that link. Google introduced this in 2005 to combat comment spam. Other rel values include sponsored (for paid links) and ugc (user-generated content). All three variants are treated similarly by Google.

Anchor text is one of Google's ranking signals. Links with descriptive, keyword-rich anchor text (e.g., "best running shoes") pass more topical context than generic anchors like "click here". Analyzing your internal link anchor text distribution helps ensure important pages receive contextually relevant internal links.

Google's John Mueller has said Google crawls all links on a page, but there is no strict limit. However, for usability and crawl budget reasons, most SEO experts recommend keeping it under 100 internal links per page. Having too many links dilutes the PageRank passed through each individual link.

Internal links (links between pages on the same domain) distribute PageRank throughout your site, help search engines discover and crawl new pages, and establish a content hierarchy. Pages with more internal links pointing to them typically rank higher because they accumulate more internal link equity. External links (links to other domains) signal trust and authority to Google — linking to high-quality, relevant sources is a positive quality signal. Outbound links do not directly harm your SEO, but linking to low-quality or spammy sites can. Use rel="nofollow" or rel="sponsored" for paid or untrusted external links.

A dofollow link (the default with no rel attribute) tells Google to follow the link and pass PageRank (link equity) from the linking page to the destination. A nofollow link (rel="nofollow") was originally a hard instruction not to follow or pass PageRank, but Google now treats it as a hint — it may still crawl and partially credit nofollow links. Additional rel values: sponsored — for paid/affiliate links; ugc — for user-generated content (comments, forum posts). Use the correct rel attribute for link transparency. Abusing dofollow links for paid placements without proper disclosure violates Google's link spam policies.

PageRank flows from high-authority pages to the pages they link to, distributed equally across all outbound links. A homepage with many external backlinks accumulates high PageRank — internal links from the homepage to category pages pass some of that authority downstream. Deeply nested pages (e.g., 5 clicks from the homepage) receive very little inherited authority. To maximize link equity for important pages: link to them from high-traffic pages, keep the link depth shallow (2–3 clicks from the homepage), use descriptive anchor text, and include them in your main navigation. Pages with many internal links pointing to them accumulate more internal PageRank.

Link extraction enables several critical SEO audit tasks: Broken link detection — extract all links then check each URL for 404/500 errors; Redirect chain identification — find links pointing to URLs that redirect instead of directly to the final destination; Orphaned page discovery — identify important pages that have no internal links pointing to them; Anchor text analysis — audit whether internal links use descriptive, keyword-rich anchor text; Nofollow audit — verify that UGC, sponsored, and affiliate links have correct rel attributes; External link quality review — identify links to low-quality, hacked, or irrelevant external sites that could harm your site's reputation.

Broken links (returning 404 or other errors) cause three problems: Wasted crawl budget — Googlebot wastes crawl time following links to non-existent pages instead of discovering new content; PageRank leak — links pointing to broken internal pages "waste" the link equity that could have been passed to working pages; Poor user experience — users who click broken links leave your site, increasing bounce rate. For external broken links, the impact is mainly user experience — Google does not penalize for linking to pages that later break. Fix internal broken links promptly. Use 301 redirects to reclaim PageRank from internally broken URLs that previously had inbound links.

There is no strict Google limit on outbound links per page. Google has confirmed that linking out to high-quality, relevant sources is a positive quality signal — it demonstrates thorough research and improves user experience. The concern is not the count but the quality and relevance of linked destinations. Avoid linking to low-quality, spammy, or irrelevant sites. For most content pages, 5–20 outbound links to authoritative sources is natural and beneficial. Pages with hundreds of outbound links (link farm patterns) do attract algorithmic scrutiny. Focus on link quality: one link to a respected source is worth more than ten links to low-quality sites.

About This Link Extractor

This free link extractor fetches any public URL and lists all hyperlinks found on the page — anchor text and destination URL for each. Internal and external links are shown together for a complete view of the page's link profile.

Extracting and auditing all links on a page is a common SEO and QA task: it reveals broken links, missing rel="noopener" on external links, and unexpected outbound links introduced by CMS plugins or templates.

When to use this tool

  • Finding broken outbound links before a page goes live
  • Auditing internal linking structure across key pages
  • Checking that external links use rel="noopener"
  • Extracting all URLs from a page for a link-building analysis

Standards & References

How It Works

Paste HTML

Paste the HTML source of any web page. Enter your base URL for accurate internal vs external classification.

DOM Parsing

JavaScript parses the HTML using a DOMParser, extracts all anchor tags, and collects href, anchor text, rel, target, and title attributes.

Sorted Results

Links are classified as internal or external, sorted, and displayed with all attributes. Filter by tab to focus on internal or external links.

Common Use Cases

Internal Link Audit

Check that important pages have enough internal links pointing to them, and that anchor text is descriptive rather than generic "click here" links.

External Link Review

Audit outbound links to ensure you are not linking to low-quality or broken external sites, and that sponsored links have proper rel="sponsored" attributes.

Nofollow Audit

Verify that user-generated content links, affiliate links, and sponsored links have the correct rel attributes to comply with Google's link spam policies.

Content Migration

Extract all links from a page before migrating content to identify internal links that will need to be updated to point to new URLs after the migration.

Competitor Analysis

Extract links from competitor pages to discover which resources they reference externally, potentially revealing link-building opportunities.

Broken Link Detection

Export the full list of links from a page to check for broken links (404s) that may harm user experience and waste your crawl budget.

Related Articles

View all articles
Link Rot: Why Links That Worked at Publish Time Quietly Stop Working, and How to Catch It

Link Rot: Why Links That Worked at Publish Time Quietly Stop Working, and How to Catch It

A link extractor showing 200 links tells you they exist — not whether 30 of them now lead nowhere. Link rot accumulates silently because broken outbound links produce no visible error on the page containing them, and external sites break links for reasons entirely outside your control. Here's why this requires recurring (not one-time) checks, the different fix approaches depending on why a link broke, and the particularly tricky case of links that "work" but now redirect to unrelated content.

Jun 17, 2026
Nofollow, Sponsored, and UGC: The Three Link Attributes and What Each Actually Signals to Search Engines

Nofollow, Sponsored, and UGC: The Three Link Attributes and What Each Actually Signals to Search Engines

Since 2019, "nofollow" has been joined by rel="sponsored" and rel="ugc" — three distinct attributes that can be combined, and all treated as "hints" rather than absolute directives. Here's what each attribute signals, why affiliate links specifically should carry rel="sponsored", how UGC sections typically handle this by platform default, and why these attributes generally don't belong on internal links.

Jun 17, 2026
External Link Auditing and Toxic Links: How to Run a Backlink Audit and Use the Disavow Tool Correctly

External Link Auditing and Toxic Links: How to Run a Backlink Audit and Use the Disavow Tool Correctly

Most "toxic" backlinks are simply ignored by Google, not penalised. The Disavow Tool is often misused to surrender legitimate link equity. Here's how to run a real backlink audit, what actually warrants disavowal, how to structure a disavow file, and why most sites rarely need it.

Jun 10, 2026
Internal Linking Strategy: How to Distribute PageRank Effectively Across Your Site

Internal Linking Strategy: How to Distribute PageRank Effectively Across Your Site

Most internal linking patterns developed accidentally, not by design. Here's how PageRank flows through internal links, the hub-and-spoke model, how to find underlinked important pages, and anchor text strategy for distributing authority effectively.

Jun 8, 2026
Link Extractor — See Every Link on Any Web Page & Audit Your SEO Structure

Link Extractor — See Every Link on Any Web Page & Audit Your SEO Structure

Learn why links matter for SEO, how anchor text, rel attributes, and internal vs. external links affect rankings, and how to use a free link extractor to audit any page's link structure.

Jun 6, 2026