What Is URL Extraction?
URL extraction is the process of scanning a block of text and automatically pulling out every web address it contains. Instead of reading through a wall of text and copying links manually, a URL extractor uses pattern matching to identify anything that looks like a valid URL (https://, http://, or even bare www. addresses) and returns them as a clean, separate list. This is especially useful when you are working with raw HTML, exported documents, scraped webpage source code, or any large body of text where URLs are buried among other content.
How the Tool Works
The URL Extractor on TextSorter.com applies a regular expression (regex) pattern against your pasted text. The pattern recognizes standard URL structures, including:
- Full URLs starting with https:// or http://
- Addresses starting with www.
- URLs with query strings and fragment identifiers (e.g., ?id=123#section)
- URLs embedded inside HTML attributes such as href="..." and src="..."
Every match is extracted and presented one per line, ready to copy, download, or pipe into your next workflow step.
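The matching step can be sketched with a simplified regex in Python. This is an illustration of the general approach, not the exact pattern TextSorter.com uses:

```python
import re

# Simplified pattern: matches http(s):// URLs and bare www. addresses,
# including query strings (?id=123) and fragments (#section). Whitespace,
# quotes, and angle brackets end a match, so URLs inside HTML attributes
# like href="..." are captured cleanly.
URL_PATTERN = re.compile(r"(?:https?://|www\.)[^\s\"'<>]+", re.IGNORECASE)

def extract_urls(text):
    """Return every URL-like match as a list, one entry per URL."""
    return URL_PATTERN.findall(text)
```

A real extractor layers more rules on top (trailing-punctuation trimming, stricter domain validation), but this captures the core mechanism: one pass over the text, every match collected into a list.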
Step-by-Step: How to Extract URLs
- Open the URL Extractor tool.
- Paste your raw text, HTML source, or document content into the input box.
- Click Extract URLs.
- Review the extracted list. Use the copy button to grab all results at once.
- Optionally, run the list through Clean Text to strip trailing slashes or normalize formatting.
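If you want to script the same extract-and-clean workflow outside the browser, a minimal Python sketch (again using a simplified pattern, not the tool's exact regex) could look like this:

```python
import re

URL_RE = re.compile(r"(?:https?://|www\.)[^\s\"'<>]+", re.IGNORECASE)

def extract_and_clean(text):
    """Extract URLs, strip trailing slashes, and return them one per line."""
    urls = [u.rstrip("/") for u in URL_RE.findall(text)]
    return "\n".join(urls)
```

This mirrors steps 2 through 5: paste in raw text, extract, then normalize the results so they are ready to copy or feed into the next step.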
Real Use Cases
1. Auditing Links in a Document
You have a 50-page Word document or PDF export and need to verify every link it contains. Paste the document text into the extractor and get a complete list in seconds β no manual scanning required.
2. Scraping Visible Links from Webpage Source
Copy the raw HTML source of any webpage and paste it into the tool. The extractor will surface every URL referenced in href, src, and action attributes, giving you a full picture of where a page links to.
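For HTML specifically, attribute-aware parsing is an alternative to a plain regex. A minimal sketch using Python's standard-library html.parser, collecting exactly the href, src, and action attributes mentioned above:

```python
from html.parser import HTMLParser

class LinkCollector(HTMLParser):
    """Collect URL values from href, src, and action attributes."""

    def __init__(self):
        super().__init__()
        self.urls = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs for the current tag.
        for name, value in attrs:
            if name in ("href", "src", "action") and value:
                self.urls.append(value)

def links_from_html(html):
    collector = LinkCollector()
    collector.feed(html)
    return collector.urls
```

Unlike a bare regex, this approach also surfaces relative URLs (such as /logo.png) that a pattern anchored on http:// or www. would miss.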
3. Finding Broken or Outdated Links in Exported Content
When migrating a blog or CMS, export your content as plain text or HTML and run it through the extractor. You can then check each URL against your new site structure to find any that need updating before the migration goes live.
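Once you have the extracted list, checking each URL can be automated. A minimal sketch using Python's standard library (a production checker would also throttle requests and follow redirect chains explicitly):

```python
from urllib.request import Request, urlopen
from urllib.error import URLError, HTTPError

def check_url(url, timeout=5):
    """Return the HTTP status code for a URL, or None if it is unreachable."""
    try:
        # HEAD avoids downloading the full page body.
        req = Request(url, method="HEAD")
        with urlopen(req, timeout=timeout) as resp:
            return resp.status
    except HTTPError as err:
        # The server answered with an error status (404, 500, ...).
        return err.code
    except (URLError, ValueError):
        # Unreachable host or malformed URL.
        return None
```

Run this over the extracted list and flag anything that returns 404, a 5xx code, or None before the migration goes live.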
4. Building a Link Inventory for SEO Audits
SEO professionals often need to catalog internal and external links across a page. Extract all URLs first, then cross-reference them with your site map or use a crawler. Pair the extractor with Email Extractor or IP Extractor when working with server logs that mix multiple data types.
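Splitting the extracted list into internal and external links is a common first step in that cataloging. A minimal sketch, where "example.com" stands in for your own domain:

```python
from urllib.parse import urlparse

def split_links(urls, site_domain):
    """Partition absolute URLs into internal and external lists,
    treating subdomains of site_domain as internal."""
    internal, external = [], []
    for url in urls:
        host = urlparse(url).netloc.lower()
        if host == site_domain or host.endswith("." + site_domain):
            internal.append(url)
        else:
            external.append(url)
    return internal, external
```

The internal list can then be diffed against your sitemap, while the external list becomes the input for an outbound-link review.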
Tips for Cleaning Your Results
Raw extraction sometimes pulls duplicate URLs or near-duplicates (with and without trailing slashes). After extracting, consider running your list through a deduplication step and sorting it alphabetically so patterns become obvious. If the source text includes email addresses formatted as mailto: links, those will also be captured, so filter them out if you only want web URLs.
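All three cleanup steps can be combined into one small function. A sketch that dedupes while ignoring trailing slashes, drops mailto: links, and sorts the result:

```python
def clean_results(urls):
    """Dedupe (ignoring trailing slashes), drop mailto: links, and sort."""
    seen = set()
    cleaned = []
    for url in urls:
        if url.lower().startswith("mailto:"):
            continue  # email link, not a web URL
        key = url.rstrip("/")  # https://a.com/ and https://a.com collapse
        if key not in seen:
            seen.add(key)
            cleaned.append(key)
    return sorted(cleaned)
```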