API Calls: Text Extraction

AlchemyAPI provides API calls for extracting page text, titles, and other content from any publicly accessible webpage or posted HTML document.

To extract the cleaned primary article/page text (ignoring advertisements, navigation links, and other unimportant content) from a publicly-accessible webpage or posted HTML document, use the following API calls:

URLGetText: Used for processing publicly accessible Internet web pages.
HTMLGetText: Used for processing uploaded HTML content.
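
As a rough illustration of how a URLGetText request might be assembled, the Python sketch below builds a GET URL without sending it. Only the call name URLGetText comes from this page. The base endpoint, the way the call name is appended to it, and the parameter names `apikey` and `url` are assumptions made for the example, and `build_text_request` is a hypothetical helper; consult the API reference for the actual request format.

```python
from urllib.parse import urlencode


def build_text_request(base_url: str, api_key: str, page_url: str) -> str:
    """Return a GET URL for a URLGetText call (nothing is sent).

    Assumptions not stated on this page: the call name is appended to
    base_url as a path segment, and the query parameters are named
    "apikey" and "url".
    """
    # urlencode percent-encodes the target page URL so it is safe
    # to embed as a query-string value.
    query = urlencode({"apikey": api_key, "url": page_url})
    return f"{base_url.rstrip('/')}/URLGetText?{query}"


# Hypothetical endpoint and placeholder key, for illustration only.
request_url = build_text_request(
    "https://api.example.invalid/calls/url",
    "YOUR_API_KEY",
    "http://www.example.com/article.html",
)
print(request_url)
```

Keeping the base endpoint as an argument avoids hard-coding a value this page does not specify.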


To extract the raw page text (including all advertisements, navigation links, and other content) from a publicly-accessible webpage or posted HTML document, use the following API calls:

URLGetRawText: Used for processing publicly accessible Internet web pages.
HTMLGetRawText: Used for processing uploaded HTML content.


To extract the page title from a publicly-accessible webpage or posted HTML document, use the following API calls:

URLGetTitle: Used for processing publicly accessible Internet web pages.
HTMLGetTitle: Used for processing uploaded HTML content.
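
The six calls in this section follow one pattern: the input source (a web page URL or uploaded HTML) and the content to extract (cleaned text, raw text, or title) together determine the call name. The Python sketch below captures that mapping as a lookup table. The call names are taken from this page; the `CALLS` table, the `pick_call` helper, and the key strings are hypothetical names introduced only for illustration.

```python
# Call names exactly as listed in this section, keyed by
# (input source, content to extract). The keys are illustrative labels.
CALLS = {
    ("url", "text"): "URLGetText",
    ("html", "text"): "HTMLGetText",
    ("url", "raw_text"): "URLGetRawText",
    ("html", "raw_text"): "HTMLGetRawText",
    ("url", "title"): "URLGetTitle",
    ("html", "title"): "HTMLGetTitle",
}


def pick_call(source: str, extract: str) -> str:
    """Return the call name for a given input source and extraction target."""
    key = (source.lower(), extract.lower())
    if key not in CALLS:
        raise ValueError(f"no text extraction call for {key!r}")
    return CALLS[key]


# A public web page, cleaned article text:
print(pick_call("url", "text"))
# Uploaded HTML, page title only:
print(pick_call("html", "title"))
```

A table like this keeps the choice of call in one place, so code that handles both web pages and uploaded HTML does not need to scatter call-name strings.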