AlchemyAPI offers easy-to-use facilities for extracting page text, titles, and other content from any publicly accessible webpage or posted HTML document.
To extract the cleaned primary article/page text (ignoring advertisements, navigation links, and other unimportant content) from a publicly-accessible webpage or posted HTML document, use the following API calls:
| API call | Description |
|---|---|
| URLGetText | Used for processing publicly accessible Internet web pages. |
| HTMLGetText | Used for processing uploaded HTML content. |
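
As a rough illustration of how a URLGetText request might be assembled, the sketch below builds a GET URL for a target page. The base endpoint, the `/url/` path segment, and the `apikey`, `url`, and `outputMode` parameter names are assumptions that do not appear in this document, and the function name is hypothetical; check them against the API reference before relying on them.

```python
from urllib.parse import urlencode

# Assumed base endpoint; this document does not state it.
ASSUMED_BASE = "http://access.alchemyapi.com/calls"


def build_url_get_text_request(page_url, api_key, base=ASSUMED_BASE):
    """Build a GET URL for the URLGetText call.

    The parameter names (apikey, url, outputMode) are assumptions,
    not taken from this document.
    """
    # urlencode percent-escapes the target page URL so it can be
    # passed safely as a query-string value.
    query = urlencode({
        "apikey": api_key,
        "url": page_url,
        "outputMode": "json",
    })
    return f"{base}/url/URLGetText?{query}"


if __name__ == "__main__":
    print(build_url_get_text_request("http://example.com/", "YOUR_API_KEY"))
```

The sketch only constructs the request URL and sends nothing over the network. HTMLGetText would presumably take the HTML content itself in a POST body instead of a `url` parameter.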
To extract the raw page text (including all advertisements, navigation links, and other content) from a publicly-accessible webpage or posted HTML document, use the following API calls:
| API call | Description |
|---|---|
| URLGetRawText | Used for processing publicly accessible Internet web pages. |
| HTMLGetRawText | Used for processing uploaded HTML content. |
To extract the page title from a publicly-accessible webpage or posted HTML document, use the following API calls:
| API call | Description |
|---|---|
| URLGetTitle | Used for processing publicly accessible Internet web pages. |
| HTMLGetTitle | Used for processing uploaded HTML content. |
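
The six calls above follow a regular pattern: the input source (a web page URL or uploaded HTML) combined with the kind of content wanted (cleaned text, raw text, or title). A minimal sketch of that lookup is shown below. The call names come from the tables above, while the helper name `call_name` and the `"url"`/`"html"` and `"text"`/`"raw_text"`/`"title"` keys are hypothetical labels chosen for illustration.

```python
# Call names as listed in this document, keyed by
# (input source, content kind). The keys are illustrative labels.
CALLS = {
    ("url", "text"): "URLGetText",
    ("html", "text"): "HTMLGetText",
    ("url", "raw_text"): "URLGetRawText",
    ("html", "raw_text"): "HTMLGetRawText",
    ("url", "title"): "URLGetTitle",
    ("html", "title"): "HTMLGetTitle",
}


def call_name(source, content):
    """Return the API call name for a given input source and content kind."""
    try:
        return CALLS[(source, content)]
    except KeyError:
        raise ValueError(
            f"unsupported combination: source={source!r}, content={content!r}"
        ) from None


if __name__ == "__main__":
    print(call_name("url", "title"))
    print(call_name("html", "raw_text"))
```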