Text Extraction / Web Page Cleaning
AlchemyAPI provides easy-to-use mechanisms to extract page text and title information from any web page.
A HTML page cleaning facility is provided, which normalizes / cleans HTML content (removing ads, navigation links, and other unimportant content), enabling extraction of only the important article text.

API endpoints are provided for performing text / title extraction on Internet-accessible URLs and posted HTML files.
Extracted meta-data may be returned in XML, JSON, and RDF formats. More information on API response formats is available here.
