Back to Top

Microformats Parsing

Microformats Parsing

AlchemyAPI understands Microformats data standards and is capable of extracting hCard, adr, geo, and rel-* formatted content from any web page. Leverage this structured data to enhance web page categorization and indexing, and perform content discovery tasks.

API endpoints are provided for performing Microformats extraction on Internet-accessible URLs and posted HTML files.

Extracted meta-data may be returned in XML, JSON, and RDF formats. More information on Microformats API response formats is available.

AlchemyAPI understands and extracts the following Microformats data standards:

hCard hCard is a simple, open, distributed format for representing people, companies, organizations, and places.
geo geo is a simple format for representing geographic coordinates (latitude; longitude).
adr adr is a simple format for embedding physical address information.
rel-tag rel-tag is a simple format for embedding category tags and keywords.
rel-license rel-License is a simple, open, format for specifying content licenses.


AlchemyAPI understands the hCard Microformats data standard and is capable of extracting the following hCard data fields:

NamePrefix NameGivenName NameFamilyName NameSuffix
FormattedName OrgName RoleNickname Title
Telephone Url all geo-formatted data all adr-formatted data


AlchemyAPI understands the adr Microformats data standard and is capable of extracting the following adr data fields:

AddrStreetAddress AddrExtendedAddress AddrPostOfficeBox
AddrLocality AddrPostalCode AddrCountry


AlchemyAPI understands the geo Microformats data standard and is capable of extracting the following geo data fields:

GeoLatitude GeoLongitude


AlchemyAPI understands the rel-tag Microformats data standard and is capable of extracting the following rel-tag data fields:

RelTag RelTagLink


AlchemyAPI understands the rel-license Microformats data standard and is capable of extracting the following rel-license data fields:

RelLicense