Back to Top

Web API

Web API: Author Extraction

AlchemyAPI provides easy-to-use facilities for extracting author information embedded within news articles and blog posts. These URL processing calls automatically fetch the desired Internet webpage, extracting any author information embedded within. These API calls may be utilized to process news articles, blogs, and other publicly-accessible Internet content. If you are processing content that is not hosted on a public webserver, use our HTML API calls instead.

API Call: URLGetAuthor

Description: The URLGetAuthor call is utilized to extract author information contained within the specified web page. AlchemyAPI will download the requested URL, process the retrieved HTML document, looking for author information.

Endpoint: http://access.alchemyapi.com/calls/url/URLGetAuthor

Parameters:

http argument parameter description
url http url (must be uri-argument encoded)

(required parameter)
apikey your private api key

(required parameter)
outputMode desired API output format

Possible values:
xml (default)
json
rdf

(optional parameter)
jsonp desired JSONP callback

(optional parameter, requires "outputMode" to be set to json)

Response Format (XML):

<results>
    <status>REQUEST_STATUS</status>
    <url>REQUESTED_URL</url>
    <author>DOCUMENT_AUTHOR</author>
</results>

Response Format (JSON):

{
    "status": "REQUEST_STATUS",
    "url": "REQUESTED_URL",
    "author": "DOCUMENT_AUTHOR"
}

Response Format (RDF):

<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
                 xmlns:aapi="http://rdf.alchemyapi.com/rdf/v1/s/aapi-schema#"
                 xml:base="http://rdf.alchemyapi.com/rdf/v1/r/response.rdf">
    <rdf:Description rdf:ID="DOCUMENT_HASH">
        <rdf:type rdf:resource="http://rdf.alchemyapi.com/rdf/v1/s/aapi-schema#DocInfo"/>
        <aapi:ResultStatus>REQUEST_STATUS</aapi:ResultStatus>
        <aapi:URL>DOCUMENT_URL</aapi:URL>
        <aapi:DocAuthor>DOCUMENT_AUTHOR</aapi:DocAuthor>
    </rdf:Description>
</rdf:RDF>

Response Fields:

field name field description
status success / failure status indicating whether the request was processed.

Possible values:
OK
ERROR
url http url information was requested for.
author the extracted author name.
statusInfo failure status information (sent only if "status" == "ERROR").

Possible values:
author-not-found:cannot-locate
author-not-found:multiple-candidates
invalid-api-key
cannot-retrieve
page-is-not-html

Example Calls:

XML: http://access.alchemyapi.com/calls/...
RDF: http://access.alchemyapi.com/calls/...

API Notes:

  1. Calls to URLGetAuthor can be made using HTTP GET or POST.
  2. HTTP POST calls should include the Content-Type header: application/x-www-form-urlencoded
  3. URL retrieval is attempted for a maximum of 10 seconds. Requests taking longer than this will result in a "cannot-retrieve" error response.
  4. Requested HTML documents can be a maximum of 600 kilobytes. Larger documents will result in a "content-exceeds-size-limit" error response.
  5. If no author name is found within the processed HTML document, an author-not-found error will be returned.