How to use the Textractor API

Simply by doing a request to https://www.textractor.net/api?u={Aricle url}. It will return a bunch of JSon for you to further process.

The JSon output will return something like this:


                {
                    "PageTitle":"......",
                    "MetaDescription":"......",
                    "MetaKeywords":"......",
                    "DocumentLanguage":"......",
                    "Paragraphs":[
                        {"Text":"......"},
                        {"Text":"......"}
                    ],
                    "Images":......,
                    "Title":"......"
                }
            

What can go wrong?

If either you or we mess up, one of the following errors will be returned:

  • 100: Not a valid url
  • 101: The requested page is not available
  • 102: Page not found
  • 103: Website returned an internal server error
  • 104: The page html structure of the requested url is invalid
  • 105: No content was found
  • 106: Unknown error occured

Some examples