![]()
The HTML2Text API extracts plain text from web page content. The API accepts URL’s or HTML markup. This API can be used as a pre-step to any text processing task.
Example Extracting Text from a URL
JSON Input
|
1 2 3 4 5 |
{ "input": "http://www.rxnlp.com", "keepLineBreak": "false" } |
JSON Output
|
1 2 3 |
{ "results": " Solutions for Text Mining and Analytics | RxNLP [RxNLP: APIs, Tools and Tutorials for Text Mining and Analytics] RxNLP: APIs, Tools and Tutorials for Text Mining and Analytics View rxnlp_api’s profile on Twitter View rxnlp’s profile on GitHub Menu Home Web APIs API Key API Overview Text Similarity Topics Extraction API Sentence Clustering N-Gram & Word Count GitHub Repository Blog Dev Resources Q&A Ask Question Contact [word cloud] Topics Extraction API The Topics Extraction API allows you to find key topics in text. Extracted topics can be used to summarize documents, create navigable word clouds, features for machine learning, visualization of large amounts of text and more! The supporting texts allows you to easily link back to snippets from your text! Read more [text clustering] Sentence Clustering API The Sentence Clustering API clusters sentence level texts such as Legal Documents, Tweets, Facebook Status Updates,... |
}
Example Extracting Text from HTML Content
JSON Input
JSON Output
|
1 2 3 4 |
{ "results": "this is and example of a html page " } |
