Questions and Answers › Category: API Related Question › Text extraction from HTML and PDF 0 Vote Up Vote Down Dan Davis asked 2 months ago Does rxnlp support text extraction from document formats such as HTML (which can have a lot of boilerplate), PDF, and so on? Related 1 Answers 0 Vote Up Vote Down KG Staff answered 2 months ago It is mainly focused on removing HTML tags. So if anything is encapsulated within HTML style tags then it should work. Related