textract – extract text from any document. Currently supports .doc, .docx, .eml, .json, .html, .pptx, .pdf, and .txt.
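
Supporting a list of file types like the one above usually comes down to choosing a handler from the file extension. The following is a minimal sketch of that idea using only the Python standard library. It is not textract's own code or API: the names `extract_text`, `HANDLERS`, and the `_from_*` helpers are hypothetical, and only the plain-text formats (.txt, .json, .html, .eml) are covered, since the binary formats need third-party parsers.

```python
import json
from email import message_from_string
from html.parser import HTMLParser
from pathlib import Path


class _TextCollector(HTMLParser):
    """Collects the text nodes of an HTML document, ignoring tags."""

    def __init__(self):
        super().__init__()
        self.chunks = []

    def handle_data(self, data):
        # Note: this also picks up <script> and <style> contents.
        if data.strip():
            self.chunks.append(data.strip())


def _from_txt(raw):
    return raw


def _from_json(raw):
    # Keep only string values, walking nested dicts and lists.
    def walk(node):
        if isinstance(node, str):
            yield node
        elif isinstance(node, dict):
            for value in node.values():
                yield from walk(value)
        elif isinstance(node, list):
            for item in node:
                yield from walk(item)

    return " ".join(walk(json.loads(raw)))


def _from_html(raw):
    collector = _TextCollector()
    collector.feed(raw)
    collector.close()
    return " ".join(collector.chunks)


def _from_eml(raw):
    # Assumption: simple, non-multipart messages only.
    payload = message_from_string(raw).get_payload()
    return payload if isinstance(payload, str) else ""


# Hypothetical extension-to-handler table.
HANDLERS = {
    ".txt": _from_txt,
    ".json": _from_json,
    ".html": _from_html,
    ".eml": _from_eml,
}


def extract_text(path):
    """Return the text of `path`, choosing a handler by file extension."""
    path = Path(path)
    handler = HANDLERS.get(path.suffix.lower())
    if handler is None:
        raise ValueError(f"unsupported extension: {path.suffix!r}")
    return handler(path.read_text(encoding="utf-8"))
```

With a table like this, adding a format means registering one more handler, and an unknown extension fails early with a clear error instead of returning garbage.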