textract – extract text from any document Leonid Mamchenkov 10 years ago textract – extract text from any document. Currently supports .doc, .docx, .eml, .json, .html, .pptx, .pdf, and .txt. Share: Microsoft vulnerability, now served with plain text filesJSON API? No … HAL!Automerge – a JSON-like data structure for concurrent multi-user editing