Opensource: Natural Language Process (NLP)

Integrated System

Topic Modeling

Academic

  • Reference extraction: The cb2Bib is a free, open source, and multiplatform application for rapidly extracting unformatted, or unstandardized bibliographic references from email alerts, journal Web pages, and PDF files.
  • Crossref lab,crossref好像是搞学术文章索引的,核心点在于DOI? Anyway,它的lab页面收录了不少好的开源工具,比如可以做PDF文件的抽取等。
  • Bible Passage Reference Parser: https://github.com/souliberty/Bible-Passage-Reference-Parser