Posts

Showing posts from February, 2018

Basset - Information Retrieval Library in PHP

Basset is a full-text PHP information retrieval (IR) library. It is a collection of developments from the field of IR, ported over to PHP for research purposes. Basset provides different ways of searching through documents in a collection (ad hoc retrieval) by applying advanced and experimental IR algorithms and techniques gathered from research studies and conferences, most notably TREC, SIGIR, ECIR, and ACM.

Basics

Warning: This tool is continuously under development. Please use it as a research tool when exploring your otherwise special production needs.

Adding Documents

Basset manages adding documents through the IndexWriter class. It processes the documents you add and later commits them to an external file. It takes a directory path and an overwrite flag, which default to '../index/' and true respectively. Setting overwrite to false means that you won't accidentally overwrite an existing index inside the directory.

Methods:...
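
The two arguments described above (a directory path defaulting to '../index/' and an overwrite flag defaulting to true) can be sketched with a small stand-in class. This is only an illustration, not Basset's actual IndexWriter: the class name SketchIndexWriter, the add() and commit() methods, the index.json file name, and the choice to throw an exception when overwrite is false are all assumptions made for this example.

```php
<?php
// Hypothetical stand-in, NOT Basset's real IndexWriter. It only mirrors the
// two arguments and defaults described in the post.
final class SketchIndexWriter
{
    private string $dir;
    private array $docs = [];

    public function __construct(string $dir = '../index/', bool $overwrite = true)
    {
        $this->dir = rtrim($dir, '/') . '/';
        $file = $this->dir . 'index.json'; // assumed file name

        // Assumed behaviour: with overwrite disabled, refuse to touch an
        // index that already exists in the directory.
        if (!$overwrite && file_exists($file)) {
            throw new RuntimeException("Index already exists in {$this->dir}");
        }
    }

    // Buffer a document in memory until commit() is called.
    public function add(string $id, string $text): void
    {
        $this->docs[$id] = $text;
    }

    // Write the buffered documents to the external file and return its path.
    public function commit(): string
    {
        if (!is_dir($this->dir)) {
            mkdir($this->dir, 0777, true);
        }
        $file = $this->dir . 'index.json';
        file_put_contents($file, json_encode($this->docs));
        return $file;
    }
}

// Usage: write an index into a temporary directory, then try to reopen it
// with overwrite set to false.
$dir = sys_get_temp_dir() . '/basset_sketch_' . uniqid();
$writer = new SketchIndexWriter($dir);
$writer->add('doc1', 'a small test document');
$path = $writer->commit();

try {
    new SketchIndexWriter($dir, false);
    $guarded = false;
} catch (RuntimeException $e) {
    $guarded = true; // the existing index was protected
}
```

The sketch checks for an existing index in the constructor so that a mistaken path fails before any documents are processed; whether Basset itself guards the index this way is not stated in the post.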