14

Pisa – Probably the Fastest Full Text Search Engine Written in C++

 5 years ago
source link: https://www.tuicool.com/articles/EFFzEjv
Go to the source link to view the article. You can view the picture content, updated content and better typesetting reading experience. If the link is broken, please click the button below to view the snapshot at that time.
neoserver,ios ssh client

vQNVr22.png!web

PISA: Performant Indexes and Search for Academia v0.6.6

Join us on Slack

Get in touch via Slack:

Description

PISA is a text search engine able to run on large-scale collections of documents. It allows researchers to experiment with state-of-the-art techniques, allowing an ideal environment for rapid development.

Some features of PISA are listed below:

  • Written in C++ for performance;
  • Indexing & Parsing & Sharding capabilities;
  • Many index compression methods implemented;
  • Many query processing algorithms implemented;
  • Implementation of document reordering;
  • Free and open-source with permissive license;

The best way to get started is by reading the official documentation .

If you want to get involved with PISA, please check out our Contributing page.

Reference

Reference to cite when you use PISA in a research paper:

@inproceedings{MSMS2019,
  title={{PISA}: {P}erformant {I}ndexes and {S}earch for {A}cademia},
  author={Mallia, Antonio and Siedlaczek, Michal and Mackenzie, Joel and Suel, Torsten},
  booktitle={Proceedings of the Open-Source IR Replicability Challenge (OSIRRC 2019)},
  year={2019}
}

Credits

PISA is a fork of the ds2i project started by Giuseppe Ottaviano .


About Joyk


Aggregate valuable and interesting links.
Joyk means Joy of geeK