Login   Register  
PHP Classes

Fuzzy Index: Index text for performing fuzzy search

Recommend this page to a friend!
Stumble It! Stumble It! Bookmark in del.icio.us Bookmark in del.icio.us

  Author Author  
Picture of Philipp Strazny
Name: Philipp Strazny is available for providing paid consulting. Contact Philipp Strazny .
Classes: 5 packages by
Country: United States United States
Age: 47
All time rank: 1576211 in United States United States
Week rank: 582 Up56 in United States United States Up
Innovation award
Innovation award
Nominee: 3x

Winner: 1x

  Detailed description   Download Download .zip .tar.gz  
This class can index text for performing fuzzy search.

It can process a list of text strings and build a database that indexes snippets of those strings and the locations where they appear.

The class can also search for given keywords and returns the locations of the indexed strings where the best matching text appears.

It uses SQLite to store the indexed text database, but the class can be extended to use a different database type.

It uses certain heuristics to extract the snippets from the indexed text. These heuristics are implemented as separate classes that can be used interchangeably.

  Classes of Philipp Strazny  >  Fuzzy Index  >  Download Download .zip .tar.gz  >  Support forum Support forum (2)  >  Blog Blog  >  RSS 1.0 feed RSS 2.0 feed Latest changes  
Name: Fuzzy Index
Base name: fuzzy-index
Description: Index text for performing fuzzy search
Version: -
PHP version: 5.3
License: GNU Lesser General Public License (LGPL)
All time users: 683 users
All time rank: 4310
Week users: 0 users
Week rank: 1599 Equal
  Groups   Rate classes User ratings   Applications   Files Files  

Group folder image PHP 5 Classes using PHP 5 specific features View top rated classes
Group folder image Databases Database management, accessing and searching View top rated classes
Group folder image Searching Search engines, crawling and indexing View top rated classes
Group folder image Text processing Manipulating and validating text data View top rated classes

  Innovation Award  
PHP Programming Innovation award winner
June 2012

Prize: One copy of the Zend Studio
Searching for text in a large documents is not a trivial text.

To make it useful it needs to be fast and take in account that search words may be misspelled and they may not appear contiguously in the document being searched.

This class addresses the challenges of searching large text documents. It builds a database that indexes the documents in a way that is fast to search and locate the text snippets that contain the words that the user is looking for.

Manuel Lemos

  User ratings  
Not yet rated by the users

  Applications that use this package  
No pages of applications that use this class were specified.
Add link image If you know an application of this package, send a message to the author to add a link here.
  Files folder image Files  
File Role Description
Plain text file FuzzyIndex.php Class FuzzyIndex class and utility classes
Accessible without login Plain text file FuzzyIndexTest.php Test tests for FuzzyIndex and heuristics
Accessible without login HTML file fuzzyindex_readme.html Doc. explanation
Accessible without login Plain text file demo_multilingual.php Example usage example

Download Download all files: fuzzy-index.tar.gz fuzzy-index.zip
NOTICE: if you are using a download manager program like 'GetRight', please Login before trying to download this archive.