algorithm for generating top fuzzy variations ...

Author: Giovanni Azua (External; Since: Sep 24, 2003; Posts: 11)

(Msg. 1) Posted: Fri Aug 05, 2005 3:55 pm
Archived from groups: comp>lang>java>databases

Hello all,

I am interested in obtaining the top N fuzzy variations of a
string (a person or company name) using the same concept
as the Levenshtein distance. Usually Levenshtein is used to
compute the distance between two given strings, but I would
like instead to have an algorithm that generates the N highest
scoring fuzzy variations of any given term, e.g.

Giovanni - 100%
Giovann - 98%
iovanni - 98%
Govanni - 98%
....
anni - 55%

This way I can precompute the variations in advance instead of
computing distances during online matching.

Can anyone recommend an existing implementation, e.g. in Java?

Many thanks in advance,
Best Regards,
Giovanni
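
One possible starting point, as a minimal sketch rather than an existing library: generate variations level by level, deleting one more character at each level. A variation made by deleting k characters has Levenshtein distance exactly k from the original, so the level number is the distance. The class name FuzzyVariations, the method topVariations, the restriction to deletions (no insertions or substitutions), and the score formula 1 - distance/length are all assumptions made for illustration; this scoring will not reproduce the percentages listed in the post.

```java
import java.util.LinkedHashMap;
import java.util.LinkedHashSet;
import java.util.Map;
import java.util.Set;

// Hypothetical helper: enumerates deletion-only variations of a term,
// closest first, and scores them with an assumed formula.
final class FuzzyVariations {

    /**
     * Returns up to n variations of term in order of increasing distance.
     * Score = 1 - (deleted characters / term length); this is an assumption.
     * The empty string is never returned.
     */
    static Map<String, Double> topVariations(String term, int n) {
        Map<String, Double> result = new LinkedHashMap<>();
        if (term.isEmpty() || n <= 0) {
            return result;
        }
        // Level 0 holds only the original term (distance 0).
        Set<String> level = new LinkedHashSet<>();
        level.add(term);
        for (int distance = 0; distance < term.length(); distance++) {
            double score = 1.0 - (double) distance / term.length();
            for (String s : level) {
                if (result.size() >= n) {
                    return result;
                }
                result.put(s, score);
            }
            // Next level: delete one character from every string at this level.
            // The set removes duplicates such as the two ways to get "Giovani".
            Set<String> next = new LinkedHashSet<>();
            for (String s : level) {
                for (int i = 0; i < s.length(); i++) {
                    next.add(s.substring(0, i) + s.substring(i + 1));
                }
            }
            level = next;
        }
        return result;
    }

    public static void main(String[] args) {
        // Print the five closest variations with their assumed scores.
        topVariations("Giovanni", 5).forEach(
            (variation, score) -> System.out.printf("%s - %.1f%%%n", variation, score * 100));
    }
}
```

The resulting map can be computed once per name and stored, so online matching becomes a lookup. The number of deletion-only variations grows quickly with term length, so for long names a cap on the maximum distance would probably be needed in addition to N.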
