Detect similar sounding words in Ruby

I'm aware of SOUNDEX and (double) Metaphone, but these don't let me test for the similarity of words as a whole - for example "Hi" sounds very similar to "Bye", but both of these methods will mark...

"Did you mean" feature on a dictionary database

I have a ~300.000 row table; which includes technical terms; queried using PHP and MySQL + FULLTEXT indexes. But when I searching a wrong typed term; for example "hyperpext"; naturally giving no...

Metaphone 3 information

Does anyone know where code can be found for Metaphone 3 matching for T-SQL or at least something that describes in detail the difference between Double Metaphone and Metaphone 3? I have been...

What is the Metaphone 3 Algorithm?

I want to code the Metaphone 3 algorithm myself. Is there a description? I know the source code is available for sale but that is not what I am looking for.

RegEx for fulltext search with typos

I have a MySQL table with the following columns: City Country Continent New York States Noth America New York Germany Europe - considering there's one ;) Paris France Europe If...

elasticsearch remove custom analyzer / filter

I'm new to elastic search and I was wondering if it's possible to delete a custom analyzer or a custom filter from an index.. For example, imagine the following index settings: "settings" :...

Replace words using Soundex, python

i have a list of sentences and basically my aim is to replace all diff occurrences of prepositions in the form "opp,nr,off,abv,behnd" with their correct spellings "opposite,near,above,behind" and...

Package has mismatched uid: 10124 on disk, 10134 in settings

I have some problems on Android 2.3.X devices for one of my apps (package name is com.netbiscuits.kicker). However I can not install my APK. I have tried to install it directly from eclipse (debug...

How can I import function based indexes using impdp?

I have a number of tables that use function based indexes (indices if you prefer). These indexes use functions within packages that I have defined. When importing the schema of the user it would...

Unexpected results from Metaphone algorithm

I am using phonetic matching for different words in Java. i used Soundex but its too crude. i switched to Metaphone and realized it was better. However, when i rigorously tested it. i found weird...

Undefined symbols for architecture x86_64: ... "_main", referenced from: implicit entry/start for main executable

Yak-shaving alert. Although I am precluded from displaying any source code, I figure with a well-written post I may be able to provide enough info to get assistance. The steps I have tried below...

Elasticsearch Soundex Match Query - NEST

Can anyone think why this may not be working? I basically have two fields which I index using a soundex analyzer see configuration below but when I search using names similar to what is stored in...

PostgreSQL: Address matching using fuzzymatch from two tables

What I want to do; I have two tables with two address columns , both stored as text I want to create a view returning the matching rows. What I've tried; I've created and index on both columns and...

AWS RDS PostgreSQL 9.5.4 Extension postgis_tiger_geocoder Missing Soundex?

I am attempting to install the AWS "Approved" PostgreSql Extension on our on large RDS instance but every time I at the point I attempt to 'create extension postgis_tiger_geocoder' I get...

Python.- fuzzy.DMetaphone 'ascii' error

How is that possible, that with the same input I sometime get ascii codec error, and sometime it works just fine? The code cleans the name and build it's Soundex and DMetaphone values. It works in...

Difference between Metaphone 3 and Double Metaphone

I have been reading many articles on Metaphone 3 last couple of days. I saw Metaphone 3 also returns 2 key for each word just like Double Metaphone. Actually, I am confused to figure out what is...

Rearrange words in array to matching position calculating Levenshtein distance Php

Rearrange words in Array based on position of the first array. In my code there are two array my first array is the base array from which i am going to compare it with second array and make the...

How to fix"Illuminate\Support\Collection::get(), 0passed in /AMPPS/www/lsapp/vendor/laravel/framework/src/Illuminate/Support/Traits/ForwardsCalls.php"

In an attempt to program a search bar, I created a GET method and added a new controller where it gets the relevant data and returns it with the view. //This is the form in the view named...

Is there a multibyte-aware Postgresql Levenshtein?

When I use the fuzzystrmatch levenshtein function with diacritic characters it returns a wrong / multibyte-ignorant result: select levenshtein('ą', 'x'); levenshtein ------------- ...

Elasticsearch return phonetic token with search

I use the phonetic analysis plugin from elastic search to do some string matching thanks to phonetic transformation. My problem is, how to get phonetic transformation processed by elastic search...

PySpark ApproxSimilarityJoin Missing Results

I am trying to do a similarity join between two dataframes by applying MinHashLSH on the bigrams of metaphone representations of names. This works well in most cases but does not appear to handle...

How can I best leverage Azure Search for People name matching

I have a database of over a million contacts and need to return the best matches for a) user queries and b) batch jobs that run periodically. Not much debate that people name matching is complex...

How to give higher score to exact searches than phonetic ones in Elasticsearch?

I am currenty using Elasticsearch's phonetic analyzer. I want the query to give higher score to exact matches then phonetic ones. Here is the query I am using: { "query": { ...

How to make elastic search more flexible?

I am currently using this elasticsearch DSL query: { "_source": [ "title", "bench", "id_", "court", "date" ], "size": 15, "from": 0, ...

How to improve Elasticsearch query with ML/NLP?

I am currently using a fairly standard query with my Elasticsearch search. The only addition I am using is the metaphone analyzer. I wanted to know whether there are any in-built NLP or ML add-ons...

How to give different weights to exact, phonetic and fuzzy queries?

Note: I checked out this answer, but could not solve the problem. So currently I am using the following query: { "_source": [ "title", "bench", "id_", "court", ...

How to decide which Encoder to use for which language in Elasticsearch "Phonetic Token filter"?

I have used Metaphone and soundex Encoder with "Phonetic Token Filter" in Elasticsearch. Metaphone is good for English words. Soundex is good for English as well as Hindi maybe many other...

How to make a fulltext search

I want to make a fulltext search with metaphone. Everythings works fine. I have 4 fields ie. ID |Category | Type |Title |Meta 1 |Vehicle |4 Wheelers ...

Fuzzy Matching Emails on BigQuery

I would like to match names and emails that are spelled differently in various datasets on our BigQuery Data Warehouse. I've done cursive research on Fuzzy Matching on BigQuery. This question is...

Why use both conda and pip?

In this article, the author suggests the following To install fuzzy matcher, I found it easier to conda install the dependencies (pandas, metaphone, fuzzywuzzy) then use pip to...