Finding already existing records

I have a customer data set with multiple columns (Firstname, Lastname, phone, state, zip). I need an algorithm in Python that can check whether newly arriving records belong to customers who already exist in the data set. The main challenge is that the algorithm must not look for exact duplicate rows: I want to decide whether a customer already exists based on only a few properties, not all of them. For example:

For some customers, Name, Address 1 and City match (the remaining fields are different): existing customer.

For some customers, Name, Birth Date and Pin Code match (the remaining fields are different): existing customer.

For some customers, Name and Contact number match (the remaining fields are different): also an existing customer.

I am new to Python and have not been able to find a suitable solution for this. Please help.
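One way to approach the three cases above is to treat each combination of fields as a separate matching rule and keep one lookup set per rule, so a new record counts as an existing customer if it satisfies any single rule. The following is only a minimal sketch under assumptions: records are plain dictionaries, the field names (`name`, `address1`, `city`, `birth_date`, `pin_code`, `phone`) and the sample data are made up for illustration, and the normalization only lower-cases and collapses whitespace. Real data would probably need stricter cleaning, for example of phone number formats.

```python
# Each rule lists the fields that must ALL match for a record to count as an
# existing customer. The field names are assumptions made for illustration.
MATCH_RULES = [
    ("name", "address1", "city"),
    ("name", "birth_date", "pin_code"),
    ("name", "phone"),
]


def normalize(value):
    """Lower-case and collapse whitespace so trivial differences do not block a match."""
    return " ".join(str(value).lower().split())


def rule_key(record, rule):
    """Return the normalized values for a rule, or None if any field is missing or empty."""
    values = [record.get(field) for field in rule]
    if any(v is None or str(v).strip() == "" for v in values):
        return None
    return tuple(normalize(v) for v in values)


def build_indexes(existing_records):
    """Build one lookup set per rule from the existing customers."""
    indexes = {rule: set() for rule in MATCH_RULES}
    for record in existing_records:
        for rule in MATCH_RULES:
            key = rule_key(record, rule)
            if key is not None:
                indexes[rule].add(key)
    return indexes


def matching_rule(record, indexes):
    """Return the first rule under which the record matches an existing customer, else None."""
    for rule in MATCH_RULES:
        key = rule_key(record, rule)
        if key is not None and key in indexes[rule]:
            return rule
    return None


# Hypothetical sample data, not taken from the original data set.
existing = [
    {"name": "Asha Rao", "address1": "12 Lake Road", "city": "Pune",
     "birth_date": "1990-04-02", "pin_code": "411001", "phone": "5550100"},
]
indexes = build_indexes(existing)

# Same name (different spacing and case) and same phone, everything else differs.
same_customer = {"name": "asha  rao", "address1": "99 Hill St", "city": "Mumbai",
                 "birth_date": "1985-01-01", "pin_code": "400001", "phone": "5550100"}

# Same phone but a different name: no rule is fully satisfied.
other_customer = {"name": "Ravi Shah", "address1": "1 Park Lane", "city": "Delhi",
                  "birth_date": "1970-07-07", "pin_code": "110001", "phone": "5550100"}

print(matching_rule(same_customer, indexes))
print(matching_rule(other_customer, indexes))
```

Because each rule has its own set, checking a new record costs one lookup per rule regardless of how many customers exist. Adding a further combination of fields only requires another tuple in `MATCH_RULES`. This sketch handles exact matches after normalization only; tolerating typos in names would need a fuzzy comparison, which is a separate technique.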



Read more here: https://stackoverflow.com/questions/68474006/finding-already-existing-records

Content Attribution

This content was originally published by TANUJA GHOSH at Recent Questions - Stack Overflow, and is syndicated here via their RSS feed. You can read the original post over there.
