Data Cleaning
Avoid dangerous business decisions
The Data Cleaning solutions enable you to avoid making dangerous business decisions, based on low quality, incomplete and inaccurate data. Data cleaning ensures that business processes, strategic decisions, and marketing and business initiatives are backed up by relevant, timely and reliable data.
The SparkER web platform for Entity Resolution and the NORMS standardization tool help you maximize the quality and reliability of data, while increasing its completeness by enriching the information available to companies and institutions.
Who is it for?
PRODUCTION AND SERVICE COMPANIES OF ALL PRODUCTION SECTORS
Data cleaning solutions ensureManagers, Commercial Managers, IT Managers, Security Officers and Logistics Managers of the Ceramics, Mechanical, Logistics, Biomedical and Pharmaceutical sectors, as well as Consulting, Facility Management and Global Services, the following competitive advantages :
• Optimizing strategic decisions, control activities and production processes based on high quality data
• Implementing commercial and marketing initiatives based on complete, timely and reliable data
• Making future predictions based on reliable, relevant and high quality data
• Avoiding costs due to errors in production and logistics processes due to dirty, incomplete and low quality data
PUBLIC ADMINISTRATIONS AND INSTITUTIONS ON A LOCAL, NATIONAL AND INTERNATIONAL LEVEL
The solutions for Data Cleaning ensure Senior Administrative Officials and Managers of Public Administrations, such as Regions, Provinces, Municipalities, National and Local Agencies, Consortiums, the following advantages:
• Analyzing the performance of the different departments on the basis of complete, timely and reliable data
• Optimizing strategic decisions and control activities based on high quality data
• Making future predictions based on reliable, relevant and high quality data
Human Expert Knowledge
Supervised Machine Learning
Data Cleaning operations are carried out through a process of Supervised Machine Learning: this is a process of progressive learning consisting of Machine Learning algorithms that learn from data and exploit the knowledge provided by human experts (Human Expert Knowledge).
SparkER
Entity Resolution Framework
The SparkER web platform is an efficient and scalable tool that allows you to achieve performances that are superior to the tools currently on the market. SparkER can be configured through similarity functions optimized for the specific use case.
Record Matching
Data integration
The SparkER web platform enables the integration of data with different formats, distributed on different systems and updated with different timelines.
RELATED CASES
Case studies
RELATED TOOLS
SPARKER
Data Cleaning Web platform enabling you to solve the main problems in the integration of Big Data such as deduplication, record linkage, reference matching, disambiguation
NORMS
Data Cleaning tool capable of interpreting abbreviations, compound names and acronyms using semantics to suggest the user a result more similar to the context of the analyzed data
MOMIS
MOMIS ensures a clear and instant view of company data through the virtual integration of company information systems with external data sources such as Market Analysis, Social Networks, Open Data and geographical data