Maka-datasets
Businesses never have enough data and are continuously seeking external datasets to enrich their existing data. We sell this much-needed data to organizations to help them achieve the following:
a. Verify their existing data
b. Enrich their existing data
We achieve this by:
Identifying the external data most frequently needed by organizations (with a focus on organizations with fewer than 50 employees)
Identifying potential sources for these datasets and the gaps in each source
Consolidating data from these sources and closing gaps present in each source
Maka-builder
Given datasets from multiple sources for a specific entity, we consolidate them into a distinct set of records with automatically and manually verified attributes. Maka-builder produces high-quality datasets that organizations can use, and it has three components:
A service that extracts data from a public list, table or website
A service that merges data from different sources into one list of distinct records
A service that automatically verifies the accuracy of each attribute in the list of distinct records and updates it where needed
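The merge component described above can be illustrated with a minimal sketch. It is not Maka-builder's actual implementation: the function names, the matching rule (a normalized "name" field) and the sample records are all assumptions made for illustration. Real matching would likely need fuzzier rules and manual review.

```python
def normalize_key(name):
    """Build a crude matching key: lowercase, letters and digits only.

    Assumption: records that describe the same entity share a similar name.
    """
    return "".join(ch for ch in name.lower() if ch.isalnum())


def merge_sources(sources):
    """Merge records from several sources into one list of distinct records.

    `sources` maps a (hypothetical) source name to a list of dict records.
    For each attribute, the first non-empty value seen wins, and the source
    that supplied it is recorded so it can be verified later.
    """
    merged = {}
    for source_name, records in sources.items():
        for record in records:
            key = normalize_key(record["name"])
            entry = merged.setdefault(key, {"values": {}, "provenance": {}})
            for field, value in record.items():
                if value in (None, ""):
                    continue  # a gap in this source; another source may fill it
                if field not in entry["values"]:
                    entry["values"][field] = value
                    entry["provenance"][field] = source_name
    return list(merged.values())


# Made-up sample data: two sources that each have a gap for the same entity.
sources = {
    "registry": [{"name": "Acme Ltd.", "phone": "", "city": "Nairobi"}],
    "directory": [
        {"name": "ACME LTD", "phone": "555-0100", "city": None},
        {"name": "Beta Co", "phone": "555-0199", "city": "Kampala"},
    ],
}
records = merge_sources(sources)
for rec in records:
    print(rec["values"], rec["provenance"])
```

Keeping the provenance of each attribute alongside its value is one possible design choice: it gives a later verification step a way to tell which source to re-check when an attribute turns out to be wrong.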
Maka-transform
This is a data transformation framework for processing data in the cloud. Built using Airflow and dbt, the framework is SQL-based and supports the following key features:
SQL-Based Code Generation
SQL-Based Code Libraries
Smart Pipeline Scheduling
Automated Setup for Local Development and Testing
Post-Deployment Data Quality Checks
Functional Testing Framework