Data blending

Data blending is a process whereby big data from multiple sources[1] are merged into a single data warehouse or data set.[2] It concerns not merely the merging of different file formats or disparate sources of data but also different varieties of data.[3] Data blending allows business analysts to cope with the expansion of data which they need to make critical business decisions based on good quality business intelligence.[4]

Data blending has been described as different to data integration due to the requirements of data analysts to merge sources very quickly, too quickly for any practical intervention by data scientists.[5]

Challenges

The most common custom metadata question is: "How can this dataset blend with (join or union to) my other datasets?"[6] A 2015 Forrester Consulting study found that 52 percent of companies are blending 50 or more data sources and 12 percent are blending over 1,000 sources.[7]

See also

References

This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.