BioMart
BioMart is a community-driven project to provide a single point of access to distributed research data. The BioMart project contributes open source software and data services to the international scientific community. Although the BioMart software is primarily used by the biomedical research community, it is designed in such a way that any type of data can be incorporated into the BioMart framework. The BioMart project originated at the European Bioinformatics Institute as a data management solution[1] for the Human Genome Project.[2] Since then, BioMart has grown to become a multi-institute collaboration involving various database projects on five continents.[3][4][5][6]
BioMart QuickQuery Interface | |
Written in | Java |
---|---|
Operating system | Unix-like |
Available in | English |
Type | Federated database system |
License | LGPL |
Website | biomart |
Software
BioMart is a freely available, open-source, federated database system that provides unified access to disparate, geographically distributed data sources.[7] BioMart allows databases hosted on different servers to be presented seamlessly to users, facilitating collaborative projects. BioMart contains several levels of query optimization to efficiently manage large data sets, and offers a diverse selection of graphical user interfaces and application programming interfaces to allow queries to be performed in whatever manner is most convenient for the user. BioMart's capabilities are extended by integration with several widely used software packages such as Bioconductor,[8] Galaxy,[9] Cytoscape,[10] and Taverna.[11]
Services
The BioMart community has set up a large number of servers, which provide access to various types of research data. Using BioMart's data federation technology, the BioMart Central Portal[3][5][6] was established to provide a convenient single point of access to this growing pool of data sources.
Data sources and community
There are around 40 BioMart data sources including the Atlas of UTR Regulatory Activity (AURA), the COSMIC cancer database, Ensembl Genomes, HapMap, InterPro, Mouse Genome Informatics (MGI), Rfam and UniProt. Access is provided by institutions including the European Bioinformatics Institute (EBI) and the Wellcome Trust Sanger Institute in the UK, Cold Spring Harbor Laboratory and the National Center for Biotechnology Information (NCBI) in the United States and French National Centre for Scientific Research (CNRS).[12]
References
- Kasprzyk A, Keefe D, Smedley D, et al. (2004). "EnsMart: a generic system for fast and flexible access to biological data". Genome Res. 14 (1): 160–9. doi:10.1101/gr.1645104. PMC 314293. PMID 14707178.
- Lander ES, Linton LM, Birren B, et al. (2001). "Initial sequencing and analysis of the human genome" (PDF). Nature. 409 (6822): 860–921. Bibcode:2001Natur.409..860L. doi:10.1038/35057062. PMID 11237011.
- Smedley D, Haider S, Ballester B, et al. (2009). "BioMart--biological queries made easy". BMC Genomics. 10: 22. doi:10.1186/1471-2164-10-22. PMC 2649164. PMID 19144180.
- Kasprzyk A (2011). "BioMart: driving a paradigm change in biological data management". Database (Oxford). 2011: bar049. doi:10.1093/database/bar049. PMC 3215098. PMID 22083790.
- Haider S, Ballester B, Smedley D, Zhang J, Rice P, Kasprzyk A (2009). "BioMart Central Portal--unified access to biological data". Nucleic Acids Res. 37 (Web Server issue): W23–7. doi:10.1093/nar/gkp265. PMC 2703988. PMID 19420058.
- Guberman JM, Ai J, Arnaiz O, et al. (2011). "BioMart Central Portal: an open database network for the biological community". Database (Oxford). 2011: bar041. doi:10.1093/database/bar041. PMC 3263598. PMID 21930507.
- Zhang J, Haider S, Baran J, et al. (2011). "BioMart: a data federation framework for large collaborative projects". Database (Oxford). 2011: bar038. doi:10.1093/database/bar038. PMC 3175789. PMID 21930506.
- Durinck S, Moreau Y, Kasprzyk A, Davis S, De Moor B, Brazma A, et al. (2005). "BioMart and Bioconductor: a powerful link between biological databases and microarray data analysis". Bioinformatics. 21 (16): 3439–40. doi:10.1093/bioinformatics/bti525. PMID 16082012.
- Liu B, Madduri RK, Sotomayor B, et al. (2014). "Cloud-based bioinformatics workflow platform for large-scale next-generation sequencing analyses". J Biomed Inform. 49: 119–33. doi:10.1016/j.jbi.2014.01.005. PMC 4203338. PMID 24462600.
- Lopes CT, Franz M, Kazi F, Donaldson SL, Morris Q, Bader GD (2010). "Cytoscape Web: an interactive web-based network browser". Bioinformatics. 26 (18): 2347–8. doi:10.1093/bioinformatics/btq430. PMC 2935447. PMID 20656902.
- Wolstencroft K, Haines R, Fellows D, et al. (2013). "The Taverna workflow suite: designing and executing workflows of Web Services on the desktop, web or in the cloud". Nucleic Acids Res. 41 (Web Server issue): W557–61. doi:10.1093/nar/gkt328. PMC 3692062. PMID 23640334.
- "BioMart". www.biomart.org. Retrieved 14 July 2016.