Data and information are key assets for modern business. Large complex and incomplete datasets are common in industry. Exploiting that data successfully can give a major competitive advantage while, if it is not managed successfully, its value is often lost.
Given the scale of today’s data sets and a seemingly insatiable demand to collect even more, organisations must turn to new approaches and new technologies for data solutions.METHODS AND TECHNOLOGIES
Tackling the challenges in data intensive systems involves a mixture of methods and technologies. Development of appropriate semantic web and digital repositories, integrating and linking data from different sources, using appropriate data standards and semantics are all essential. Large volumes of data can also be tackled through the use of distributed computing methods such as clouds and grid-based systems.KEY INDUSTRY COLLABORATIONS
We were a partner in a long-term, large-scale research programme funded by the National Science Foundation in the USA, to develop a Scientific Environment for Ecological Knowledge (SEEK). Our research helped identify a problem caused by the ambiguity of biological names which was exacerbated in a cyberinfrastruture. We addressed the problem using a 'concept' rather than 'name' approach for identifiers, and developed a community-wide data standard for interchanging taxonomic information. This became the TCS standard adopted by the International Biodiversity Standards Community and implemented in global tools and databases.
Buono, P. (2011). Research directions in data wrangling: Visualizations and transformations for usable and credible data. Information Visualization, October 2011, (10), 271-288.
Peterson, L. (2010). Not what you know, nor who you know, but who you know already: examining online information sharing behaviours through the lens of social exchange theory. Libri, 60, (2), 117-128.
Hart, E. (2011). On Clonal Selection. Theoretical Computer Science, 412, (6), 502-516.
Kennedy, J. (2011). A Rule Based Taxonomy of Dirty Data. GSTF International Journal on Computing, 1, (2), 140-148.
Law, A. (2012). VIPER: a visualisation tool for exploring inheritance inconsistencies in genotyped pedigrees. BMC Bioinformatics, 13 (Suppl 8), (S5), .
Rasmussen, L. (2008). Knowledge Networking in a Public Service Agency: Contextual Challenges and Infrastructure Issues. In: Srikantaiah, T.,
Koenig, M. (Eds.) Knowledge Management in Practice: Connections and Context, , () ( ed.). (pp. 445-462). USA: . Information Today, Inc.