Data Intensive Systems

Data and information are key assets for modern business. Large complex and incomplete datasets are common in industry. Exploiting that data successfully can give a major competitive advantage while, if it is not managed successfully, its value is often lost.

Given the scale of today’s data sets and a seemingly insatiable demand to collect even more, organisations must turn to new approaches and new technologies for data solutions.

METHODS AND TECHNOLOGIES
Tackling the challenges in data intensive systems involves a mixture of methods and technologies. Development of appropriate semantic web and digital repositories, integrating and linking data from different sources, using appropriate data standards and semantics are all essential. Large volumes of data can also be tackled through the use of distributed computing methods such as clouds and grid-based systems.

KEY INDUSTRY COLLABORATIONS
We were a partner in a long-term, large-scale research programme funded by the National Science Foundation in the USA, to develop a Scientific Environment for Ecological Knowledge (SEEK). Our research helped identify a problem caused by the ambiguity of biological names which was exacerbated in a cyberinfrastruture. We addressed the problem using a 'concept' rather than 'name' approach for identifiers, and developed a community-wide data standard for interchanging taxonomic information. This became the TCS standard adopted by the International Biodiversity Standards Community and implemented in global tools and databases.
 
[Read More]

Case Studies link icon

  • Visual Analytics of Unknown Data Sets
    Using statistical methods and information visualisation involves working with known data sets to discover information locked within the data. I-nquire believes that businesses could benefit from advanced statistical techniques against arbitrary data sets.
next prev

Projects link icon

  • Scottish Woodlands KTP
    Scottish Woodlands Ltd is Scotland's leading independent full-service forest management company. It is also the second largest operator in the UK, with a network of 17 offices operating throughout Scotland, England, Wales and Northern Ireland.
  • Research Activities I/Net
    Information Management and Presentation for Research Activity and Related Data. A linked intranet and internet solution for the managing and presentation of data concerning the research related activties of university researchers.
next prev

Contact linked in profile of Data Intensive Systems web page for Data Intensive Systems



r.kukla@napier.ac.uk

+44 (0)131 455 2738

Robert Kukla
Room C51
Merchiston Campus
10 Colinton Road
Edinburgh
EH10 5DT

Selected Publications

Kandel, S., Heer, J., Plaisant, C., Kennedy, J., Ham, F., Riche, N., Weaver, C., Lee, B., Brodbeck, D., Buono, P. (2011). Research directions in data wrangling: Visualizations and transformations for usable and credible data. Information Visualization, October 2011, (10), 271-288.

Hall, H., Widen-Wulff, G., Peterson, L. (2010). Not what you know, nor who you know, but who you know already: examining online information sharing behaviours through the lens of social exchange theory. Libri, 60, (2), 117-128.

McEwan, C., Hart, E. (2011). On Clonal Selection. Theoretical Computer Science, 412, (6), 502-516.

Li, L., Peng, T., Kennedy, J. (2011). A Rule Based Taxonomy of Dirty Data. GSTF International Journal on Computing, 1, (2), 140-148.

Paterson, T., Graham, M., Kennedy, J., Law, A. (2012). VIPER: a visualisation tool for exploring inheritance inconsistencies in genotyped pedigrees. BMC Bioinformatics, 13 (Suppl 8), (S5), .

Taylor-Smith, E., Lindner, R. (2010). Social networking tools supporting constructive involvement throughout the policy-cycle. In: Prosser, P. (Ed.) Proceedings of EDEM 2010 - Conference on Electronic Democracy, May 7-8, 2010, Danube-University Krems, Austria, , () ( ed.). (pp. ). Vienna, Austria: . Austrian Computer Society.