Create, name and organise your data files according to best practice.
File formats for archiving, preservation and access
File formats suitable for long term preservation and access are recommended by State and National Archives authorities.
Organising your Data folders
Organising your data folders on network drives and other locations will make it easy to find and organise your data files.
Useful file names
Other tips for file naming
A number of publishers are allowing data and text mining of their licensed resources by members of subscribing institutions. This access is generally governed by Database Licenses, terms and conditions and existing copyright provisions.
Some publishers will require you to use tools that they provide to mine their content, or will conduct the process for you. In this way they can manage the quantity of data being accessed and the impact on their servers.
Downloading large amounts of data can trigger automatic lockouts and prevent access to resources by other users. Some publishers may apply a fee for the additional usage that sits outside of our existing agreement.
Please consult with your Faculty Librarians if you are considering using Library subscription databases as a source of data.
Knowing which software to use for data management and analysis is important, and can differ according to your research discipline.
Software for Statistical analysis
The Quantitative Consulting Unit has tutorials or training for analysis software such as R, SPSS, G*Power3.1, @Risk and Netica
Software for Data and Text Mining
Other Open Source Software for Data and Text Mining
Voyant - Voyant is a free online program for text analysis
Orange - open source software for data analysis, data visualisation and has add-ons for text mining and bioinformatics
VosViewer - VOSviewer also offers text mining functionality that can be used to construct networks of terms extracted from a body of scientific literature.
PubVenn - PubVenn takes a complex PubMed search and divides it into its constituent parts. It then shows the citations using a proportionally-sized Venn diagram.
Working with your data
OpenRefine (formerly Google Refine) is a free open source tool for working with messy data "... cleaning it; transforming it from one format into another; and extending it with web services and external data...."