Metadata
Title
Archiving digital content
Category
general
UUID
d6493d66b9d5400a81495ecfd0e6d367
Source URL
https://www.library.sydney.edu.au/support/publishing/archiving-digital-content
Parent URL
https://www.library.sydney.edu.au/support/publishing/lodging-your-thesis-at-the-...
Crawl Time
2026-03-10T04:42:17+00:00
Rendered Raw Markdown
# Archiving digital content

**Source**: https://www.library.sydney.edu.au/support/publishing/archiving-digital-content
**Parent**: https://www.library.sydney.edu.au/support/publishing/lodging-your-thesis-at-the-library

## **Archiving options for research data**

The following table provides University of Sydney managed options for long-term preservation of research data.

|  |  |  |  |  |
| --- | --- | --- | --- | --- |
| **Archiving option** | **Size limit** | **Access** | **Accepts sensitive data?** | **How to deposit data** |
| [Sydney eScholarship Repository](https://ses.library.usyd.edu.au/) | 5GB | Open | No | [How to submit](https://ses.library.usyd.edu.au/pages/support) |
| [Research Data Store](https://sydneyuni.atlassian.net/wiki/spaces/RC/pages/228589620/Research+Data+Store) | As arranged | Nominated team members | [Yes – if encrypted](https://sydneyuni.service-now.com/sm?sys_kb_id=4a361eb61b81e910aeea635a274bcbd2&id=kb_article_view&sysparm_rank=4&sysparm_tsqueryId=14c3a79dc378391059e72b75e0013175) | [Research Dashboard (DashR)](https://dashr.sydney.edu.au/) |
| [Records Online](https://recordsonline.sydney.edu.au/rol/quicksearch) | 50GB | Nominated team members | Yes | [ICT Self Service Portal](http://sydney.edu.au/trackit) |

The University also keeps a [list of research data platforms that can be used for storing your data](https://sydneyuni.service-now.com/sm?id=kb_article_view&sys_kb_id=4a361eb61b81e910aeea635a274bcbd2) for active projects.

### **Archiving eNotebooks**

For guidance on how to archive data from an eNotebook, see [Archiving completed eNotebooks](/content/dam/library/documents/support/doc_archivingcopleted_enotebooks.pdf).

## **Organise your files**

Give your files meaningful names and organise your content in a logical file structure. This is useful for active files that you are currently working with, and it is essential before publishing or archiving your content.

A well-organised folder structure can save you time and help other users to discover, understand and use your digital content.

### Key considerations

- Keep any raw data in a separate folder from working data.
- Store any consent forms separately for ethics and privacy reasons.
- Nest your folders in the direction that best suits how you plan to use them, e.g. *Location > Method > Date* or *Method > Date > Location*.Don’t create too many empty folders ahead of time.
- Don’t manually change or delete file extensions (e.g. .docx, .pdf, .csv).
- Avoid using special characters  (e.g. \ / : \* ? " < > |), apart from hyphens and underscores, in file names.
- Don’t make file names too long.

### **Helpful guidelines**

- [Guidance on organising digital content](http://ukdataservice.ac.uk/manage-data/format/organising.aspx), UK Data Service
- [Project TIER](https://www.projecttier.org/) protocol for guidance on developing a folder structure

## **File naming**

Select an appropriate naming convention for your files as early as possible and follow it consistently throughout your research or work.

Make sure your naming convention is documented in a README file before archiving or publishing your content.

### **Helpful guidelines**

- [TILS file naming convention](https://www.library.qut.edu.au/about/management/documents/QUTTILSDocNamingConvention.pdf), Queensland University of Technology
- [Naming conventions](https://www.ed.ac.uk/records-management/guidance/records/practical-guidance/naming-conventions), University of Edinburgh

## **File formats**

Save your work in an appropriate file format to ensure long-term accessibility. The file formats you use when working with your data may not be appropriate for archiving or publishing purposes.

You should think about capturing data or converting files into formats that are:

- widely used within your discipline
- publicly documented, i.e. the complete file specification is publicly available
- open and non-proprietary
- endorsed by standards agencies such as the International Organisation for Standardization (ISO)
- self-documenting, i.e. the file itself can include useful metadata
- unencrypted
- uncompressed or that use lossless compression.

### **Recommended long-term formats**

The following table provides general suggestions for file formats that enable long-term preservation of digital content.

|  |  |
| --- | --- |
| **Format category** | **Preservation format options** |
| Archive | - ZIP File Format (.zip) |
| Audio | - PCM Wave Format (.wav) – minimum 16bit/44.1kHz - Broadcast Wave Format (.bwf) - minimum 24bit/48kHz |
| Images | - Tagged Image File Format (.tif, .tiff) |
| Tabular datasets | - Comma Separated Values (.csv) - Microsoft Excel (.xlsx) |
| Text | - Plain Text (UTF-8) (.txt) - PDF/A (.pdf) - PDF/A-3 (.pdf) |
| Video | - Audio Video Interleave (.avi) – uncompressed - MPEG-4 (.mp4) - CODEC: ProRes, H.264, AVC, audio: stereo AAC - MOV (.mov) - CODEC: ProRes, H.264, AVC, audio: stereo AAC - JPEG2000 OP1a MXF (.mxf) - FFV1 Matroska (.mkv) |

For more specific recommendations, please contact the [Library Digital Collections team](mailto:library.digitalcollections@sydney.edu.au). We can provide advice on:

- which file formats to use for long-term preservation, access, and sharing with collaborators
- when and how to convert to these formats.

### **Helpful guidelines**

- [Preferred forms for depositing data with Australian Data Archive](https://ada.edu.au/depositing-data/)
- [National Archives of Australia’s preservation file formats](http://www.naa.gov.au/information-management/managing-information-and-records/preserving/long-term-file-formats.aspx)
- [Mendeley Data: File Formats](https://data.mendeley.com/file-formats)
- [UK Data Archive’s file formats table](http://www.data-archive.ac.uk/create-manage/format/formats-table)
- [Library of Congress Recommended Formats Statement 2018-2019](https://www.loc.gov/preservation/resources/rfs/)
- [National Archives of America Tables of File Formats](https://www.archives.gov/records-mgmt/policy/transfer-guidance-tables.html)
- [Sydney Conservatorium of Music Research Data Management Provisions 2015](http://sydney.edu.au/policies/showdoc.aspx?recnum=PDOC2015/384&RendNum=0) (see “Schedule 4: Preferred file formats for audio visual and other data”)

## **Metadata**

Metadata provides vital contextual information that will help others find, understand and reuse your content. Metadata may include:

- title
- creator(s)
- date produced/collected
- location
- abstract
- subject
- method
- process
- quality
- format
- rights
- ownership.

Consider using a metadata standard (also called a metadata schema), which is a defined set of fields that can either be general or discipline specific. Using a metadata standard will not only provide a rich description of your content, but also increase the likelihood of people finding it.

### **Helpful guidelines**

- [Australian National Data Standard (ANDS) metadata page](https://ardc.edu.au/resource/metadata/)
- [Australian Research Data Commons (ARDC) metadata guide](https://ardc.edu.au/resource/metadata-guide/)

## **Creating documentation**

Once you’ve decided what metadata you need to collect and keep, you should record this information and store it with the digital content.

Some storage systems, like the University’s [eNotebook](https://sydneyuni.service-now.com/sm?id=kb_article_view&sysparm_article=KB0013721), provide mechanisms for you to do this when you save your content.

In other storage systems, like the [Research Data Store](https://sydneyuni.service-now.com/sm?id=kb_article_view&sysparm_article=KB0010833), you may have to record your metadata manually in a README document or using a [version control table](/content/dam/library/documents/support/doc_versioncontrol.docx) (.docx template).

### **Helpful guidelines**

- [Cornell University guide to writing “readme” metadata](https://data.research.cornell.edu/content/readme#bestpractices)

## **Retention periods**

Some digital content will [have a legally mandated retention period](https://intranet.sydney.edu.au/research-support/reporting-services-and-requirements/data-retention-periods.html). This is the minimum amount of time that you need to keep the content after its creation or the completion of a project.

Retention periods vary depending on the content and the context in which it was created, and you must keep your data for*at least*as long as the legal retention requirement.

- ## Related information

  [How to submit to Sydney eScholarship](https://ses.library.usyd.edu.au/pages/roles)
  [Research data management](https://intranet.sydney.edu.au/research-support/managing-research/research-data-management.html) 
  [Research Data Store](https://sydneyuni.service-now.com/sm) 
  [Data retention periods](https://intranet.sydney.edu.au/research-support/reporting-services-and-requirements/data-retention-periods.html)

- ## Contact

  For help with publishing data, contact Sydney eScholarship support. For help with digital preservation, contact the Digital Collections team.

  [Contact Sydney eScholarship support](https://ses.library.usyd.edu.au/pages/support)
  [Contact Digital Collections team](mailto:library.digitalcollections@sydney.edu.au)