Chapter 8 Repository
Use the Repository page to explore the metadata of any UCSC Xena dataset. If you are unfamiliar with UCSC Xena datasets, see Chapter 1 first.
First, users can query datasets by data hub (optionally narrowed to a cohort) and by data type (optionally narrowed to a data subtype). By default, the GDC hub and all data types are selected.
Then, basic information about the matching datasets is displayed in the right panel.
Next, users can select one or more rows (datasets). External links for downloading or browsing the original data of the selected datasets then become available at the bottom.
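The query step above can be pictured as a simple filter over a dataset catalogue. The following is a minimal sketch, not the application's actual implementation: the `Dataset` record, the `query` function, and the catalogue entries are all hypothetical and exist only to illustrate how a hub filter with optional cohort, data type, and data subtype filters narrows the list.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Dataset:
    """Hypothetical record for one row of the repository table."""
    name: str
    hub: str
    cohort: str
    data_type: str
    data_subtype: str


# Made-up catalogue entries, for illustration only.
CATALOGUE = [
    Dataset("example_expr_A", "gdcHub", "Cohort A", "genomicMatrix", "gene expression"),
    Dataset("example_pheno_A", "gdcHub", "Cohort A", "clinicalMatrix", "phenotype"),
    Dataset("example_expr_B", "tcgaHub", "Cohort B", "genomicMatrix", "gene expression"),
]


def query(catalogue, hub="gdcHub", cohort=None, data_type=None, data_subtype=None):
    """Return datasets from `hub`; a filter left as None matches everything."""
    def keep(d):
        return (
            d.hub == hub
            and (cohort is None or d.cohort == cohort)
            and (data_type is None or d.data_type == data_type)
            and (data_subtype is None or d.data_subtype == data_subtype)
        )
    return [d for d in catalogue if keep(d)]


# Default query: GDC hub, all data types.
default_rows = query(CATALOGUE)
# Narrowed query: only genomic matrices from the GDC hub.
expr_rows = query(CATALOGUE, data_type="genomicMatrix")
```

Leaving a filter unset mirrors the page's default of showing all data types for the selected hub.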
In addition, three buttons provide the following functions for the selected datasets:
8.1 Show Metadata
- The “Show Metadata” button displays more detailed information about the selected datasets.
8.2 Request Data
This button provides three ways to download the raw datasets:
- “Download data directly”: Download the data directly to your local device.
- “Batch download in terminal”: Generate a .sh script file for downloading the data in a Linux environment.
- “Copy R download code”: Generate R code for downloading the data, intended for R users.
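To give a rough idea of what a batch download script might contain, the sketch below builds the text of a small .sh file from a list of dataset URLs. It is an illustration under stated assumptions, not the script the application actually generates: the function name `build_download_script`, the output directory `xena_data`, the use of `wget`, and the `example.org` URLs are all hypothetical.

```python
import shlex


def build_download_script(urls, out_dir="xena_data"):
    """Return the text of a bash script that downloads each URL into out_dir."""
    lines = [
        "#!/usr/bin/env bash",
        "set -euo pipefail",  # stop at the first failed download
        f"mkdir -p {shlex.quote(out_dir)}",
    ]
    for url in urls:
        # -c resumes a partial download, -P sets the target directory.
        lines.append(f"wget -c -P {shlex.quote(out_dir)} {shlex.quote(url)}")
    return "\n".join(lines) + "\n"


# Placeholder URLs (assumption); real links come from the repository page.
urls = [
    "https://example.org/download/dataset_A.tsv.gz",
    "https://example.org/download/dataset_B.tsv.gz",
]
script = build_download_script(urls)
print(script)
```

Saving the returned text to a file and running it with `bash` in a Linux terminal would fetch the files one by one.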
8.3 Analyze Data
This button is the first step of General Dataset Analysis, which is introduced in Chapter 9.
- As the following figure shows, users first need to select one or more datasets on the Repository page; these datasets then populate the Pre-selected Datasets for Analysis panel on the General Dataset Analysis page.
- If a genomic matrix dataset (e.g. RNA-seq) is selected, its related clinical metadata (e.g. phenotype or survival data) is added automatically.
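The automatic addition of clinical metadata can be sketched as follows. This is only a hedged illustration: it assumes that "related" means "belonging to the same cohort", which the text does not state, and the function `with_clinical`, the dictionary keys, and the catalogue entries are hypothetical.

```python
def with_clinical(selected, catalogue):
    """Return the selection plus clinical datasets that share a cohort
    with any selected genomic matrix (assumed meaning of "related")."""
    result = list(selected)
    cohorts = {d["cohort"] for d in selected if d["type"] == "genomicMatrix"}
    for d in catalogue:
        if d["type"] == "clinicalMatrix" and d["cohort"] in cohorts and d not in result:
            result.append(d)
    return result


# Made-up catalogue entries, for illustration only.
catalogue = [
    {"name": "example_rnaseq_A", "cohort": "Cohort A", "type": "genomicMatrix"},
    {"name": "example_phenotype_A", "cohort": "Cohort A", "type": "clinicalMatrix"},
    {"name": "example_survival_A", "cohort": "Cohort A", "type": "clinicalMatrix"},
    {"name": "example_phenotype_B", "cohort": "Cohort B", "type": "clinicalMatrix"},
]

# Selecting only the RNA-seq matrix pulls in the Cohort A clinical datasets.
preselected = with_clinical([catalogue[0]], catalogue)
names = [d["name"] for d in preselected]
```

Selecting a clinical dataset on its own adds nothing in this sketch, since only genomic matrices trigger the lookup.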