UploadLargeData: Difference between revisions
Jump to navigation
Jump to search
No edit summary |
No edit summary |
||
Line 10: | Line 10: | ||
### Chmod og+x /lustre/scratch/''user''/proj1 | ### Chmod og+x /lustre/scratch/''user''/proj1 | ||
# transfer the files with SCP | # transfer the files with SCP | ||
## scp *.fastq.gz ''user''@cheaha.uabgrid.uab.edu:/lustre/scratch/''user''/proj1 | |||
## I used "Secure Shell Client" for windows, available at [http://www.uab.edu/it/software/displaytitle.php?ItemID=34 UABIT] | ## I used "Secure Shell Client" for windows, available at [http://www.uab.edu/it/software/displaytitle.php?ItemID=34 UABIT] | ||
## open source client is [http://www.chiark.greenend.org.uk/~sgtatham/putty/ PuTTY] | |||
# '''UNCOMPRESS fastq.gz files!!''' | # '''UNCOMPRESS fastq.gz files!!''' | ||
## cd /lustre/scratch/''user''/proj1 | ## cd /lustre/scratch/''user''/proj1 | ||
Line 27: | Line 29: | ||
### add Datasets | ### add Datasets | ||
#### Upload option: Upload files from system path | #### Upload option: Upload files from system path | ||
##### | ##### Get a list of absolute path names using one of the following | ||
###### cd /lustre/scratch/''user''/proj1 THEN RUN find `pwd` -name "*.fastq" | |||
###### find /lustre/scratch/''user''/proj1 -name "*.fastq" | |||
##### paste list of absolute path names into URL/Text box in Web Admin GUI | ##### paste list of absolute path names into URL/Text box in Web Admin GUI | ||
#### Change "Copy data into Galaxy?" to "Link to files without copying into Galaxy" | #### Change "Copy data into Galaxy?" to "Link to files without copying into Galaxy" |
Revision as of 20:56, 23 June 2011
Load and Link approach
transfer and uncompress (slow)
- login to cheaha.uabgrid.uab.edu (linux),
- create directory for this data set in your scratch dir
- mkdir /lustre/scratch/user/proj1
- make sure that directory is readable by galaxy user
- Chmod og+x /lustre/scratch/user
- Chmod og+x /lustre/scratch/user/proj1
- transfer the files with SCP
- UNCOMPRESS fastq.gz files!!
- cd /lustre/scratch/user/proj1
- find `pwd` -name "*.gz" -exec ksh -c 'qrsh "gzip -d \{}" &' \;
- ls -1 *.gz | xargs -L 1 -i_f_ ksh -c 'qrsh -cwd gzip -d _f_ &' \;
- gzip -d filename
- cd /lustre/scratch/user/proj1
- make sure the files are readable by galaxy user
- Chmod og+r /lustre/scratch/user/proj1/*
link into galaxy dataset (fast)
- get Admin privileges on galaxy
- either get Shantanu to make you admin in Galaxy
- or grab someone who is (John, Curtis)
- In Galaxy GUI:
- admin > Manage Data Libraries > create new library
- add Datasets
- Upload option: Upload files from system path
- Get a list of absolute path names using one of the following
- cd /lustre/scratch/user/proj1 THEN RUN find `pwd` -name "*.fastq"
- find /lustre/scratch/user/proj1 -name "*.fastq"
- paste list of absolute path names into URL/Text box in Web Admin GUI
- Get a list of absolute path names using one of the following
- Change "Copy data into Galaxy?" to "Link to files without copying into Galaxy"
- Put something mnemonic in Message box.
- Upload option: Upload files from system path
- add Datasets
- admin > Manage Data Libraries > create new library
link data into a history (fast)
I could then select the datasets and, at bottom of page "For selected datasets: <Import to histories>" and get them into a history so I can compute on them.