Course home > SRS worked example

We are going to try to find all the mammalian osteonectin proteins in Swissprot.

From the SRS front page, click on the Library Page tab. This takes you to the library select page. Click on the checkboxes beside UniprotKB/Swiss-Prot (this includes the final Swissprot versions of a protein entry) and beside UniprotKB/TrEMBL (includes the still preliminary versions that have not yet been fully annotated). SwissProt and TrEMBL (translated proteins from EMBL) are both subsets of the UniProt KnowledgeBase. We then click on the Standard Query Form box, which takes us to the standard query page. There are four main text boxes in the center of the page. Change the pull-down menu of the first one to 'Description', and type '*osteonectin*' into the text-entry box. The reason we surround this keyword with wildcards is because in some entries, the osteonectin keyword is surrounded by braces. The indexing system that SRS currently split words in the description field by whitespace, so the braces are included in the keyword. Change the second pull down menu to 'Organism Name', and type 'mammalia' into the relevant text-entry box. Change the Result Display Options view box as 'SeqSimpleView'. The UniprotView has more details, but it is easier to see at a glance what sequences one has when using the SeqSimpleView. Click on Search.

Show SRS QUERY PAGE image

The query found sixteen entries from Swissprot and TrEMBL. Twelve of these are osteonectin proteins from Swissprot proper, which include two partial sequences for pig and mink. Five sequences from Swissprot proper are testican sequences, which are related to osteonectin. There are also four TrEMBL sequences. If we look more carefully at these sequences from TrEMBL, we can see that two seem to be sequence variants of the human testican 2 sequence, and one is another testican 2 sequence from mouse, for which we already have full-length proteins. The fourth one appears to be a new osteonectin sequence from macaque.

We can now save these sequences to a file. Check the boxes beside all the sequences we want to keep, namely all the full-length, or nearly full-length, protein sequences from Swissprot, excluding the testican sequences (a total of five sequences) but including the TrEMBL macaque sequence. By default, as soon as you check a box, the radio button under the Apply Options To heading changes from 'unselected results only' to 'selected results only'. Change the 'View Results using' menu under the Display Options heading to FastaSeqs. Then, click on save. This will take you to another page where you are reminded how many sequences you are saving, and what the identities of the proteins that you are saving are. Make sure the view format is FastaSeqs and that you choose 'File(text)' from the Output To heading. Click on Save.

Show SRS SAVE PAGE image

Your browser should ask you what you would like to call your file (default on IE5: wgetz), and where you want to save it. Type 'osteonectin.fasta' and click Save.

News
Jul, 2009; ChIPseeqer, a comprehensive framework for analysis of ChIP-seq data developed in the Elemento lab, is now available for download. [More]
Apr, 2009; The BDVal program developed by the Campagne laboratory for MAQC-II is now available from http://bdval.org. The software supports the development and evaluation of predictive biomarker models from high-throughput data. The web site offers binary and source distributions. [More]
Jan, 2009; Twease now supports searching MEDLINE articles by Author, Journal, and Publication Year. Examples for performing these searches can be found in the updated Twease tutorial. [More]

[News Archives] [Mailing List]


Events
Dec 11th; 4:00pm-5:00pm: Institute for Computational Biomedicine Research in Progress Seminar Series - Fabien Campagne; ICB Conference Room - Y.1301
Jan 15th; 4:00pm-5:00pm: Institute for Computational Biomedicine Research in Progress Seminar Series - Lei Shi; ICB Conference Room - Y.1301
Feb 12th; 4:00pm-5:00pm: Institute for Computational Biomedicine Research in Progress Seminar Series - Christopher E. Mason; ICB Conference Room - Y.1301
Mar 12th; 4:00pm-5:00pm: Institute for Computational Biomedicine Research in Progress Seminar Series - Olivier Elemento; ICB Conference Room - Y.1301
Apr 9th; 4:00pm-5:00pm: Institute for Computational Biomedicine Research in Progress Seminar Series - Emre Aksay; ICB Conference Room - Y.1301
May 14th; 4:00pm-5:00pm: Institute for Computational Biomedicine Research in Progress Seminar Series - Jonathan D. Victor; ICB Conference Room - Y.1301
Jun 11th; 4:00pm-5:00pm: Institute for Computational Biomedicine Research in Progress Seminar Series - Harel Weinstein; ICB Conference Room - Y.1301
Jul 9th; 4:00pm-5:00pm: Institute for Computational Biomedicine Research in Progress Seminar Series - Duane Hassane; ICB Conference Room - Y.1301