Gene PICST_72245 details


Gene Information

Locus tag: PICST_72245
Symbol: VAS1
ID: 4839535
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009045
Strand:
Start bp: 1117239
End bp: 1120518
Gene Length: 3280 bp
Protein Length: 1051 aa
Translation table: 12
GC content: 46%
IMG OID: 640390850
Product: valyl-tRNA synthetase
Protein accession: XP_001385224
Protein GI: 126137401
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0525] Valyl-tRNA synthetase
TIGRFAM ID: [TIGR00422] valyl-tRNA synthetase
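
The coordinates and lengths in this table can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the coding region of this unspliced gene consists of one codon per residue plus a single stop codon. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1117239
end_bp = 1120518
protein_length_aa = 1051

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one codon per residue plus one stop codon, no introns.
cds_length_bp = (protein_length_aa + 1) * 3

# Bases in the reported gene span that lie outside the coding region.
flanking_bp = gene_length_bp - cds_length_bp

print(gene_length_bp, cds_length_bp, flanking_bp)
```

Under these assumptions the computed span agrees with the reported gene length of 3280 bp, and the difference between the span and the coding region corresponds to sequence flanking the open reading frame.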


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
AAGAGTCTCA TACGTGGAAT AGTCATCAGT CTGCCTCTTA AGAATACACT TCCGATTTGC 
AGAAGACATT TAGCCACCAT GAGTGACCAG AAAGACAATT CTACTGCTAC TTCGGCTCCT
GCCCCAGCTG AAGGAGCACC AGTAAAGACC GCCAAACAGT TGGAAAACGA ACGTAAGAAG
GCCGAGAAGC TCGCAAAATT CGAAGCCAAG AAGGCCAAGC AAGCAGCTGC TGCCCAATCG
AAGTCGGCTG AGCCAAAGAA GCCCAAGAAG GAAAAGAAGG TTGCCGAACC TGTTCCAGAA
TTTGTAGATG AAACCAAGCC AGGAGAAAAG AAGATTCTTG TCTCTCTTGA GGACGCTGCC
TTCAAAGCTT ACAACCCCAA GAATGTTGAG TCATCCTGGT ACAGCTGGTG GGACAAGGAA
GGACTTTTCC AGCCTGAATT GACTGCCGAT GGTAACATTA AGCCTCAAGG TGCCTTTACT
ATTCCAGCAC CTCCACCCAA TGTCACCGGT GCCTTGCACA TTGGACATGC CTTGACTGTT
TCTATCCAAG ATGCGTTGAT CAGATACTAC AGAATGAAGG GAAAAACTAC ACTTTTCCTT
CCAGGTTTCG ACCACGCCGG TATCGCTACC CAATCTGTTG TGGAGAAGCA GATCTGGGCT
AAAGAGAAGA AGACAAGACA TGACTATGGA AGAGAGAAGT TCGTTGAAAA GGTCTGGGAA
TGGAAGGAAG AGTACCATTC CAGAATCAAG AACCAATTCA AAAAGTTGGG AGCTTCTTAT
GACTGGACTA GAGAAGCCTT TACCTTGAAT CCTGACTTGT CGGCTGCTGT CACTGAAGCC
TTTGTCAGAA TGCACGATGA CGGTACTATC TACAGAGCTA CCAGATTGGT CAACTGGTCC
ACCAAGTTGA ACACTGCTAT CTCGAACTTG GAAGTCGACA ACAAGAACAT TCCTGGTAAG
ACTCTATTGG CTGTTCCTGG CTATGATGAC AAGGTTGAGT TTGGTGTATT GACTTCGTTT
TCATACGAAG TAGATGGATC TGATGAAAAG CTTACTGTAG CCACTACTAG ACCAGAAACC
ATCTTTGGTG ATACTGGTGT AGCTGTTCAC CCCAAGGATC CTAGATACAC GCATTTGCAT
GGCAAGTTTG TGAAGCATCC ATTCTTGGAC CGTTTGTTGC CTATTGTTAC TGACTCTGAA
GCTGTTGACA TGGAATTCGG TACTGGTGCC GTCAAGATCA CTCCCGCCCA TGACCAGAAC
GATTACCAGA CAGGTAAGAG ACAAAACTTA GAGTTTATCA ACATCTTTAC TGATGATGGG
TTCTTGAACG AGAACTGTGG AGAGTACAAG GGCTTGAAGA GATTCGATGC AAGAACCGTT
GTGATTGAAC AGTTGAAGGA AAAGGGCTTG TTTGTTGAAC AAAAGGACAA TGAGATGACG
ATTCCTCTCT GTTCTAGATC GGGTGACGTC ATTGAACCCT TGTTGAAGCC CCAATGGTGG
GTCAACCAGC AAGAAATGGC CAAGGACGCC ATTGCTGCTG TCAAGAACGG TGATCTCACC
ATAACTCCTA AGACATCAGA ATCCGAGTAC TTCTACTGGA TGGAAAACAT CCAGGACTGG
TGTATCTCCC GTCAATTGTG GTGGGGCCAC AGATGTCCTG TTTACTTTGT GGTTGTAGAA
GGCGAAGAAG GTGACAGACT CGACAATAAC TATTGGATTG CTGCCAGATC TCACGAAGAA
GCTTTGGAAA AGGCTCAGAA GAAGTTTGCT GGTGTAAAGT TTACTTTGGA GCAAGACGAA
GACGTTTTGG ACACCTGGTT CTCTTCTGGT TTGTGGCCTA TTTCCACCTT GGGGTGGCCT
AATGCCACTA GAGACATGGA GTTGTTCAAT CCTATGTCAA TGTTAGAGAC CGGTTGGGAT
ATTTTGTTTT TCTGGGTGTC TCGTATGATC TTGTTGTCGT TGAAGTTGAC TGGAAAGGTA
CCTTTCAAGG AAGTATTCTG CCACTCGTTG GTTCGTGATG CTCAGGGCAG AAAGATGTCC
AAGTCGTTGG GTAACGTTAT CGACCCCTTG GATGTCATTG CTGGTATCCC CTTGCAAGGA
TTGCACGACA AGTTGAAGGT TGGTAACTTG GATCCTAGGG AATTACAAAA GGCCACAGAT
GGCCAAAAGC TCTCGTACCC TAACGGAATT CCTGAGTGTG GTACGGACGC CTTGAGATTC
GCTCTCTGTG CCTACTCCAC CGGTGGCAGA GATATCAACT TGGACATCTT GAGAGTTGAA
GGCTACCGTA AATTCTGTAA CAAGATCTAC CAAGCCACCA AGTTTGTATT GGGAAGATTG
GGTGAAGACT TCAAGCCAGC CGAAACCCCA GCTAAAACTG GCAATGAGTC TTTGGTAGAG
AAGTGGATTC TTTACAAGTT GACTCAAGCT ACTGCTAAAA CCAACAAGGC CATTGAAAAC
AGAGACTTCG GTGATGCCAC CAACCACATA TATAATTTCT GGTATGACTT GTGTGATGTC
TATATTGAAA ACTCCAAGGC TTTGATCCAG GACGGTACTG CTGAACAGAA GAAGTCTGCT
CAAGACACCT TGTACACTTG TATCGACTCA TCCTTGAGAT TGATTCACCC TTTCATGCCA
TTTGTAACGG AAGAAATGTG GCAGAGATTG CCTCGTCGTG CTGGTGACTC TACCAAATCG
ATTGTCGTAG CCGCATACCC TGAATACATT AAGGAGTTCG ATGACAAAGA AGCCCACGAA
GCCTATGAGT TGGTCTTGGA AATCACCAAG GGTGCCAGAT CGTTGTTGTC GCAATACAAC
ATCCTCAAGC ACGGCCAAGT GTATGTTGAA ACTGCCAAGA AGGACATCTT CGACATTGCC
TCGGAACAGC ACGACTCCAT CGTCTCGTTG ATCAAGGGTG TTGACAAGAT CACCGTTGTT
TCTAAGGTGG AAGATGTACC TTCTGGTTGT GCCTTGCAAG CCATTGGCCC AGACTGTACT
GTCCATGTGT TGGTGAAGGG CCAAATTGAC TTGGATGCTG AAATTGCCAA GGTGGAAAAG
AAGTTGGACG CTGCTACAGA ATTCGACAAG AAGACCAAGG AGGCCATCGA GAAGTTCACG
GAAAAGACCC AGCCTGCTGC CAAGGAGGCT GCTTTCAAGA GATTGGAGAA GGTCACTGCT
GAAATCGAAG GATACCAACA GACCATTGCC ATTTTGGAGA AGTTGAAGCT CTAGATAGAG
TAGAATGTAA ATACAGAAAT GAAATACAAG AATGGGAAAA
 
Protein sequence
MSDQKDNSTA TSAPAPAEGA PVKTAKQLEN ERKKAEKLAK FEAKKAKQAA AAQSKSAEPK 
KPKKEKKVAE PVPEFVDETK PGEKKILVSL EDAAFKAYNP KNVESSWYSW WDKEGLFQPE
LTADGNIKPQ GAFTIPAPPP NVTGALHIGH ALTVSIQDAL IRYYRMKGKT TLFLPGFDHA
GIATQSVVEK QIWAKEKKTR HDYGREKFVE KVWEWKEEYH SRIKNQFKKL GASYDWTREA
FTLNPDLSAA VTEAFVRMHD DGTIYRATRL VNWSTKLNTA ISNLEVDNKN IPGKTLLAVP
GYDDKVEFGV LTSFSYEVDG SDEKLTVATT RPETIFGDTG VAVHPKDPRY THLHGKFVKH
PFLDRLLPIV TDSEAVDMEF GTGAVKITPA HDQNDYQTGK RQNLEFINIF TDDGFLNENC
GEYKGLKRFD ARTVVIEQLK EKGLFVEQKD NEMTIPLCSR SGDVIEPLLK PQWWVNQQEM
AKDAIAAVKN GDLTITPKTS ESEYFYWMEN IQDWCISRQL WWGHRCPVYF VVVEGEEGDR
LDNNYWIAAR SHEEALEKAQ KKFAGVKFTL EQDEDVLDTW FSSGLWPIST LGWPNATRDM
ELFNPMSMLE TGWDILFFWV SRMILLSLKL TGKVPFKEVF CHSLVRDAQG RKMSKSLGNV
IDPLDVIAGI PLQGLHDKLK VGNLDPRELQ KATDGQKLSY PNGIPECGTD ALRFALCAYS
TGGRDINLDI LRVEGYRKFC NKIYQATKFV LGRLGEDFKP AETPAKTGNE SLVEKWILYK
LTQATAKTNK AIENRDFGDA TNHIYNFWYD LCDVYIENSK ALIQDGTAEQ KKSAQDTLYT
CIDSSLRLIH PFMPFVTEEM WQRLPRRAGD STKSIVVAAY PEYIKEFDDK EAHEAYELVL
EITKGARSLL SQYNILKHGQ VYVETAKKDI FDIASEQHDS IVSLIKGVDK ITVVSKVEDV
PSGCALQAIG PDCTVHVLVK GQIDLDAEIA KVEKKLDAAT EFDKKTKEAI EKFTEKTQPA
AKEAAFKRLE KVTAEIEGYQ QTIAILEKLK L