Gene P9211_18101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_18101 
SymbolvalS 
ID5730119 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp1638174 
End bp1640975 
Gene Length2802 bp 
Protein Length933 aa 
Translation table11 
GC content41% 
IMG OID641286196 
Productvalyl-tRNA synthetase 
Protein accessionYP_001551695 
Protein GI159904351 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0525] Valyl-tRNA synthetase 
TIGRFAM ID[TIGR00422] valyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCTGATC ATTTGGTAAA TCAAGGTTTA AGGCAAGCAG TTGATTCTTT GCCAAAGACG 
TATGACCCAG GTGGGACAGA AAGTCGCTGG CAGAAGTTGT GGGAATCCTC TGGTGCCTTT
CATCCAGACC CTAATGATTC TGGGGAGCCT TTTTCGCTGG TAATTCCTCC TCCAAATGTT
ACTGGCAGTC TTCACATGGG ACATGCTTTT AATACAGCAC TGATTGATAC GATTGTTCGC
TTTCAACGCT TGCAAGGGAA GAATGTACTT TGTTTACCAG GTACAGATCA CGCCTCTATT
GCAGTGCAGA CAATTCTTGA AAAGCAATTT AAAGAAGAAG GAATTAATCG TGATGATTTA
GGTAGAGAAG AATTTTTACA AAGGGCTTGG GCTTGGAAAT CAGAAAGTGG AGGACGTATT
GCGGCTCAAC TTCGACGTTT GGGCTATTCC GTTGATTGGC AAAGAGAAAG ATTTACGATG
GATGAAAGAT TGAGCAAAGC CGTTGTTGAG GCTTTTGTTC GTTTACATCA ACAGGGTTTG
ATTTATAGGG GTGAATATCT TGTGAACTGG TGCCCTGCTT CAGGTTCAGC AGTAAGTGAT
CTAGAAGTTG AAATGAAAGA AGTGGATGGA TATCTATGGC ATTTTCAATA TCCATTAACT
AATGGTTCTT CCGATGATGA GACTACGCAT CTTGAAGTTG CCACTACTCG TCCTGAAACG
ATGCTTGGAG ATGTAGCTGT TGCAGTTAAT CCTTCGGATA ATCGATATAG GCACCTTGTT
GGTAGAACCC TCACTTTGCC ATTTGTAGGT CGAGAAATTC CTGTCATTGC AGATGATCAT
GTTGATAAGG ATTTTGGGAC TGGATGCGTC AAAGTCACCC CAGCACATGA TCCAAATGAT
TTTGCAATAG GTCAGAGGCA TAATTTGCCC CAGATTACTG TTATGAATAA AGACGGCACT
ATGAATAGTA GTGCTGGTCA GTTTGAAGGT TTAGATCGCT TTAAAGCTCG TGAAGCTGTT
GTAAATGGCC TAAAAGAACT GTCATTGCTA ACCAAAATTG AACCATATAG GCATAGCGTG
CCTTTTTCAG ATCGGGGTAA GGTCCCTGTT GAACCATTAT TGTCTACACA GTGGTTTGTT
CGTATGGAGC CGATGGCTGA TCGATGTCGA GCACATCTCG TCAAGGGTGA GCCTCAGTTT
TTCCCTACTC GCTGGGAGAA AGTTTACCGG GATTGGTTAA CTGGTATTCG TGATTGGTGT
ATCAGTCGGC AATTATGGTG GGGTCATAGG ATCCCGGCAT GGTTTGTTGT TAGTGAAACT
GACTCTATAG TGACTGATCA TACTCCTTAT ATAGTGGCAC TTTCAGAGGA GGAAGCACTT
TTACAAGCCC GCGAAAAATT TGGAGACGAT GTTGCTATAG AACAAGATGC AGATGTATTG
GATACTTGGT TCTCGAGTGG TTTATGGCCT TTCTCTACTT TAGGATGGCC TGATGAAGAT
GATTCGGATT TTAACTGTTG GTACCCTACA AATACGTTAG TCACAGGTTT TGATATCATT
TTCTTCTGGG TTGCCAGAAT GACAATGATG GCTGGTGCTT TTACTGGTCA AATGCCTTTT
AAAGATGTTT ATATTCATGG GCTAGTAAGG GACGAACAGA ATCGTAAAAT GAGTAAAAGT
CTTGGTAACG GGATTGATCC CCTTGTATTA ATTGACCGTT ATGGCACTGA TGCGTTGCGC
TTTGCTTTAG TCAAGGAAGT CGCTGGTGCA GGGCAAGACA TCAGACTCGA TTACGATAGG
AAAACAGATA CTTCTGTTAC TGTTGAGGCG GCACGTAATT TTGCGAATAA GTTATGGAAT
GCCACTCGCT TTGCGTTAAT TAATCTAGGT ACTACTACTT TTGATGAGAC ATTTGAAGAA
CTTAATTTCT CTGATTTACA ACTTTCTGAT AAATGGATTT TGTCGAGGTT GGCAAGAATT
AACTCTGAAA CTTCTAAGAA ATTTACAAGT TATGCTCTTG GAGAGGCAGC AAAAGGACTG
TATGAATTTA GTTGGAATGA TTTTTGTGAT TGGTATTTGG AGTTAATTAA GCGTCGACTT
AATCCTGGAG ATTCATTGAC TCAATCTCAA TTGAAAGATA GAAGAATTGC TCAACAAGTA
ATGTTTAAGG TCCTTAGAGA ACTTTTAGTA ATGTTGCACC CCTTAATGCC TCATTTAACT
GAAGAACTTT GGCATGGCAT TACAGGTTTC CCTGACCAAA AGATGCTTGC GTTAAACCTT
TGGCCCGCAT GTCAAGAAGG CTTTTTAGAT GAAGAGTTAG AGAGTTCTTT CTCTGCACTT
TTTGCTTCTA TTAAATTGGT TCGTAATTTG CGTGCTGAAG CTGGGTTGAA GCCTGGTCAA
AATGTACCAG TCCGTTTTGT TACGACTAAC AGCAAACTTG CAGATATTCT TCACAAGGCC
AAAGCGGATA TCCAGTCACT AACACGAGCG AATAAAGTAG AAGTTTTTCA TCCGGACGAA
CTTCTTGGTA AACCTTCTAT CAAAGCTTTA GCAGGTGTTT CTGGCGATCT AGAGGTCCTT
TTGCCTATTG AGGGCTTAGT TGATTTGGAT GGATTGCGGA AACGTTTAGA AAAAGATTTA
ATCAAAGCAC AAAAAGAAAT AACTTCTCTG TCTAAAAGAT TAGATAATCC AAGCTTTATT
AACAAAGCTC CTGAAGCCAT TATTTTAGAT TGTAAGAGTA AACTAATTGC CGCAAAATCA
CAAGCTGATT TAGTTATTAA GCGCATTGCA GGTTTGAGTT GA
 
Protein sequence
MADHLVNQGL RQAVDSLPKT YDPGGTESRW QKLWESSGAF HPDPNDSGEP FSLVIPPPNV 
TGSLHMGHAF NTALIDTIVR FQRLQGKNVL CLPGTDHASI AVQTILEKQF KEEGINRDDL
GREEFLQRAW AWKSESGGRI AAQLRRLGYS VDWQRERFTM DERLSKAVVE AFVRLHQQGL
IYRGEYLVNW CPASGSAVSD LEVEMKEVDG YLWHFQYPLT NGSSDDETTH LEVATTRPET
MLGDVAVAVN PSDNRYRHLV GRTLTLPFVG REIPVIADDH VDKDFGTGCV KVTPAHDPND
FAIGQRHNLP QITVMNKDGT MNSSAGQFEG LDRFKAREAV VNGLKELSLL TKIEPYRHSV
PFSDRGKVPV EPLLSTQWFV RMEPMADRCR AHLVKGEPQF FPTRWEKVYR DWLTGIRDWC
ISRQLWWGHR IPAWFVVSET DSIVTDHTPY IVALSEEEAL LQAREKFGDD VAIEQDADVL
DTWFSSGLWP FSTLGWPDED DSDFNCWYPT NTLVTGFDII FFWVARMTMM AGAFTGQMPF
KDVYIHGLVR DEQNRKMSKS LGNGIDPLVL IDRYGTDALR FALVKEVAGA GQDIRLDYDR
KTDTSVTVEA ARNFANKLWN ATRFALINLG TTTFDETFEE LNFSDLQLSD KWILSRLARI
NSETSKKFTS YALGEAAKGL YEFSWNDFCD WYLELIKRRL NPGDSLTQSQ LKDRRIAQQV
MFKVLRELLV MLHPLMPHLT EELWHGITGF PDQKMLALNL WPACQEGFLD EELESSFSAL
FASIKLVRNL RAEAGLKPGQ NVPVRFVTTN SKLADILHKA KADIQSLTRA NKVEVFHPDE
LLGKPSIKAL AGVSGDLEVL LPIEGLVDLD GLRKRLEKDL IKAQKEITSL SKRLDNPSFI
NKAPEAIILD CKSKLIAAKS QADLVIKRIA GLS