Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_18101 |
Symbol | valS |
ID | 5730119 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 1638174 |
End bp | 1640975 |
Gene Length | 2802 bp |
Protein Length | 933 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 641286196 |
Product | valyl-tRNA synthetase |
Protein accession | YP_001551695 |
Protein GI | 159904351 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | [TIGR00422] valyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCTGATC ATTTGGTAAA TCAAGGTTTA AGGCAAGCAG TTGATTCTTT GCCAAAGACG TATGACCCAG GTGGGACAGA AAGTCGCTGG CAGAAGTTGT GGGAATCCTC TGGTGCCTTT CATCCAGACC CTAATGATTC TGGGGAGCCT TTTTCGCTGG TAATTCCTCC TCCAAATGTT ACTGGCAGTC TTCACATGGG ACATGCTTTT AATACAGCAC TGATTGATAC GATTGTTCGC TTTCAACGCT TGCAAGGGAA GAATGTACTT TGTTTACCAG GTACAGATCA CGCCTCTATT GCAGTGCAGA CAATTCTTGA AAAGCAATTT AAAGAAGAAG GAATTAATCG TGATGATTTA GGTAGAGAAG AATTTTTACA AAGGGCTTGG GCTTGGAAAT CAGAAAGTGG AGGACGTATT GCGGCTCAAC TTCGACGTTT GGGCTATTCC GTTGATTGGC AAAGAGAAAG ATTTACGATG GATGAAAGAT TGAGCAAAGC CGTTGTTGAG GCTTTTGTTC GTTTACATCA ACAGGGTTTG ATTTATAGGG GTGAATATCT TGTGAACTGG TGCCCTGCTT CAGGTTCAGC AGTAAGTGAT CTAGAAGTTG AAATGAAAGA AGTGGATGGA TATCTATGGC ATTTTCAATA TCCATTAACT AATGGTTCTT CCGATGATGA GACTACGCAT CTTGAAGTTG CCACTACTCG TCCTGAAACG ATGCTTGGAG ATGTAGCTGT TGCAGTTAAT CCTTCGGATA ATCGATATAG GCACCTTGTT GGTAGAACCC TCACTTTGCC ATTTGTAGGT CGAGAAATTC CTGTCATTGC AGATGATCAT GTTGATAAGG ATTTTGGGAC TGGATGCGTC AAAGTCACCC CAGCACATGA TCCAAATGAT TTTGCAATAG GTCAGAGGCA TAATTTGCCC CAGATTACTG TTATGAATAA AGACGGCACT ATGAATAGTA GTGCTGGTCA GTTTGAAGGT TTAGATCGCT TTAAAGCTCG TGAAGCTGTT GTAAATGGCC TAAAAGAACT GTCATTGCTA ACCAAAATTG AACCATATAG GCATAGCGTG CCTTTTTCAG ATCGGGGTAA GGTCCCTGTT GAACCATTAT TGTCTACACA GTGGTTTGTT CGTATGGAGC CGATGGCTGA TCGATGTCGA GCACATCTCG TCAAGGGTGA GCCTCAGTTT TTCCCTACTC GCTGGGAGAA AGTTTACCGG GATTGGTTAA CTGGTATTCG TGATTGGTGT ATCAGTCGGC AATTATGGTG GGGTCATAGG ATCCCGGCAT GGTTTGTTGT TAGTGAAACT GACTCTATAG TGACTGATCA TACTCCTTAT ATAGTGGCAC TTTCAGAGGA GGAAGCACTT TTACAAGCCC GCGAAAAATT TGGAGACGAT GTTGCTATAG AACAAGATGC AGATGTATTG GATACTTGGT TCTCGAGTGG TTTATGGCCT TTCTCTACTT TAGGATGGCC TGATGAAGAT GATTCGGATT TTAACTGTTG GTACCCTACA AATACGTTAG TCACAGGTTT TGATATCATT TTCTTCTGGG TTGCCAGAAT GACAATGATG GCTGGTGCTT TTACTGGTCA AATGCCTTTT AAAGATGTTT ATATTCATGG GCTAGTAAGG GACGAACAGA ATCGTAAAAT GAGTAAAAGT CTTGGTAACG GGATTGATCC CCTTGTATTA ATTGACCGTT ATGGCACTGA TGCGTTGCGC TTTGCTTTAG TCAAGGAAGT CGCTGGTGCA GGGCAAGACA TCAGACTCGA TTACGATAGG AAAACAGATA CTTCTGTTAC TGTTGAGGCG GCACGTAATT TTGCGAATAA GTTATGGAAT GCCACTCGCT TTGCGTTAAT TAATCTAGGT ACTACTACTT TTGATGAGAC ATTTGAAGAA CTTAATTTCT CTGATTTACA ACTTTCTGAT AAATGGATTT TGTCGAGGTT GGCAAGAATT AACTCTGAAA CTTCTAAGAA ATTTACAAGT TATGCTCTTG GAGAGGCAGC AAAAGGACTG TATGAATTTA GTTGGAATGA TTTTTGTGAT TGGTATTTGG AGTTAATTAA GCGTCGACTT AATCCTGGAG ATTCATTGAC TCAATCTCAA TTGAAAGATA GAAGAATTGC TCAACAAGTA ATGTTTAAGG TCCTTAGAGA ACTTTTAGTA ATGTTGCACC CCTTAATGCC TCATTTAACT GAAGAACTTT GGCATGGCAT TACAGGTTTC CCTGACCAAA AGATGCTTGC GTTAAACCTT TGGCCCGCAT GTCAAGAAGG CTTTTTAGAT GAAGAGTTAG AGAGTTCTTT CTCTGCACTT TTTGCTTCTA TTAAATTGGT TCGTAATTTG CGTGCTGAAG CTGGGTTGAA GCCTGGTCAA AATGTACCAG TCCGTTTTGT TACGACTAAC AGCAAACTTG CAGATATTCT TCACAAGGCC AAAGCGGATA TCCAGTCACT AACACGAGCG AATAAAGTAG AAGTTTTTCA TCCGGACGAA CTTCTTGGTA AACCTTCTAT CAAAGCTTTA GCAGGTGTTT CTGGCGATCT AGAGGTCCTT TTGCCTATTG AGGGCTTAGT TGATTTGGAT GGATTGCGGA AACGTTTAGA AAAAGATTTA ATCAAAGCAC AAAAAGAAAT AACTTCTCTG TCTAAAAGAT TAGATAATCC AAGCTTTATT AACAAAGCTC CTGAAGCCAT TATTTTAGAT TGTAAGAGTA AACTAATTGC CGCAAAATCA CAAGCTGATT TAGTTATTAA GCGCATTGCA GGTTTGAGTT GA
|
Protein sequence | MADHLVNQGL RQAVDSLPKT YDPGGTESRW QKLWESSGAF HPDPNDSGEP FSLVIPPPNV TGSLHMGHAF NTALIDTIVR FQRLQGKNVL CLPGTDHASI AVQTILEKQF KEEGINRDDL GREEFLQRAW AWKSESGGRI AAQLRRLGYS VDWQRERFTM DERLSKAVVE AFVRLHQQGL IYRGEYLVNW CPASGSAVSD LEVEMKEVDG YLWHFQYPLT NGSSDDETTH LEVATTRPET MLGDVAVAVN PSDNRYRHLV GRTLTLPFVG REIPVIADDH VDKDFGTGCV KVTPAHDPND FAIGQRHNLP QITVMNKDGT MNSSAGQFEG LDRFKAREAV VNGLKELSLL TKIEPYRHSV PFSDRGKVPV EPLLSTQWFV RMEPMADRCR AHLVKGEPQF FPTRWEKVYR DWLTGIRDWC ISRQLWWGHR IPAWFVVSET DSIVTDHTPY IVALSEEEAL LQAREKFGDD VAIEQDADVL DTWFSSGLWP FSTLGWPDED DSDFNCWYPT NTLVTGFDII FFWVARMTMM AGAFTGQMPF KDVYIHGLVR DEQNRKMSKS LGNGIDPLVL IDRYGTDALR FALVKEVAGA GQDIRLDYDR KTDTSVTVEA ARNFANKLWN ATRFALINLG TTTFDETFEE LNFSDLQLSD KWILSRLARI NSETSKKFTS YALGEAAKGL YEFSWNDFCD WYLELIKRRL NPGDSLTQSQ LKDRRIAQQV MFKVLRELLV MLHPLMPHLT EELWHGITGF PDQKMLALNL WPACQEGFLD EELESSFSAL FASIKLVRNL RAEAGLKPGQ NVPVRFVTTN SKLADILHKA KADIQSLTRA NKVEVFHPDE LLGKPSIKAL AGVSGDLEVL LPIEGLVDLD GLRKRLEKDL IKAQKEITSL SKRLDNPSFI NKAPEAIILD CKSKLIAAKS QADLVIKRIA GLS
|
| |