Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9301_18731 |
Symbol | valS |
ID | 4910912 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | - |
Start bp | 1598044 |
End bp | 1600800 |
Gene Length | 2757 bp |
Protein Length | 918 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 640161478 |
Product | valyl-tRNA synthetase |
Protein accession | YP_001092097 |
Protein GI | 126697211 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | [TIGR00422] valyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGAGA TGAATGATCA ATTATCTTTA GAGAATTATT CACCTTTTGA AGTAGAGAAA AAGTGGCAAG AAAAATGGGA AAGCCTAGAG GCGTTTAGTC CTAATCCTGA GGATGATGGT GAGTCTTTTT GTGTTGTTAT TCCGCCACCA AATGTAACTG GATCTTTGCA TATGGGGCAT GCATTTAATA CGGCTTTGAT AGATGTTGTA GTACGTTTTC AAAGACTTTT GGGTAAGAAT GTTTTGTGTT TACCGGGAAC TGATCATGCT TCAATAGCTG TTCAAACTAT TCTCGAAAAA CAATTAAAAA GTGAAGGCAA AACAAGCGAG GATATTGGAA GAGATGAATT TCTTAAAAGA GCATGGAACT GGAAAGAACA AAGTGGTGGA AGAATAGTTT CTCAATTAAA AAGAATAGGA TACTCAGTTG ACTGGACTAG AGAAAGATTT ACTCTTGATC AAAAATTAAA TGAAGCAGTC ATTGAGGCTT TTAATATACT TTATAAAAAG AATTTAATTT ATAGAGGCGA ATATTTGGTT AATTGGTGTC CTGAATCTCA ATCTGCCGTA AGTGATCTTG AAGTTGAAAT GCAAGAAGTA AATGGTCATT TATGGCATTT TAAATACCCT TTAATTTCTG AAAGTGGTGA ACAGTTAGAT AAGTACTTAG AAGTTGCAAC AACAAGGCCA GAAACTCTTT TGGGTGATAC CGCTGTGGCA GTTAACCCTG ATGATGATAG ATATAAAGAA TTTATTGGTG CCAAAGTAAA AGTTCCTTTC GTTGATAGGG AAATACCTAT TATTGCTGAT TCACATGTTG ATAAAGATTT TGGTACAGGT TGTGTGAAGG TTACTCCAGC TCATGATCCA AATGATTTTG CAATAGGAAA AAGGCATAAT TTAAAACAAA TTAATGTAAT GAACAAAGAT GGAACTTTAA ATATTAATGC AGGTATTTTT CAAAATTTGG AAAGATATGA GGCTAGAAAG AAAATTATCA AAGAATTGGA TAACTTAGGT CTTTTGACAA AGATAGAGGA TTATAAACAT ACTGTTCCTT TTTCTGATAG AGGTAAGGTG CCAATTGAAC CTTTATTGTC AACACAATGG TTTTTGAAAA TGGATGGTAT ATCACAAGGA TGTCTTAATG AAATTGATTC TAAAAAACCA TCGTTTATTC CTCCGCGCTG GGAGAAAGTT TATAAGGATT GGTTAGAGAA TATTAATGAT TGGTGTATCA GTCGGCAATT GTGGTGGGGG CACCAAATAC CAGCCTGGTA TGTTTTAGAT GAATCTCAAG ACTCAATAGA ACAAAATACT CCATATATCG TTGCGAGAAA TGAAGAGGAT GCCCTAATCG AAGCTAATAA AAAATTTGGA TTAAATATTA AATTGGTTCG TGATAAAGAT GTTTTGGATA CATGGTTTTC AAGTGGTTTA TGGCCTTTCT CAACCCTTGG TTGGCCAAAT ACAAATGATC CGGATTTCAA AAAATGGTAT CCAAATAGTG TTCTTGTTAC TGGTTTCGAT ATTATTTTCT TCTGGGTTGC AAGAATGACA ATGATGGGGA ATACTTTTAC AAATAATATT CCTTTTAAGG ATGTTTATAT TCATGGTCTA GTTCGAGATG AAAACAATAA AAAAATGAGT AAAAGTTCAG GTAATGGTAT TGATCCAATA CTATTAATTG ATAAATATGG TTCTGATGCT CTACGATTTG CTTTAATTCG AGAAGTTGCA GGCGCTGGAC AAGATATTCG GCTTGATTTT GATAGGAAAA AAGATACGTC TTCAACTGTT GAAGCTTCAA GAAATTTTGC GAATAAATTA TGGAATGCAA CTAAATTTGT GTTAATTAAT AAAACTTCTA ATAATAATTA TTCTTTTAAT GAGAGTGATG AAAATTCTTT AGAGTTATGT GATAAGTGGA TTTTATCGAA ATTGAATCAG GTAAATATAA AAGTCACTGC TTTATTAAAA GAATATAAAT TGGGAGAATC TGCGAAACTT CTATATGAAT TTACTTGGAA TGATTTTTGT GACTGGTATG TAGAATTTGC TAAACAAAGG TTTAATAATA AAGAGACCAA AAATAGACAA ATATCTGAAA AAGTTTTAAT AAAAGTGCTC AATGATATTT TGGTAATGAT TCATCCTTTT ATGCCGCACA TTACCGAGGA GCTTTGGCAT GTGCTGCAAC TGAAACCAGA CAATGCATTA TTATCTCTTC AAAAATGGCC AATTCATGAA AATAAATTTG TTGATAATAA ACTTGATAAT TCCTTCCAAC AACTCTTTGA AATTATTAGG CTGATTAGAA ATTTGAGATC TGAATTAGGT CTTAAGCCAT CAGAAAAAGG TCCTGTATAT TTAATTTCTG ACAATGATGA ATTGATTGAT TTTTTAAAAA CTTTAGTTGG TGATATTCAA ACCTTAACTA AATCTTCTGA AGTATTTATT TTTAAAAATA ATGCTGTTGA TAAAAAAGAG TTTGCTAAAT CATTTTCCGG GATAATTAGT GATTTAGAGG TTTACTTACC TTTTCAGGAT TTTGTAAATA TAGATGCATT AAAGGAAAGG TTAACCAAGG ATTTAAAAAA GGTGACTATT GAATTAGAAA ATTTAAATAA GAGATTATCT AATAAAAATT TCGTTGATAA GGCTCCAAAA GATATTGTTG ACGAATGCAG ATTTAAATTA AATGAGGGTT CTGTACAAAA GGAAAGAATT ACAAAAAAAC TCGAACTTTT GAATTGA
|
Protein sequence | MTEMNDQLSL ENYSPFEVEK KWQEKWESLE AFSPNPEDDG ESFCVVIPPP NVTGSLHMGH AFNTALIDVV VRFQRLLGKN VLCLPGTDHA SIAVQTILEK QLKSEGKTSE DIGRDEFLKR AWNWKEQSGG RIVSQLKRIG YSVDWTRERF TLDQKLNEAV IEAFNILYKK NLIYRGEYLV NWCPESQSAV SDLEVEMQEV NGHLWHFKYP LISESGEQLD KYLEVATTRP ETLLGDTAVA VNPDDDRYKE FIGAKVKVPF VDREIPIIAD SHVDKDFGTG CVKVTPAHDP NDFAIGKRHN LKQINVMNKD GTLNINAGIF QNLERYEARK KIIKELDNLG LLTKIEDYKH TVPFSDRGKV PIEPLLSTQW FLKMDGISQG CLNEIDSKKP SFIPPRWEKV YKDWLENIND WCISRQLWWG HQIPAWYVLD ESQDSIEQNT PYIVARNEED ALIEANKKFG LNIKLVRDKD VLDTWFSSGL WPFSTLGWPN TNDPDFKKWY PNSVLVTGFD IIFFWVARMT MMGNTFTNNI PFKDVYIHGL VRDENNKKMS KSSGNGIDPI LLIDKYGSDA LRFALIREVA GAGQDIRLDF DRKKDTSSTV EASRNFANKL WNATKFVLIN KTSNNNYSFN ESDENSLELC DKWILSKLNQ VNIKVTALLK EYKLGESAKL LYEFTWNDFC DWYVEFAKQR FNNKETKNRQ ISEKVLIKVL NDILVMIHPF MPHITEELWH VLQLKPDNAL LSLQKWPIHE NKFVDNKLDN SFQQLFEIIR LIRNLRSELG LKPSEKGPVY LISDNDELID FLKTLVGDIQ TLTKSSEVFI FKNNAVDKKE FAKSFSGIIS DLEVYLPFQD FVNIDALKER LTKDLKKVTI ELENLNKRLS NKNFVDKAPK DIVDECRFKL NEGSVQKERI TKKLELLN
|
| |