Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_18921 |
Symbol | valS |
ID | 4718630 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 1626052 |
End bp | 1628808 |
Gene Length | 2757 bp |
Protein Length | 918 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 640079626 |
Product | valyl-tRNA synthetase |
Protein accession | YP_001010282 |
Protein GI | 123969424 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | [TIGR00422] valyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGAGA TTAATGATCA ATTATCTTTA GAGAATTATT CACCTTTTGA AGTAGAGAAA AAGTGGCAAG AAAAATGGGA AATTCTTAAG GCGTTTAGTC CTAACCCTGA GGATGATGGT GAGCCTTTTT GTGTTGTTAT TCCGCCACCA AATGTAACTG GATCTTTGCA TATGGGGCAT GCATTTAATA CGGCTTTGAT AGATGTTGTA GTACGTTTTC AAAGACTTTT AGGTAAGAAT GTTTTGTGTT TACCGGGAAC TGATCATGCT TCAATAGCTG TTCAAACTAT TCTCGAAAAA CAATTAAAAA GTGAAGGCAA AACAAGCGAG GATATTGGAA GAGATGAATT TCTTAAAAGA GCATGGAACT GGAAAGAACA AAGTGGTGGA AGAATAGTTT CTCAATTAAA AAGGATAGGA TATTCAGTAG ACTGGACTAG AGAAAGATTT ACTCTTGATC AAAAATTAAA TGAAGCAGTT ATTGAGGCTT TTAATATTCT CTATAAAAAG AATTTAATTT ATAGAGGCGA ATATTTGGTT AATTGGTGCC CTGAATCTCA ATCTGCCGTA AGTGACCTTG AAGTTGAAAT GCAAGAAGTA AATGGTCATT TATGGCATTT TAAATACCCT TTAATTTCTG AAAGTGGTGA ACACTTAGAT AAGTACTTAG AAGTTGCAAC AACAAGACCA GAAACTCTTT TGGGCGATAC TGCTGTGGCA GTTAATCCTG ATGATGATAG ATATAAAGAA TTTATTGGAG TCAAAGTAAA AGTTCCTTTC GTCGATAGAG AAATACCTAT TATCGCTGAT TCACATGTTG ATAAGGATTT TGGTACGGGT TGTGTGAAGG TTACTCCAGC CCATGATCCA AATGATTTTG CAATAGGAAA AAGGCATAAT TTAAAACAGA TTAATGTAAT GAACAAGGAT GGAACTTTAA ATATTAATGC AGGTATTTTT CAAAATTTAG ATAGATATGA GGCTAGAAAG AAAATTATCA AAGAATTGGA TAACTTAGGC CTTTTGACAA AGATAGAGGA TTATAAACAT ACTGTTCCTT TTTCTGATAG AGGTAAGGTG CCAATTGAAC CTTTATTGTC AACACAATGG TTTTTGAAAA TGGATGATAT ATCACAGGGA TGTCTTAATG AAATTGATTC TAAAAAACCA TCGTTTATTC CTCCACGCTG GGAGAAAGTT TATAAGGATT GGTTAGAAAA TATTAATGAT TGGTGTATCA GTCGGCAATT GTGGTGGGGG CACCAAATAC CAGCCTGGTA TGTTTTAGAT GAATCTCAAG ACTCAATAGA ACAAAATACT CCATACATCG TCGCAAGAAA TGAAGAGGAT GCCTTAATCG AAGCTAATAA AAAATTTGGA TTAAATATTA AATTGGTTCG TGATAAAGAT GTTTTGGATA CATGGTTTTC AAGTGGTTTG TGGCCTTTCT CAACCCTTGG TTGGCCAAAC ACAAATGATC CGGATTTTAA AAAATGGTAT CCAAATAGTG TTCTTGTTAC TGGTTTCGAT ATTATTTTCT TCTGGGTAGC AAGAATGACA ATGATGGGGA ATACTTTTAC AAATAATATT CCTTTTAATG ATGTATATAT TCATGGTCTA GTTCGAGATG AAAATAATAA AAAAATGAGT AAAAGTTCAG GTAATGGTAT TGATCCAATA CTATTAATTG ATAAATATGG TTCTGATGCT CTACGATTTG CTTTAATTCG AGAAGTTGCA GGCGCTGGAC AAGATATCCG CCTTGATTTT GATAGGAAAA AAGATACATC TTCAACTGTT GAAGCTTCAA GAAATTTTGC GAATAAATTA TGGAATGCAA CTAAATTTGT GTTAATTAAT AAAACTTCTA ATAATAATTA TTCGCTTAAT GAGAGTGATG AAACTTCTTT AGAGTTATGT GATAAGTGGA TATTATCGAA ATTGAATCAG GTAAATATAA AAGTCGCTGG TTTGTTGAAA GAATATAAAT TGGGAGAATC TGCGAAACTT CTATATGAAT TTACGTGGAA TGATTTTTGT GACTGGTATG TAGAATTTGC TAAACAAAGA TTTAATAATA AAGAGACTAA AAATAGACAA ATATCTGAGA AAGTTTTAAT AAAAGTGCTC AATGATATTT TGGTAATGAT TCATCCTTTT ATGCCGCACA TAACTGAGGA ACTTTGGCAT GTGCTGCAAC TAAAACCAGA CAATTTATTA TTATCTCTTC AAAAATGGCC AATTCACGAA AATAAATTTG TTGATAATAA GCTTGATAAT TCCTTTCAGC AACTCTTTGA AATTATTAGA CTGATTAGAA ATTTGAGAGC TGAGTTAGGT CTCAAGCCAT CAGAAAAAGT TCCCGTATAT TTAATTTCAG AGAATGATGA ATTGATTGAT TTTTTAAAAA CTTTAGTTGA TGATATTCAA ACCTTAACTA AATCTTCTGA AGTATTTATT TTTAAAACTA ATGCTGTTGA TAAAAAAGAG TTTGCTAAAT CTTTTTCCGG GATAATTAGT GATTTAGAGG TTTACTTACC TTTTCAGGAT TTTGTAAATA TAGATTCATT AAAGGAAAGG TTAAATAAGG ATTTAAAAAA GGTGACTATC GAATTAGAAA ATTTAAATAA GAGATTATCT AATAAAAATT TCGTTGATAA GGCTCCAAAA GATATTGTTG ACGAATGCAG ATTTAAATTA AATGAGGGTT CGGTACAGAA GGAAAGAATT ACTAAAAAAC TCGAACTTTT GAATTGA
|
Protein sequence | MTEINDQLSL ENYSPFEVEK KWQEKWEILK AFSPNPEDDG EPFCVVIPPP NVTGSLHMGH AFNTALIDVV VRFQRLLGKN VLCLPGTDHA SIAVQTILEK QLKSEGKTSE DIGRDEFLKR AWNWKEQSGG RIVSQLKRIG YSVDWTRERF TLDQKLNEAV IEAFNILYKK NLIYRGEYLV NWCPESQSAV SDLEVEMQEV NGHLWHFKYP LISESGEHLD KYLEVATTRP ETLLGDTAVA VNPDDDRYKE FIGVKVKVPF VDREIPIIAD SHVDKDFGTG CVKVTPAHDP NDFAIGKRHN LKQINVMNKD GTLNINAGIF QNLDRYEARK KIIKELDNLG LLTKIEDYKH TVPFSDRGKV PIEPLLSTQW FLKMDDISQG CLNEIDSKKP SFIPPRWEKV YKDWLENIND WCISRQLWWG HQIPAWYVLD ESQDSIEQNT PYIVARNEED ALIEANKKFG LNIKLVRDKD VLDTWFSSGL WPFSTLGWPN TNDPDFKKWY PNSVLVTGFD IIFFWVARMT MMGNTFTNNI PFNDVYIHGL VRDENNKKMS KSSGNGIDPI LLIDKYGSDA LRFALIREVA GAGQDIRLDF DRKKDTSSTV EASRNFANKL WNATKFVLIN KTSNNNYSLN ESDETSLELC DKWILSKLNQ VNIKVAGLLK EYKLGESAKL LYEFTWNDFC DWYVEFAKQR FNNKETKNRQ ISEKVLIKVL NDILVMIHPF MPHITEELWH VLQLKPDNLL LSLQKWPIHE NKFVDNKLDN SFQQLFEIIR LIRNLRAELG LKPSEKVPVY LISENDELID FLKTLVDDIQ TLTKSSEVFI FKTNAVDKKE FAKSFSGIIS DLEVYLPFQD FVNIDSLKER LNKDLKKVTI ELENLNKRLS NKNFVDKAPK DIVDECRFKL NEGSVQKERI TKKLELLN
|
| |