Gene Information |
Locus tag | TM1040_1706 |
Symbol | valS |
ID | 4078282 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 1803784 |
End bp | 1806879 |
Gene Length | 3096 bp |
Protein Length | 1031 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 638007020 |
Product | valyl-tRNA synthetase |
Protein accession | YP_613701 |
Protein GI | 99081547 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | [TIGR00422] valyl-tRNA synthetase |
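
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that Gene Length counts the terminal stop codon. The function names `gene_length` and `protein_length` are illustrative and are not part of the database.

```python
# Values copied from the Gene Information table.
START_BP = 1803784
END_BP = 1806879


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 3096 1031
```

Under these assumptions the result matches the listed Gene Length (3096 bp) and Protein Length (1031 aa).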
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.987588 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCTGCA CATATATACT CCAAGACGTG ACGCACGGGG CATCTGATAT GGCACTGGAC AAGACCTTTA ACGCAGCCGA GGCGGAGAGC CGCCTGTTTG ACGCCTGGGA AAAAGCGGGC TGCTTTACCG CTGGCGCAAA TGCCAAACCG GGCGCGTCGA CCTACTGCAT CATGATCCCG CCGCCCAATG TCACCGGCGT TCTGCACATG GGGCACGCGT TCAACAACAC GCTGCAGGAC ATCCTCATCC GCTGGAAACG CATGCAGGGC TATGACACGC TCTGGCAGCC CGGCACAGAC CACGCTGGCA TCGCCACGCA GATGGTGGTG GAGCGCAAGC TCGCCGAAAC TCAGCAGCCT TCACGCCGTG AGCTGGGCCG TGAAAAATTC CTTGAAAAAG TCTGGGAATG GAAAGAGCAA TCCGGCGGCA CCATCATCAA CCAGCTCAAG CGCCTCGGCG CGTCGTGTGA CTATGAGCGC ACCGCCTTCA CCATGGCGGG CGCGCAGGGG GACACCCGCA CGGGCCATGA AAACTCGCCC AACTTCCACG ACGCCGTCAT CAAGGTGTTT GTGGAGATGT ACAACAAGGG CCTCATCTAT CGCGGCAAGC GACTGGTAAA CTGGGACCCG CATTTTGAGA CCGCGATTTC CGACCTTGAG GTCGAAAACA TCGAAGTTGC GGGCCACATG TGGCACTTCA AATATCCGCT CGCCGGAGGG GAGACCTATA CCTACATCGA GAAGGACGAG GACGGGAACG TCACGCTCGA AGAAGAGCGC GACTATATCT CCATCGCGAC CACCCGCCCC GAGACCATGC TCGGCGACGG CGCGGTTGCG GTGCACCCTT CGGACGAACG CTATGCGCCG ATCGTGGGCA AGCTGGTCGA AATCCCGGTC GGCCCCAAGG AGCACCGCCG TCAGATCCCG ATCATCACCG ATGAATACCC GGACAAGGAT TTTGGTTCCG GTGCCGTAAA GATCACCGGC GCGCATGACT TCAACGACTA TCAGGTCGCC AAGCGCGGCG GCATCCCGAT GTACCGTCTG ATGGACATGA AAGGCGCAAT GCGCGCTGAT GGTGCGCCTT ATGCCGAAGA AGCGGCAAAG GCCCAAGCCC ACGCGAAGGG TGCGGCGTTC ACTGAAAACG AGGTAGATGC AATCAACCTC GTGCCCGATC ATCTGCGCGG GCTGGATCGC TTTGAGGCGC GCAAGCTGGT GGTGCAGGAA ATCACCGACG AGGGTCTGGC GGTGATGCAG ACCGTGACTA AAACCGTCAA AGACGATGAG GGCAATGAGA CCGAGGTGTC CGAGGTGGTC CCGATGGTCG AAAACAAGCC GATCATGCAG CCCTTTGGCG ACCGCTCCAA AGTCGTGATC GAGCCGATGC TGACTGACCA GTGGTTTGTC GACGCCGAAA AGGTTGTTGG CCCGGCGCTC GATGCGGTGC GCAATGGCGA TGTGAAAATC CTGCCGGAAA GCGGCGAGAA GACCTATTAC CACTGGCTGG AGAACATCGA ACCATGGTGT ATCTCACGTC AGCTGTGGTG GGGTCATCAG ATCCCGGTTT GGTACGGGCC TGAAAAAGAT CTGCCTGATC CATTTGACAA AAACAAAGTC GCCACAAAGG CATTTTGTGG TGCATCCAAA GAAGAGATTC TTCAACAAGC CAAAGACTAC TATGGTGAAG ATGTACAGGT CTCCATCGGC GAAGATGGCT TTGAGTTCAT TGCAACCCCC GATGCGGTGA TCCTTGAATC GGTTCAACTT CGTCGCGACC CCGACGTGCT CGACACCTGG 
TTCTCCTCCG GCCTCTGGCC GATCGGCACG CTGGGCTGGC CCGAATGGAC CGAGGAGACG TCCAAGTACT TCCCGACCTC GACGCTGGTC ACCGGTCAGG ACATCCTGTT CTTCTGGGTG GCCCGGATGA TGATGATGCA GCTTGCCGTC GTCGATCAGA TCCCGTTTGA CACCGTCTAT CTCCATGGCC TCGTGCGCGA TGCCAAGGGC AAAAAGATGT CGAAATCCAC CGGCAACGTC ATCGACCCGC TTGAGATCAT TGACGAATAC GGCGCCGACG CGCTGCGGTT CACCAATGCG GCGATGGCGA GCCTTGGCGG CGTGCTGAAG CTCGATATGC AGCGCATCGC GGGCTACCGC AACTTTGGCA CCAAACTCTG GAACGCCGTG CGTTTTGCCG AGATGAACGA GGTCTTTACC GACGCCGTAC CGCAGTTGAC CGTGGACGAC CTTGCCCCCA AGGCGGCCGT CAACGCCTGG ATCGTGGGCG AGACCGCCCG CGTGCGCGAA GCGGTGGACG AGGCGATGGA AACATTCCGG TTCAACGACG CGGCGCAAGC GCTTTATGGC TTTGTCTGGG GCAAGGTCTG CGACTGGTAT GTCGAGCTCT CCAAGCCGCT GCTGCAAGGC GATGACACCG AGGCGCAGGC CGAAACCCGC GCCACCATGC GCTGGGTCAT GGATCAGTGC ATGATCCTCT TGCACCCCAT TATGCCCTTC ATCACCGAAG AGCTCTGGGG CGCCACCGGA CCGCGCGCCA AGATGCTGGT GCATGCAGAT TGGCCGACCT ACAAGGCCGC CGATCTGGTA AAGCCCGAAG CCGATGCAGA GATGAACTGG GTCATTTCCG TCATTGAGAA CACCCGCTCC GCCCGCGCGC AGATGCATGT GCCCGCTGGG CTCTATGTGG AGATGCTCGT GACGGAGATT GATGCAGCCG GTCAGGCCGC ATGGGAGGCC AACGAGACGC TGATCAAACG TCTGGCGCGG ATTGAGAGCC TGTCCAAAAT CGAGACCGCC CCCAAAGGCT GCGTGTCGAT CTCTGCGCCG GGGGCGTCGT TCCTGCTGCC GCTCGCGGAC ATCATCGACA TCGACGGCGA AAAGGCCCGG CTTGAAAAAT CGCTTGGAAA ACTCGCCAAG GAGCTCGGTG GTCTGCGCGG ACGTCTGAAC AATCCGAAGT TCATCGCTTC CGCTCCCGAA GAGGTGGTCG AGGAAGCCCG CGAAAACCTC GCCGCGCGCG AAGAGGAAGA GACCAAGCTC AAAGAGGCGC TCTCGCGGCT TGCCGAAATC GGCTGA
|
Protein sequence | MACTYILQDV THGASDMALD KTFNAAEAES RLFDAWEKAG CFTAGANAKP GASTYCIMIP PPNVTGVLHM GHAFNNTLQD ILIRWKRMQG YDTLWQPGTD HAGIATQMVV ERKLAETQQP SRRELGREKF LEKVWEWKEQ SGGTIINQLK RLGASCDYER TAFTMAGAQG DTRTGHENSP NFHDAVIKVF VEMYNKGLIY RGKRLVNWDP HFETAISDLE VENIEVAGHM WHFKYPLAGG ETYTYIEKDE DGNVTLEEER DYISIATTRP ETMLGDGAVA VHPSDERYAP IVGKLVEIPV GPKEHRRQIP IITDEYPDKD FGSGAVKITG AHDFNDYQVA KRGGIPMYRL MDMKGAMRAD GAPYAEEAAK AQAHAKGAAF TENEVDAINL VPDHLRGLDR FEARKLVVQE ITDEGLAVMQ TVTKTVKDDE GNETEVSEVV PMVENKPIMQ PFGDRSKVVI EPMLTDQWFV DAEKVVGPAL DAVRNGDVKI LPESGEKTYY HWLENIEPWC ISRQLWWGHQ IPVWYGPEKD LPDPFDKNKV ATKAFCGASK EEILQQAKDY YGEDVQVSIG EDGFEFIATP DAVILESVQL RRDPDVLDTW FSSGLWPIGT LGWPEWTEET SKYFPTSTLV TGQDILFFWV ARMMMMQLAV VDQIPFDTVY LHGLVRDAKG KKMSKSTGNV IDPLEIIDEY GADALRFTNA AMASLGGVLK LDMQRIAGYR NFGTKLWNAV RFAEMNEVFT DAVPQLTVDD LAPKAAVNAW IVGETARVRE AVDEAMETFR FNDAAQALYG FVWGKVCDWY VELSKPLLQG DDTEAQAETR ATMRWVMDQC MILLHPIMPF ITEELWGATG PRAKMLVHAD WPTYKAADLV KPEADAEMNW VISVIENTRS ARAQMHVPAG LYVEMLVTEI DAAGQAAWEA NETLIKRLAR IESLSKIETA PKGCVSISAP GASFLLPLAD IIDIDGEKAR LEKSLGKLAK ELGGLRGRLN NPKFIASAPE EVVEEARENL AAREEEETKL KEALSRLAEI G
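
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how the two relate, the snippet below removes the spacing and translates the first 30 bases of the gene sequence. It assumes the standard codon assignments, which translation table 11 shares for elongation codons (table 11 differs in the start codons it permits; this gene starts with ATG, so that difference does not matter here). The helper names `clean` and `translate` are illustrative only.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence and first block of the protein.
GENE_PREFIX = "ATGGCCTGCA CATATATACT CCAAGACGTG"
PROTEIN_PREFIX = "MACTYILQDV"

print(translate(GENE_PREFIX))  # prints: MACTYILQDV
```

The same function can be applied to the full gene sequence once its two lines are joined; `clean` ignores the embedded spaces and line break.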