Gene TM1040_1706 details

Gene Information

Locus tag: TM1040_1706
Symbol: valS
ID: 4078282
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 1803784
End bp: 1806879
Gene Length: 3096 bp
Protein Length: 1031 aa
Translation table: 11
GC content: 60%
IMG OID: 638007020
Product: valyl-tRNA synthetase
Protein accession: YP_613701
Protein GI: 99081547
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0525] Valyl-tRNA synthetase
TIGRFAM ID: [TIGR00422] valyl-tRNA synthetase
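
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length counts the stop codon; neither assumption is stated in the record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the stop codon is included."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


print(gene_length(1803784, 1806879))  # 3096
print(protein_length(3096))           # 1031
```

Under these assumptions both values agree with the Gene Length and Protein Length reported in the record.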


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.987588
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCTGCA CATATATACT CCAAGACGTG ACGCACGGGG CATCTGATAT GGCACTGGAC 
AAGACCTTTA ACGCAGCCGA GGCGGAGAGC CGCCTGTTTG ACGCCTGGGA AAAAGCGGGC
TGCTTTACCG CTGGCGCAAA TGCCAAACCG GGCGCGTCGA CCTACTGCAT CATGATCCCG
CCGCCCAATG TCACCGGCGT TCTGCACATG GGGCACGCGT TCAACAACAC GCTGCAGGAC
ATCCTCATCC GCTGGAAACG CATGCAGGGC TATGACACGC TCTGGCAGCC CGGCACAGAC
CACGCTGGCA TCGCCACGCA GATGGTGGTG GAGCGCAAGC TCGCCGAAAC TCAGCAGCCT
TCACGCCGTG AGCTGGGCCG TGAAAAATTC CTTGAAAAAG TCTGGGAATG GAAAGAGCAA
TCCGGCGGCA CCATCATCAA CCAGCTCAAG CGCCTCGGCG CGTCGTGTGA CTATGAGCGC
ACCGCCTTCA CCATGGCGGG CGCGCAGGGG GACACCCGCA CGGGCCATGA AAACTCGCCC
AACTTCCACG ACGCCGTCAT CAAGGTGTTT GTGGAGATGT ACAACAAGGG CCTCATCTAT
CGCGGCAAGC GACTGGTAAA CTGGGACCCG CATTTTGAGA CCGCGATTTC CGACCTTGAG
GTCGAAAACA TCGAAGTTGC GGGCCACATG TGGCACTTCA AATATCCGCT CGCCGGAGGG
GAGACCTATA CCTACATCGA GAAGGACGAG GACGGGAACG TCACGCTCGA AGAAGAGCGC
GACTATATCT CCATCGCGAC CACCCGCCCC GAGACCATGC TCGGCGACGG CGCGGTTGCG
GTGCACCCTT CGGACGAACG CTATGCGCCG ATCGTGGGCA AGCTGGTCGA AATCCCGGTC
GGCCCCAAGG AGCACCGCCG TCAGATCCCG ATCATCACCG ATGAATACCC GGACAAGGAT
TTTGGTTCCG GTGCCGTAAA GATCACCGGC GCGCATGACT TCAACGACTA TCAGGTCGCC
AAGCGCGGCG GCATCCCGAT GTACCGTCTG ATGGACATGA AAGGCGCAAT GCGCGCTGAT
GGTGCGCCTT ATGCCGAAGA AGCGGCAAAG GCCCAAGCCC ACGCGAAGGG TGCGGCGTTC
ACTGAAAACG AGGTAGATGC AATCAACCTC GTGCCCGATC ATCTGCGCGG GCTGGATCGC
TTTGAGGCGC GCAAGCTGGT GGTGCAGGAA ATCACCGACG AGGGTCTGGC GGTGATGCAG
ACCGTGACTA AAACCGTCAA AGACGATGAG GGCAATGAGA CCGAGGTGTC CGAGGTGGTC
CCGATGGTCG AAAACAAGCC GATCATGCAG CCCTTTGGCG ACCGCTCCAA AGTCGTGATC
GAGCCGATGC TGACTGACCA GTGGTTTGTC GACGCCGAAA AGGTTGTTGG CCCGGCGCTC
GATGCGGTGC GCAATGGCGA TGTGAAAATC CTGCCGGAAA GCGGCGAGAA GACCTATTAC
CACTGGCTGG AGAACATCGA ACCATGGTGT ATCTCACGTC AGCTGTGGTG GGGTCATCAG
ATCCCGGTTT GGTACGGGCC TGAAAAAGAT CTGCCTGATC CATTTGACAA AAACAAAGTC
GCCACAAAGG CATTTTGTGG TGCATCCAAA GAAGAGATTC TTCAACAAGC CAAAGACTAC
TATGGTGAAG ATGTACAGGT CTCCATCGGC GAAGATGGCT TTGAGTTCAT TGCAACCCCC
GATGCGGTGA TCCTTGAATC GGTTCAACTT CGTCGCGACC CCGACGTGCT CGACACCTGG
TTCTCCTCCG GCCTCTGGCC GATCGGCACG CTGGGCTGGC CCGAATGGAC CGAGGAGACG
TCCAAGTACT TCCCGACCTC GACGCTGGTC ACCGGTCAGG ACATCCTGTT CTTCTGGGTG
GCCCGGATGA TGATGATGCA GCTTGCCGTC GTCGATCAGA TCCCGTTTGA CACCGTCTAT
CTCCATGGCC TCGTGCGCGA TGCCAAGGGC AAAAAGATGT CGAAATCCAC CGGCAACGTC
ATCGACCCGC TTGAGATCAT TGACGAATAC GGCGCCGACG CGCTGCGGTT CACCAATGCG
GCGATGGCGA GCCTTGGCGG CGTGCTGAAG CTCGATATGC AGCGCATCGC GGGCTACCGC
AACTTTGGCA CCAAACTCTG GAACGCCGTG CGTTTTGCCG AGATGAACGA GGTCTTTACC
GACGCCGTAC CGCAGTTGAC CGTGGACGAC CTTGCCCCCA AGGCGGCCGT CAACGCCTGG
ATCGTGGGCG AGACCGCCCG CGTGCGCGAA GCGGTGGACG AGGCGATGGA AACATTCCGG
TTCAACGACG CGGCGCAAGC GCTTTATGGC TTTGTCTGGG GCAAGGTCTG CGACTGGTAT
GTCGAGCTCT CCAAGCCGCT GCTGCAAGGC GATGACACCG AGGCGCAGGC CGAAACCCGC
GCCACCATGC GCTGGGTCAT GGATCAGTGC ATGATCCTCT TGCACCCCAT TATGCCCTTC
ATCACCGAAG AGCTCTGGGG CGCCACCGGA CCGCGCGCCA AGATGCTGGT GCATGCAGAT
TGGCCGACCT ACAAGGCCGC CGATCTGGTA AAGCCCGAAG CCGATGCAGA GATGAACTGG
GTCATTTCCG TCATTGAGAA CACCCGCTCC GCCCGCGCGC AGATGCATGT GCCCGCTGGG
CTCTATGTGG AGATGCTCGT GACGGAGATT GATGCAGCCG GTCAGGCCGC ATGGGAGGCC
AACGAGACGC TGATCAAACG TCTGGCGCGG ATTGAGAGCC TGTCCAAAAT CGAGACCGCC
CCCAAAGGCT GCGTGTCGAT CTCTGCGCCG GGGGCGTCGT TCCTGCTGCC GCTCGCGGAC
ATCATCGACA TCGACGGCGA AAAGGCCCGG CTTGAAAAAT CGCTTGGAAA ACTCGCCAAG
GAGCTCGGTG GTCTGCGCGG ACGTCTGAAC AATCCGAAGT TCATCGCTTC CGCTCCCGAA
GAGGTGGTCG AGGAAGCCCG CGAAAACCTC GCCGCGCGCG AAGAGGAAGA GACCAAGCTC
AAAGAGGCGC TCTCGCGGCT TGCCGAAATC GGCTGA
 
Protein sequence
MACTYILQDV THGASDMALD KTFNAAEAES RLFDAWEKAG CFTAGANAKP GASTYCIMIP 
PPNVTGVLHM GHAFNNTLQD ILIRWKRMQG YDTLWQPGTD HAGIATQMVV ERKLAETQQP
SRRELGREKF LEKVWEWKEQ SGGTIINQLK RLGASCDYER TAFTMAGAQG DTRTGHENSP
NFHDAVIKVF VEMYNKGLIY RGKRLVNWDP HFETAISDLE VENIEVAGHM WHFKYPLAGG
ETYTYIEKDE DGNVTLEEER DYISIATTRP ETMLGDGAVA VHPSDERYAP IVGKLVEIPV
GPKEHRRQIP IITDEYPDKD FGSGAVKITG AHDFNDYQVA KRGGIPMYRL MDMKGAMRAD
GAPYAEEAAK AQAHAKGAAF TENEVDAINL VPDHLRGLDR FEARKLVVQE ITDEGLAVMQ
TVTKTVKDDE GNETEVSEVV PMVENKPIMQ PFGDRSKVVI EPMLTDQWFV DAEKVVGPAL
DAVRNGDVKI LPESGEKTYY HWLENIEPWC ISRQLWWGHQ IPVWYGPEKD LPDPFDKNKV
ATKAFCGASK EEILQQAKDY YGEDVQVSIG EDGFEFIATP DAVILESVQL RRDPDVLDTW
FSSGLWPIGT LGWPEWTEET SKYFPTSTLV TGQDILFFWV ARMMMMQLAV VDQIPFDTVY
LHGLVRDAKG KKMSKSTGNV IDPLEIIDEY GADALRFTNA AMASLGGVLK LDMQRIAGYR
NFGTKLWNAV RFAEMNEVFT DAVPQLTVDD LAPKAAVNAW IVGETARVRE AVDEAMETFR
FNDAAQALYG FVWGKVCDWY VELSKPLLQG DDTEAQAETR ATMRWVMDQC MILLHPIMPF
ITEELWGATG PRAKMLVHAD WPTYKAADLV KPEADAEMNW VISVIENTRS ARAQMHVPAG
LYVEMLVTEI DAAGQAAWEA NETLIKRLAR IESLSKIETA PKGCVSISAP GASFLLPLAD
IIDIDGEKAR LEKSLGKLAK ELGGLRGRLN NPKFIASAPE EVVEEARENL AAREEEETKL
KEALSRLAEI G
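
Both sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch of that cleanup and of a GC-fraction calculation. The helper names are hypothetical, and the sample string is only the first two blocks of the gene sequence, so its result is not comparable to the GC content reported for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks, and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence, used here only as sample input.
first_blocks = "ATGGCCTGCA CATATATACT"
seq = clean_sequence(first_blocks)
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.4
```

Passing the full gene sequence through `clean_sequence` first gives a string whose length and GC fraction can be compared with the Gene Length and GC content fields.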