Gene Tmz1t_2790 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_2790 
SymbolvalS 
ID7873199 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp3019259 
End bp3022243 
Gene Length2985 bp 
Protein Length994 aa 
Translation table11 
GC content68% 
IMG OID643699712 
Productvalyl-tRNA synthetase 
Protein accessionYP_002889767 
Protein GI237653453 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0525] Valyl-tRNA synthetase 
TIGRFAM ID[TIGR00422] valyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.150829 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACTGG CCAAGAGCTT CGAGCCCGCC GACATCGAAC GTCGCTGGTA CCCGGAATGG 
GAATCCCGCG GCTTTTTCGA CGCCGGGCTC GACAAGTCGA ACCCCGCCGC CTTCTGCATC
CTGCTGCCGC CGCCCAACGT CACCGGCACG CTGCACATGG GGCACGGCTT CAACCAGACG
ATCATGGACG CGCTCACGCG CTACCACCGC ATGCGCGGCG ACAACACGCT GTGGCAGCCG
GGCACCGACC ACGCCGGCAT CGCCACCCAG ATCGTCGTCG AGCGCCAGCT CGACGCCCAG
GGCGTCAGCC GCCACGACCT CGGCCGCGAG CGCTTCCTGG AGAAGGTGTG GGAGTGGAAG
GAATACTCCG GCGGCACCAT CACCCGCCAG ATGCGCCGCC TCGGCACCAG CCCGGACTGG
AAGCGCGAGC GCTTCACGAT GGACGAGGGC CTGTCGCGCA CCGTCACCGA GACCTTCGTG
CGCCTCTACA ACGAGGGCCT GATCTACCGC GGCAAGCGCC TGGTGAACTG GGACCCCAAG
CTCGGCACCG CGGTGTCCGA CCTCGAGGTG GTGTCGGAGG AAGAGGACGG CAAGCTCTAC
CACATCCTGT ATCCGTTCTC CGACGGCCCC ATCGGTGACC TGCAAGGCCT GACGGTCGCC
ACCACCCGCC CCGAGACCCT GCTCGGCGAC GTCGCGGTGA TGGTGCATCC CGAGGACGAG
CGCTACGCCC ACCTCATCGG CAAGACCGTC GAGCTGCCGC TCACCGGCCG TCATATCCCG
ATCATCGCCG ACGACTACGT CGATCGCGAG TTCGGCACCG GCTGCGTGAA GGTCACGCCG
GCGCACGACT TCAACGACTA CGCGGTGGGC CAGCGCCACA AGCTCGACAC CATCGTCGTG
CTCACCCTCG AAGGCGCCGT GCCCGCGGTG GCCGAGCGCT ACACCGCCGA CGGCGTCACC
CTGGAGGGTG TGCCGATGCC GGCGGGCGTC GTCGGGCTCG ACCGCGTACC GGCGCGCGAG
AAGGTGGTCG AGGCGCTCGA GGCGCTCGGC CTGCTGCTCG AGGTCAAGGC GCACAAGATG
CAGGTGCCGC GCGGCGACCG CACCGGGGTC GTCATCGAGC CGATGCTGAC CGATCAATGG
TTCGTCGCCA TGAGCAAGCC GGGCGCGGAC GGCAGGTCGA TCACCGACAA GGCGCTCGAG
GTCGTGGCCT CGGGCGAGAT CAAGTTCTAC CCCGAGAACT GGGTCAACAC CTACAACCAG
TGGCTCAACA ACATCCAGGA CTGGTGCATC TCGCGCCAGC TGTGGTGGGG CCACCGCATC
CCGGCCTGGT ACGACGAGGA AGGCCGCATC TACGTCGCCA CCTGCGAGGA AGAGGCGATC
CGCGCCTGGA AGGCCGACCT GCAGCTCGGC ATCGACGCCC TCGACGCCGA GGTGCAGACG
CGCCAGCGCG AAGGCCAGAC CGCCGAGCAA TACCCCGAGA TCGCCGAGCG CCTCGCCCTG
CTCCACGCCC GCCACGACGC CGGCCGCCTG CGCCAGGAAG ACGACGTGCT CGACACCTGG
TACTCGTCCG CGCTGTGGCC GTTCTCCACG CTCGACTGGA CCGCCGAGTG GCCGGAGAAG
AGCAACGACG CGCTCGACCT CTACCTGCCC TCCACCGTGC TCGTCACCGG CTTCGACATC
ATCTTCTTCT GGGTCGCCCG CATGGTGATG ATGACGAAGC ACATCACCGG CAGGATCCCC
TTCAAGCACG TGTATGTGCA CGGCCTGATC CGCGATGCGG AAGGCCAGAA GATGAGCAAG
TCCAAGGGCA ACGTGCTCGA CCCGATCGAC CTCATCGACG GCATCGCGCT CGACGAACTC
ATCAAGAAGC GCACCTTCGG CCTGATGAAC CCGAAGCAGG CGCAGAGCAT CGAGAAGAAG
ACGCGCAAGG AATTCCCCGA GGGCATCCCC GCCTTCGGCA CCGACGCGCT GCGCTTCACC
TTCGCCTCGC TCGCCAGCCC TGGCCGCGAC ATCAAGTTCG ACCTCGCGCG CTGCGAGGGC
TACCGCAACT TCTGCAACAA GCTGTGGAAC GCCACCCGCT TCGTGCTGAT GAACTGCGAG
GGCCAGGACT GCGGCATCGG CGAGACCGTC GCCTGCTCCA CCGAGGTGCT CGACTTCTCC
TTCGCCGACC GCTGGATCGT GTCGCGCCTG CAACGCACCG AGGCCGAGGT CGCAGAACAC
TTCGCGGCCT ACCGCTTCGA CCTGGTCGCG CGTGCGGTGT ATGAGTTCGT CTGGGACGAG
TACTGCGACT GGTACCTGGA GCTCGCCAAG GTGCAGATCC AGTCGGGCAC CCCGGCGCAG
CAGCGCGCCA CCCGCCGCAC GCTGCTGCGC GTGCTCGAGA CCGTGCTGCG CCTGGCACAC
CCGCTGATCC CCTTCATCAC CGAAGAACTC TGGCAGACGG TCGCCCCGCT CGCCGGGCGC
AAGGAAGGCG ACAGCATCAT GCGCGCGCGC TACCCGCAGG CCGAGCCCAA GCGCATCGAC
GAGGCCTCCG AGGCCAAGGT CGCCGAGCTC AAGGCGATGA TCTACGCCTG CCGCAACCTG
CGCGGCGAGA TGAACATCTC CCCGGCGCAG CGCCTGCCGC TGGTGGCCGC GGGCGACAAG
GCGGCACTGG CGCTGTACGC GCCTTACCTC GCCGGCCTCG CCAAGCTCGC CGAGGTGCAG
GTCGTGGACG AGATCGGCGC CGACGAGCTC GCCCCGGTCG CGGTCGCAGG CGAGACCCGC
CTGATGCTGA AGGTGGAGAT CGACGTCGCC GCCGAGCGCG AGCGTCTGGG CAAGGAGATC
GCCCGCCTGG AGGGCGAGAT CGCCAAGGCC GAAGGCAAGC TCGGCAACGC CAGCTTCGTA
GACCGCGCGC CGGCCGCGGT GGTGCAGCAG GAACGCGACC GCCTGGCGGG CTTCAAGGCC
ACGGTGGGCA AGCTCAAGCC GCAGCTCGCC AAGCTGGGGG GCTGA
 
Protein sequence
MELAKSFEPA DIERRWYPEW ESRGFFDAGL DKSNPAAFCI LLPPPNVTGT LHMGHGFNQT 
IMDALTRYHR MRGDNTLWQP GTDHAGIATQ IVVERQLDAQ GVSRHDLGRE RFLEKVWEWK
EYSGGTITRQ MRRLGTSPDW KRERFTMDEG LSRTVTETFV RLYNEGLIYR GKRLVNWDPK
LGTAVSDLEV VSEEEDGKLY HILYPFSDGP IGDLQGLTVA TTRPETLLGD VAVMVHPEDE
RYAHLIGKTV ELPLTGRHIP IIADDYVDRE FGTGCVKVTP AHDFNDYAVG QRHKLDTIVV
LTLEGAVPAV AERYTADGVT LEGVPMPAGV VGLDRVPARE KVVEALEALG LLLEVKAHKM
QVPRGDRTGV VIEPMLTDQW FVAMSKPGAD GRSITDKALE VVASGEIKFY PENWVNTYNQ
WLNNIQDWCI SRQLWWGHRI PAWYDEEGRI YVATCEEEAI RAWKADLQLG IDALDAEVQT
RQREGQTAEQ YPEIAERLAL LHARHDAGRL RQEDDVLDTW YSSALWPFST LDWTAEWPEK
SNDALDLYLP STVLVTGFDI IFFWVARMVM MTKHITGRIP FKHVYVHGLI RDAEGQKMSK
SKGNVLDPID LIDGIALDEL IKKRTFGLMN PKQAQSIEKK TRKEFPEGIP AFGTDALRFT
FASLASPGRD IKFDLARCEG YRNFCNKLWN ATRFVLMNCE GQDCGIGETV ACSTEVLDFS
FADRWIVSRL QRTEAEVAEH FAAYRFDLVA RAVYEFVWDE YCDWYLELAK VQIQSGTPAQ
QRATRRTLLR VLETVLRLAH PLIPFITEEL WQTVAPLAGR KEGDSIMRAR YPQAEPKRID
EASEAKVAEL KAMIYACRNL RGEMNISPAQ RLPLVAAGDK AALALYAPYL AGLAKLAEVQ
VVDEIGADEL APVAVAGETR LMLKVEIDVA AERERLGKEI ARLEGEIAKA EGKLGNASFV
DRAPAAVVQQ ERDRLAGFKA TVGKLKPQLA KLGG