Gene Information |
Locus tag | Tmz1t_2790 |
Symbol | valS |
ID | 7873199 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 3019259 |
End bp | 3022243 |
Gene Length | 2985 bp |
Protein Length | 994 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643699712 |
Product | valyl-tRNA synthetase |
Protein accession | YP_002889767 |
Protein GI | 237653453 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | [TIGR00422] valyl-tRNA synthetase |
|
|
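The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive and that the reported gene length includes the terminal stop codon; neither convention is stated in the record itself. The function names `gene_length` and `protein_length` are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 3019259
END_BP = 3022243


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the gene length
    includes one terminal stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


gene_len = gene_length(START_BP, END_BP)
print(gene_len, "bp")                  # compare with the Gene Length field
print(protein_length(gene_len), "aa")  # compare with the Protein Length field
```

Under these assumptions the computed values agree with the Gene Length (2985 bp) and Protein Length (994 aa) fields.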
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.150829 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACTGG CCAAGAGCTT CGAGCCCGCC GACATCGAAC GTCGCTGGTA CCCGGAATGG GAATCCCGCG GCTTTTTCGA CGCCGGGCTC GACAAGTCGA ACCCCGCCGC CTTCTGCATC CTGCTGCCGC CGCCCAACGT CACCGGCACG CTGCACATGG GGCACGGCTT CAACCAGACG ATCATGGACG CGCTCACGCG CTACCACCGC ATGCGCGGCG ACAACACGCT GTGGCAGCCG GGCACCGACC ACGCCGGCAT CGCCACCCAG ATCGTCGTCG AGCGCCAGCT CGACGCCCAG GGCGTCAGCC GCCACGACCT CGGCCGCGAG CGCTTCCTGG AGAAGGTGTG GGAGTGGAAG GAATACTCCG GCGGCACCAT CACCCGCCAG ATGCGCCGCC TCGGCACCAG CCCGGACTGG AAGCGCGAGC GCTTCACGAT GGACGAGGGC CTGTCGCGCA CCGTCACCGA GACCTTCGTG CGCCTCTACA ACGAGGGCCT GATCTACCGC GGCAAGCGCC TGGTGAACTG GGACCCCAAG CTCGGCACCG CGGTGTCCGA CCTCGAGGTG GTGTCGGAGG AAGAGGACGG CAAGCTCTAC CACATCCTGT ATCCGTTCTC CGACGGCCCC ATCGGTGACC TGCAAGGCCT GACGGTCGCC ACCACCCGCC CCGAGACCCT GCTCGGCGAC GTCGCGGTGA TGGTGCATCC CGAGGACGAG CGCTACGCCC ACCTCATCGG CAAGACCGTC GAGCTGCCGC TCACCGGCCG TCATATCCCG ATCATCGCCG ACGACTACGT CGATCGCGAG TTCGGCACCG GCTGCGTGAA GGTCACGCCG GCGCACGACT TCAACGACTA CGCGGTGGGC CAGCGCCACA AGCTCGACAC CATCGTCGTG CTCACCCTCG AAGGCGCCGT GCCCGCGGTG GCCGAGCGCT ACACCGCCGA CGGCGTCACC CTGGAGGGTG TGCCGATGCC GGCGGGCGTC GTCGGGCTCG ACCGCGTACC GGCGCGCGAG AAGGTGGTCG AGGCGCTCGA GGCGCTCGGC CTGCTGCTCG AGGTCAAGGC GCACAAGATG CAGGTGCCGC GCGGCGACCG CACCGGGGTC GTCATCGAGC CGATGCTGAC CGATCAATGG TTCGTCGCCA TGAGCAAGCC GGGCGCGGAC GGCAGGTCGA TCACCGACAA GGCGCTCGAG GTCGTGGCCT CGGGCGAGAT CAAGTTCTAC CCCGAGAACT GGGTCAACAC CTACAACCAG TGGCTCAACA ACATCCAGGA CTGGTGCATC TCGCGCCAGC TGTGGTGGGG CCACCGCATC CCGGCCTGGT ACGACGAGGA AGGCCGCATC TACGTCGCCA CCTGCGAGGA AGAGGCGATC CGCGCCTGGA AGGCCGACCT GCAGCTCGGC ATCGACGCCC TCGACGCCGA GGTGCAGACG CGCCAGCGCG AAGGCCAGAC CGCCGAGCAA TACCCCGAGA TCGCCGAGCG CCTCGCCCTG CTCCACGCCC GCCACGACGC CGGCCGCCTG CGCCAGGAAG ACGACGTGCT CGACACCTGG TACTCGTCCG CGCTGTGGCC GTTCTCCACG CTCGACTGGA CCGCCGAGTG GCCGGAGAAG AGCAACGACG CGCTCGACCT CTACCTGCCC TCCACCGTGC TCGTCACCGG CTTCGACATC ATCTTCTTCT GGGTCGCCCG CATGGTGATG ATGACGAAGC ACATCACCGG CAGGATCCCC TTCAAGCACG TGTATGTGCA CGGCCTGATC CGCGATGCGG AAGGCCAGAA GATGAGCAAG 
TCCAAGGGCA ACGTGCTCGA CCCGATCGAC CTCATCGACG GCATCGCGCT CGACGAACTC ATCAAGAAGC GCACCTTCGG CCTGATGAAC CCGAAGCAGG CGCAGAGCAT CGAGAAGAAG ACGCGCAAGG AATTCCCCGA GGGCATCCCC GCCTTCGGCA CCGACGCGCT GCGCTTCACC TTCGCCTCGC TCGCCAGCCC TGGCCGCGAC ATCAAGTTCG ACCTCGCGCG CTGCGAGGGC TACCGCAACT TCTGCAACAA GCTGTGGAAC GCCACCCGCT TCGTGCTGAT GAACTGCGAG GGCCAGGACT GCGGCATCGG CGAGACCGTC GCCTGCTCCA CCGAGGTGCT CGACTTCTCC TTCGCCGACC GCTGGATCGT GTCGCGCCTG CAACGCACCG AGGCCGAGGT CGCAGAACAC TTCGCGGCCT ACCGCTTCGA CCTGGTCGCG CGTGCGGTGT ATGAGTTCGT CTGGGACGAG TACTGCGACT GGTACCTGGA GCTCGCCAAG GTGCAGATCC AGTCGGGCAC CCCGGCGCAG CAGCGCGCCA CCCGCCGCAC GCTGCTGCGC GTGCTCGAGA CCGTGCTGCG CCTGGCACAC CCGCTGATCC CCTTCATCAC CGAAGAACTC TGGCAGACGG TCGCCCCGCT CGCCGGGCGC AAGGAAGGCG ACAGCATCAT GCGCGCGCGC TACCCGCAGG CCGAGCCCAA GCGCATCGAC GAGGCCTCCG AGGCCAAGGT CGCCGAGCTC AAGGCGATGA TCTACGCCTG CCGCAACCTG CGCGGCGAGA TGAACATCTC CCCGGCGCAG CGCCTGCCGC TGGTGGCCGC GGGCGACAAG GCGGCACTGG CGCTGTACGC GCCTTACCTC GCCGGCCTCG CCAAGCTCGC CGAGGTGCAG GTCGTGGACG AGATCGGCGC CGACGAGCTC GCCCCGGTCG CGGTCGCAGG CGAGACCCGC CTGATGCTGA AGGTGGAGAT CGACGTCGCC GCCGAGCGCG AGCGTCTGGG CAAGGAGATC GCCCGCCTGG AGGGCGAGAT CGCCAAGGCC GAAGGCAAGC TCGGCAACGC CAGCTTCGTA GACCGCGCGC CGGCCGCGGT GGTGCAGCAG GAACGCGACC GCCTGGCGGG CTTCAAGGCC ACGGTGGGCA AGCTCAAGCC GCAGCTCGCC AAGCTGGGGG GCTGA
|
Protein sequence | MELAKSFEPA DIERRWYPEW ESRGFFDAGL DKSNPAAFCI LLPPPNVTGT LHMGHGFNQT IMDALTRYHR MRGDNTLWQP GTDHAGIATQ IVVERQLDAQ GVSRHDLGRE RFLEKVWEWK EYSGGTITRQ MRRLGTSPDW KRERFTMDEG LSRTVTETFV RLYNEGLIYR GKRLVNWDPK LGTAVSDLEV VSEEEDGKLY HILYPFSDGP IGDLQGLTVA TTRPETLLGD VAVMVHPEDE RYAHLIGKTV ELPLTGRHIP IIADDYVDRE FGTGCVKVTP AHDFNDYAVG QRHKLDTIVV LTLEGAVPAV AERYTADGVT LEGVPMPAGV VGLDRVPARE KVVEALEALG LLLEVKAHKM QVPRGDRTGV VIEPMLTDQW FVAMSKPGAD GRSITDKALE VVASGEIKFY PENWVNTYNQ WLNNIQDWCI SRQLWWGHRI PAWYDEEGRI YVATCEEEAI RAWKADLQLG IDALDAEVQT RQREGQTAEQ YPEIAERLAL LHARHDAGRL RQEDDVLDTW YSSALWPFST LDWTAEWPEK SNDALDLYLP STVLVTGFDI IFFWVARMVM MTKHITGRIP FKHVYVHGLI RDAEGQKMSK SKGNVLDPID LIDGIALDEL IKKRTFGLMN PKQAQSIEKK TRKEFPEGIP AFGTDALRFT FASLASPGRD IKFDLARCEG YRNFCNKLWN ATRFVLMNCE GQDCGIGETV ACSTEVLDFS FADRWIVSRL QRTEAEVAEH FAAYRFDLVA RAVYEFVWDE YCDWYLELAK VQIQSGTPAQ QRATRRTLLR VLETVLRLAH PLIPFITEEL WQTVAPLAGR KEGDSIMRAR YPQAEPKRID EASEAKVAEL KAMIYACRNL RGEMNISPAQ RLPLVAAGDK AALALYAPYL AGLAKLAEVQ VVDEIGADEL APVAVAGETR LMLKVEIDVA AERERLGKEI ARLEGEIAKA EGKLGNASFV DRAPAAVVQQ ERDRLAGFKA TVGKLKPQLA KLGG
|
| |
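The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is an illustration, not the database's own pipeline: it builds the standard codon table, whose amino-acid assignments translation table 11 shares (table 11 differs in which codons may act as start codons), and translates the first 30 and last 15 nucleotides copied from the record. The helper name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code in NCBI order (first, second, third base over T, C, A, G).
# Translation table 11 uses the same amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is removed first so the 10-base blocks used in the record
    can be pasted in directly. Trailing bases that do not fill a codon
    are ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in the record.
print(translate("ATGGAACTGG CCAAGAGCTT CGAGCCCGCC"))
# Last 15 bases of the gene sequence, ending in the TGA stop codon.
print(translate("AAGCTGGGGG GCTGA"))
```

The two outputs can be compared with the start (`MELAKSFEPA`) and end (`KLGG`) of the protein sequence listed above.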