Gene Information |
Locus tag | Tmz1t_3271 |
Symbol | |
ID | 7874492 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3582717 |
End bp | 3585023 |
Gene Length | 2307 bp |
Protein Length | 768 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643700205 |
Product | glycosyl transferase group 1 |
Protein accession | YP_002890243 |
Protein GI | 237653929 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCATCC TGCACATCCT CGATCACTCG ATCCCGCTGC ACAGCGGCTA CACCTTCCGT ACCGCGGCGA TCCTGCGCGA GCAGCGCGCG CTCGGCTGGG AGACCTTCCA CCTGACCTCG CCCAAGCAGG GCGAGACGAA GGCCATGGTG GAGGAGATCG AGGGCTTGCG TTTCCACCGT ACCGGCGTGC CGACGCCGGC GAGCAGCGGC ATCGCCGAGC TGCGCCAGAT CCGTGCGGTG CAGGCGCGCA TCGAGCAGCT CGCGGCCGAA CTGCGCCCCG ACATCCTCCA CGCCCACTCG CCGGTGCTCA ACGCCATTCC GGCCATCCGC GCCGGGCGCA AGCTCGGCAT CCCGGTCGTC TACGAGATCC GCGCCTTCTG GGAAGACGCC GCGGTCGACC ACGGCACCAC CACCGAGGGC AGCCTGCGCT ACCGCGCCAC CAAGGCGCTC GAGACCTGGG CGATCAAGCG CGCCGACCAC GTCTTCACCA TCTGCGAAGG CCTGCGCGCC GACATCGTCG GCCGCGGCAT CCCGGCGGCC AAGGTCACGG TCATCCCCAA CGCGGTCGAC ATCGAGTCCT TCCAGCTCTC GGGCGACGCC GACCCTGCCT TGCGGGAGCA GCTCGGCCTC GCCGGCACCA CCGTGGTGGG CTTCGTCGGC TCCTTCTACG CCTACGAGGG CCTCGATCTG CTGCTCGAGG CCTTCCCCGC GCTGCTGCAG AAGCGCCCCG AGCTGCGCCT GCTGCTCGTC GGCGGCGGGC CGCAGGACGA AAACCTCAAG GCTCAGGCCC TGCGCCTCGG CGTGGCCGAC AAGGTCGTCT TCACCGGGCC GCGTGCCGCA CAAGGACGTC AGCCGCTACT ACGACCAGAT CGACCTGCTC GCCTATCCGC GCCACTCCAT GCGACTCACC GAGCTCGTCA CCCCGCTCAA GCCGCTCGAG GCCATGGCGC AGGGTCGGCT CTTCGTCGCC TCCGACGTCG GCGGCCACAA GGAGCTGATC CGCGACGGCG AGACCGGCAG GCTGTTCAAG GCCGGCAGCG CCGAGGCGCT CGCCGCCGCC ATCGACGACC TGCTCGCGCA CCGCGAGCGC TGGCCGGCGA TGCGCGCGGC GGGGCGGCAG TTCGTCGAGG ACGTGCGCAA CTGGACGAAC AGCGTGGCGA ATTACACGCC GGTGTATCGC AGCCTGGTCG CGAAGCCTCG CGCTGCGGCA TGAATTTCTC CGCGCTGCGC GTCCTGCTCG TCGGGCCGCT GCCGCCGCCT GCCGGCGGCA TGGCCAACCA GACCCGCCAG CTCGCCGAGC TGCTGCGCGG CGAGGGCGCG TCGGTCGAGG TCGTGCAGGT CAATGCGCCG TATCGTCCGG CTTGGGTCGA AAGGCTGAAG GGCGTGCGCG CGCTGTTCCG GCTCCTGCCC TACCTGCTGC GGCTGTGGCA GGCCGCAGGG CGGGCGAACC TCATGCATGT GATGGCCAAC TCCGGCTGGT CCTGGCATCT CTTCGCCGCG CCGGCGGTGT GGATCGGCTG GCTGCGTGGC GTGCCGGTGG TGGTCAATTA CCGCGGAGGC GAGGCGGCCG GCTTCCTTGC CCGCTCGGCG GCGGTCGTGC GTGCCACCCT GCGCCGCGCG TCCAGCCTGG TGCTGCCGTC CGGCTTCCTG CTCGAAGTCT TCGCGCGCCA TGGCATGGCC GGGCGCATCG TCTCCAACAT CGTCGACCTC GAGCGTTTCC ATCCCGCGAC GGCGCCGCGC GCGGCCGGCG AGGGCCCGCA TCTGGTCGTC GCCCGCAACC TCGAAGCCCT CTACGGCAAC 
GACACCGCGC TGCGCGCCTT CGCGCGGCTG TGCGCGGCCT GGCCGGCGGC GCGCCTCTCG ATCGCCGGCA GCGGGCCGGA GGCCGCGGCG CTGGCCGCGC TCGCGCAGGA ACTCGGCATC GCCGAGCGGG TCCGCTTCAC CGGTCGCCTG GACGGCGAGC AGATGGCTGC GCTCTACCGT GACGCAGATC TCATGCTCAA CCCGAGTCGG GTGGATAACA TGCCCAATGC CATCCTCGAG GCGCTGGCGA GCGGCCTGCC GGTGGTCACC ACCGACGTCG GCGGCATTCC CTTCATCGTC ACGCAAGGCC ACACGGCCAT GCTGGTTCCA CCCGACGATC CGCAGGCGAT GGCCGACGCC GCGCACAAGG TGCTTGCCGA CACCGGCCTG CGCGCCGCGC TGGTCGCCGC CGGGTGCGCC GAGGTGCAGC GCTACCGCTG GAGCTCGGTG CGCGCACAAC TGCTGTCCGC CTACGCGGAT GCGATTGCCG CGAAGAGGGC CACATGA
|
Protein sequence | MRILHILDHS IPLHSGYTFR TAAILREQRA LGWETFHLTS PKQGETKAMV EEIEGLRFHR TGVPTPASSG IAELRQIRAV QARIEQLAAE LRPDILHAHS PVLNAIPAIR AGRKLGIPVV YEIRAFWEDA AVDHGTTTEG SLRYRATKAL ETWAIKRADH VFTICEGLRA DIVGRGIPAA KVTVIPNAVD IESFQLSGDA DPALREQLGL AGTTVVGFVG SFYAYEGLDL LLEAFPALLQ KRPELRLLLV GGGPQDENLK AQALRLGVAD KVVFTGPRAA QGRQPLLRPD RPARLSAPLH ATHRARHPAQ AARGHGAGSA LRRLRRRRPQ GADPRRRDRQ AVQGRQRRGA RRRHRRPARA PRALAGDARG GAAVRRGRAQ LDEQRGELHA GVSQPGREAS RCGMNFSALR VLLVGPLPPP AGGMANQTRQ LAELLRGEGA SVEVVQVNAP YRPAWVERLK GVRALFRLLP YLLRLWQAAG RANLMHVMAN SGWSWHLFAA PAVWIGWLRG VPVVVNYRGG EAAGFLARSA AVVRATLRRA SSLVLPSGFL LEVFARHGMA GRIVSNIVDL ERFHPATAPR AAGEGPHLVV ARNLEALYGN DTALRAFARL CAAWPAARLS IAGSGPEAAA LAALAQELGI AERVRFTGRL DGEQMAALYR DADLMLNPSR VDNMPNAILE ALASGLPVVT TDVGGIPFIV TQGHTAMLVP PDDPQAMADA AHKVLADTGL RAALVAAGCA EVQRYRWSSV RAQLLSAYAD AIAAKRAT
|
| |
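The coordinate and length fields in this record can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon, which the reported protein length does not count. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the record above.
START_BP = 3582717
END_BP = 3585023
GENE_LENGTH_BP = 2307
PROTEIN_LENGTH_AA = 768


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks hold for the values listed in this record.
    assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    print("coordinates, gene length and protein length are consistent")
```

Under those assumptions, 3585023 - 3582717 + 1 gives 2307 bp, and 2307 / 3 - 1 gives 768 aa, matching the Gene Length and Protein Length fields. The strand field does not affect these checks, since both depend only on the span of the feature.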