Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0325 |
Symbol | |
ID | 7085626 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 371626 |
End bp | 374361 |
Gene Length | 2736 bp |
Protein Length | 911 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643697362 |
Product | Glycosyltransferase 28 domain protein |
Protein accession | YP_002354010 |
Protein GI | 217968776 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0707] UDP-N-acetylglucosamine:LPS N-acetylglucosamine transferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACCA CCGGGAAGGT CGTCATCTTC TATTCTTCCA TCGGATACGG ACACATCAGT GCCGCGCAAT CCATTCAGGA CGAGATACGG CGGCAATCTC CGGCGACCCG GGTGCTGATG CAGGACATCC GCACGTTCAT GCATCCGGTG TGGCGACGGG TTGACGAGCG CCTGTACTGG TTCGTCGCCA ACCATCTGCC GGAGTGCTTC GAGAGTCTGT TCCGTACGAT GCAGGCGCGC GGCAGCCGTG TGGCCTCGCT GTCGATGCTG CCCAACGATT ATCCAGAAGA AAGCGTATCG GCCTACTTGA CCGCGCAGCG GCCTGATGCC GTTCTCGCCA GCCATTACGG GGCGGCCCAA GTGCTGGGAA CCTTGCGGGA AAAAGGACTG CTGTCCGACA CGAGAATCGG TTGGCTGCAT ACGGATTTCT TCGAGGGCTA TTTCCCGCGC ATCTCCAAGC GGATCGACCG TACCTTTCTT GCCCACCCGG AACTGAAAAC GCGCTGGCTG GCCGCAGGCG TTCCGGCCGA TAAAATCGTT ACCAGCGGCA TGCCGGTGCG GATTCCCGCG GCATCCGCCG ACGCCCGCCG TGCCACGCTC CAGGGCCTTG GGCTGTCGTT GGAGGTGCCG ACACTGCTGC TGACCGGCGG CAAGGAAGGA GCGGGCGACT ATCTGGGGGT CGTCGAAAGC ATCGTTCGCC GCCGCCCGGG CCGCCTGCAG ATCATCGCAG TGTGCGGCAC GAATACGCGG CAATACGAGG CGCTTGCCGA TCTGCGCGAA AGGCTGCCGG ACACGGTGAC GTTGAAGCCG CTGGGGCTGC TGCCGCGCAG CGAGATGGCG TCGTGCATGG CAGCCACCGA CATCCTGGTC ACCAAGGCCG GCGGGATGAC GCCCGCAGAG GCTTTTGCCC TCGGGGTGCC GACCGTGCTG CTCGACGTGA TCAGCGGGCA TGAACGCGAA AACGCCGCGC TGTTCCTGCG CCAGGGCCTG GCCCGGTTCG CGGCCAGCGC CGACGATGCG GGCAGATCCG TGATGGAACT GCTGGGCGAC CCTGCCGAGC GCGAGGCCAT GCTGCGAGCC CAGCAGGAGT TCCGGCAGGG CATCGACATC GCGAGCATCG TGCGGTTTGC GCTGGACGAT GGCTTCCGGC CAGGCCGCCC GCTGCCCGGC TATGGCGTCG AGAACGGCGC GCCCGTCCAG GGCATCGATC AGGCGCTGGC GCAACTGGAT TCCGAAGCAC CGGCAGAGGT CGAGCTGCTG TTGTCCTACG CCACGTCCAA AACGCCGCAA CGGGTCGTTC TCGAAAACCC GTTCGGGCAT CTTGCCGTGC GCATCGGCGA CACCGTCTAC AGCGCGAACT ACATCGCGGA CCCGTCCGTC GACCCGAACT TCCTGCAGCA CGTGAGCCTG GCGGACTACC TGTACGGCAT CCATCGTCCG TCCCGTTCGC AAGTTCACAC CAACACCTAC GGCATGGCCT ATGGCCGTGA GACGCTCGGG CTGCGGGTCC AGGGCATTCC CGCCGAGCGC CGTGCGGCGA TGGTGGCCGA GGCCCACCGC ATCGAAAACG GATTCCGGGA CGCAAGCCTG CGCTGGAGCA GGAGCGATCT CAATTGCGCC GACGTGGTTG CGCGAATTCT CGCCGCCGGT GGCTACGACG ACCGCTCCTT GTACGACCGG GCGGGTCTGC CGAGCATGCC GCTGGATTTG TTCGAGCGGA TGCGGGCGCA CTTCGAGGCC GATACTTCCT TGCGGGAAGA ACTGGTGGCC TACCGATTGC TGCCTGGAAC GCAGGCAAGC TACCGCTTCT CCCGTTTTCC GCTGTCGGTC GGGCAGCCAT TGCGCTCGAT GGCGCGCGTC CTGAGCGATG CACCGCGGGA TGCGCTTGAG CAGGCAGCGA CCCGGCAGGT CACCGGTTAC TTCGGCGACC GGCGGCTCTA CGTGGAGAAC CTGCGGGCAT GCTGGCCAGC TTCGGAATCT GCGCACTCCT CCTTGACCAG CCGTCCTCAT TTGGCGCTGG CGCAGGCCAT CCGCGCCGAT TTGCGGCGTC TGCTCGCGGT ATACGTGAAA CTGCCGCTCA AGCGGATCGA ACGCCTGGGC AGCTTCCCCG CCGCACAGGA GTTCCACCGT CTCGCCGACC GCGGGCTCCA GCTCGCCCGA CTCGCCACCG AGCACGTCGA AGATGACTTG CATCCCCGCA CCGATCGCCT GCGCGCGCTG TTCACCCGGC TCGTGGAGGA CTACGGGCGG ATTAACCCGC AGCGTTTGGA GTCGCGCCAC GTGCGGTCAT ACCTTGCGCG GCTTCGGGCG TTCGAGTCGA CGCTGGGACG TGAACTCGTA CCTACCGGTA CTGCGTGGGC GCTCCTGCCC GCCTGGTGGC ACAGGATAGC GGCGCTCGCT CGCACCAGTG CGCGGTCGCA CCCTCGCAGC GGGGGGACGA GTGGGTTCAG CGGCATGCGG GTTGATCCGG GCGAAGTCGA GCTTAACGGC GATGCGAAAG AGCACGGTGC GCATCCTCTC GAAGCTGATC CGGCCGCGCG TTTGACGCTT GGTGACCTGG AAAAACCCAT TCAGGGCTTC CAGGAGGCTG TCAGCCTGGT AGGTCTGGGT CCAGGCGGCG AAGCCCTCAA AGGAGCTTCG GATCAGCCAG GCGACGTCCC TCATCGGCTT GAGACCCCTA TGACCGTTGC CACTGACCAT GAGGCACCCG ATTTTTCGCA ATTCCATTAC CATTGA
|
Protein sequence | MSTTGKVVIF YSSIGYGHIS AAQSIQDEIR RQSPATRVLM QDIRTFMHPV WRRVDERLYW FVANHLPECF ESLFRTMQAR GSRVASLSML PNDYPEESVS AYLTAQRPDA VLASHYGAAQ VLGTLREKGL LSDTRIGWLH TDFFEGYFPR ISKRIDRTFL AHPELKTRWL AAGVPADKIV TSGMPVRIPA ASADARRATL QGLGLSLEVP TLLLTGGKEG AGDYLGVVES IVRRRPGRLQ IIAVCGTNTR QYEALADLRE RLPDTVTLKP LGLLPRSEMA SCMAATDILV TKAGGMTPAE AFALGVPTVL LDVISGHERE NAALFLRQGL ARFAASADDA GRSVMELLGD PAEREAMLRA QQEFRQGIDI ASIVRFALDD GFRPGRPLPG YGVENGAPVQ GIDQALAQLD SEAPAEVELL LSYATSKTPQ RVVLENPFGH LAVRIGDTVY SANYIADPSV DPNFLQHVSL ADYLYGIHRP SRSQVHTNTY GMAYGRETLG LRVQGIPAER RAAMVAEAHR IENGFRDASL RWSRSDLNCA DVVARILAAG GYDDRSLYDR AGLPSMPLDL FERMRAHFEA DTSLREELVA YRLLPGTQAS YRFSRFPLSV GQPLRSMARV LSDAPRDALE QAATRQVTGY FGDRRLYVEN LRACWPASES AHSSLTSRPH LALAQAIRAD LRRLLAVYVK LPLKRIERLG SFPAAQEFHR LADRGLQLAR LATEHVEDDL HPRTDRLRAL FTRLVEDYGR INPQRLESRH VRSYLARLRA FESTLGRELV PTGTAWALLP AWWHRIAALA RTSARSHPRS GGTSGFSGMR VDPGEVELNG DAKEHGAHPL EADPAARLTL GDLEKPIQGF QEAVSLVGLG PGGEALKGAS DQPGDVPHRL ETPMTVATDH EAPDFSQFHY H
|
| |