Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3769 |
Symbol | |
ID | 7873766 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 4153555 |
End bp | 4154679 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643700713 |
Product | glycosyl transferase group 1 |
Protein accession | YP_002890737 |
Protein GI | 237654423 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGCCC TCAAGGTCCT GCAACTTCTC CCCGCGCTGG ACAGCGGCGG CGTCGAGCGT GGCACGCTCG AGATCGCGCG CGCGCTGGTC GCGGCCGGGC ACGAATCGGT GGTGCTGTCC AGGGGCGGGC GCCTGGTCGG GCAGTTGCAG GACGAAGGCT CGCGCCACCT CGCGCGCGAC CTCGGGCGCA AGTCGCCGAC CACCTTCCTG CACTACCGCG CGCTGCGCAG GCTCTTCGAG GCCGAGCGCT TCGACATCGT GCACGCGCGC TCGCGCCTGC CGGCCTGGGT CGCCTGGCTC GCCTGGCGCG GCATGCCGGC CGACGCCCGC CCACGCTTCG TCACCACGGT GCACGGCATG CACTCGGTCA GCCGCTACAG CGCCATCATG TGCGCGGGCG AGCGCGTGAT CGCGGTCAGC GACACGGTGC GCGACTACAT TCGTACCCAT TACCCGCCGT CGCGCTGGCC GCACCTGGCC GATGAGCACA TCACGGTGAT CCCGCGCGGT ATCGACCCGG CGGAGTTTCC GCGCGACTAC CAGCCTTCAG ACGAATGGCT GGCGCGCTTC CATGCCGAGT TTCCGCAGCT TGGCCAGCGC AAGGTGCTGA CGCTGCCGGG GCGGCTCACG CGGCTGAAGG GACATCACGA CTTCATCACC CTCATCGGCA AGCTGGTCGC GGACGGACTG GACGTGGTCG GGCTGATCGT CGGCGGCGAG GACCCGAAGC GGCCCGGCTA CGCGAAGGAG ATCCGCGAAC GGGTGCAGGC GGAGGGGCTA GGGGAACGCA TCCTCTTCAC CGGTCACCGC AGCGACGTGC GCGAGATCTA CGCGATCTCG GACTGCGTGC TGAGCCTGTC CTCCACGCCC GAATCCTTCG GGCGCACCGT GCTGGAGCCG CTGGCGATGG GGCGGCCGGT GGTGGGGTAT GCGCATGGGG GGGTGGCGGA GATCCTGGGC GAGGTGTTCC CGCATGGGGC GGTGGCGAAG GGGGACGTGG CGGCCGCGAC AAAGCGGGCC GGGGACGTGG TCGCCGGACG GACGCCGGTG GTGGAGTTCA ACACGCGCTT CCTGCTCGAG CGCATGCAGG CGCAGACGCT GGCGGTGTAT GGAGCGCTCG CATGA
|
Protein sequence | MKALKVLQLL PALDSGGVER GTLEIARALV AAGHESVVLS RGGRLVGQLQ DEGSRHLARD LGRKSPTTFL HYRALRRLFE AERFDIVHAR SRLPAWVAWL AWRGMPADAR PRFVTTVHGM HSVSRYSAIM CAGERVIAVS DTVRDYIRTH YPPSRWPHLA DEHITVIPRG IDPAEFPRDY QPSDEWLARF HAEFPQLGQR KVLTLPGRLT RLKGHHDFIT LIGKLVADGL DVVGLIVGGE DPKRPGYAKE IRERVQAEGL GERILFTGHR SDVREIYAIS DCVLSLSSTP ESFGRTVLEP LAMGRPVVGY AHGGVAEILG EVFPHGAVAK GDVAAATKRA GDVVAGRTPV VEFNTRFLLE RMQAQTLAVY GALA
|
| |