Gene Information |
Locus tag | Tmz1t_1125 |
Symbol | |
ID | 7084654 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1234066 |
End bp | 1235283 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643698140 |
Product | glycosyl transferase group 1 |
Protein accession | YP_002354780 |
Protein GI | 217969546 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
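The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon; both assumptions fit the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp rows of this record.
length_bp = gene_length(1234066, 1235283)
print(length_bp)                  # 1218
print(protein_length(length_bp))  # 405
```

Both results match the Gene Length (1218 bp) and Protein Length (405 aa) rows.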
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.243965 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAGTTC TTTACTTCCA TCAGCATTTC TCCACCCCCA AGGGCGCCGC GGGTATCCGG TCCTATGAAA TGGCGCGCCG CCTGGTCGCG CGCGGGCATC AGGTGACCAT GGTGTGCGGC AGCTACGGCC TCGGCGAGAC CGGGCTGTCG GCGCCGTTCG CCAAGGGCGT GCGGCGCGGC AACGTCGATG GAATCGACAT CGTCGAGTTC GACCTGGCGT ATTCCAACGC CGATGGCTTC GTGAAGCGGG CGATGACCTT CGTGAAGTTC GCCCTGCGCA GCGTGAAGTT GGCGCTGACC GAGCGCTACG ACGTGGTCTT TGCCACCACC ACGCCGCTGA CCGCCGGCAT CCCCGGCATC TTTGCGCGCT GGCTGCGCGG CAAGCCCTTC GTGTTCGAGG TGCGCGACCT GTGGCCGGAG CTGCCCAAGG CGATGGGCGT GATCCGCAAC CCGCTGGTGC TGGGCGCAAT GTCCTTCTTG GAGTGGGCGA GCTACCGTTC GGCGCACCGC CTGGTCGGGT TGTCGCCCGG CATCGTGGAG GGCATCGCCC GCCGCGGTGT GCCGCGTGAG CGCATCACGC TGGTGCCCAA CGGCTGCGAC CTGGAGATCT TCGGCGGCGA GGTTGTGCCG TGGCGGCCGG AGGCGGTCAA GCCGACCGAC CTGCTGGCGG CCTTCACCGG CACGCACGGC ATGGCCAACG GGCTGGATGC GGTGCTGGAT GCGGCGGCGG TGCTCAAGCG CCGCGGGCGC GACGACATCA AGATCCTGCT CATAGGGCAG GGCAAGCTCA AGCCCGCGCT GCAGGCGCGT GCCGAGCGCG AAGGGCTGTG GAACGTGGTG TTCCATGACC CGGTGAACAA GGCCAGGCTG GCCGGGCTGA TGGCAGGCAC CGACGTGGGC ATGCAGATCC TGGCGAACGT GCCGGCCTTC TACTACGGCA CCTCGCCCAA CAAGTTCTTC GACTACATCG CTGCCGGGCT GCCGGTGCTG AACAACTATC CGGGCTGGCT GGCCGGGATG ATCGAGGAGC ACCGCTGCGG CTTCGCCGTG CCGCCGGACA ACCCCAACGC CTTTGCCGAT GCGCTGGAGA AGGCGGCCGA CGACCGCGGG GCGCTGAAGG AAATGGGTCA GCGCGGCAAG GAACTGGCGA TCCGCGAGTT CGACCGCCAG AAGCTGGCCG ACCGCTGGGT GGATTGGCTG GAGGGCGCGC GGCGATGA
|
Protein sequence | MRVLYFHQHF STPKGAAGIR SYEMARRLVA RGHQVTMVCG SYGLGETGLS APFAKGVRRG NVDGIDIVEF DLAYSNADGF VKRAMTFVKF ALRSVKLALT ERYDVVFATT TPLTAGIPGI FARWLRGKPF VFEVRDLWPE LPKAMGVIRN PLVLGAMSFL EWASYRSAHR LVGLSPGIVE GIARRGVPRE RITLVPNGCD LEIFGGEVVP WRPEAVKPTD LLAAFTGTHG MANGLDAVLD AAAVLKRRGR DDIKILLIGQ GKLKPALQAR AEREGLWNVV FHDPVNKARL AGLMAGTDVG MQILANVPAF YYGTSPNKFF DYIAAGLPVL NNYPGWLAGM IEEHRCGFAV PPDNPNAFAD ALEKAADDRG ALKEMGQRGK ELAIREFDRQ KLADRWVDWL EGARR
|
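The sequences above are printed in space-separated blocks of 10 characters, so they need to be cleaned before any analysis. The following is a minimal sketch of how one might strip the spacing, compute GC content, and split a CDS into codons. The helper names are hypothetical, and the demo uses only the first two blocks of the gene sequence rather than the full 1218 bp.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def codons(seq: str) -> list[str]:
    """Split a cleaned sequence into complete codons, dropping any trailing partial codon."""
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


# First two 10-base blocks of the gene sequence in this record.
prefix = clean_sequence("ATGAGAGTTC TTTACTTCCA")
print(len(prefix))          # 20
print(gc_fraction(prefix))  # 0.35
print(codons(prefix))       # ['ATG', 'AGA', 'GTT', 'CTT', 'TAC', 'TTC']
```

Under the standard bacterial code (translation table 11), those six codons read M, R, V, L, Y, F, which agrees with the start of the protein sequence. Running `gc_fraction` on the full cleaned gene sequence should give a value comparable to the GC content row.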