Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1801 |
Symbol | |
ID | 7085771 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2023415 |
End bp | 2024641 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 643698823 |
Product | glycosyl transferase group 1 |
Protein accession | YP_002355449 |
Protein GI | 217970215 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.115818 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAACGC TCGACTACAC CACCGAAGAC CTCTGCCGCG CGCCGCGCCT GCGCATCGCG CTTGTGACCG AGACCTGGGC GCCCGAGGTC AATGGCGTGG CCATGACCCT GGGGCGGATG GTCGACGGCC TCATCCGCCG CGGCCACGGC GTACAGCTCA TCCGCCCGCG CCAGCGCCCC GGGGAGACCG CGGCGCACGG CGAGGGCCTG GAAGAAGTGC TCGCCCGCGG CCTGCGCCTG CCGCGCTACG ACGGCCTCAA GCTCGGGCTG CCGGCGCGGG TGCGTCTCGT GCGCGAGTGG TCGCGCCAGC GCCCGGACCT GGTGCACGTG GCGACCGAGG GGCCGCTCGG GTGGACCGCG GTCACCGCGG CCAACAAGCT GCGCATCCCG GTCAGCTCCG ACTTCCACAC CAATTTCGAC CACTACAGCG GCCACTATGG CATGGGCTGG CTGCGCCAGC CGGTGGCGGC CTACCTGCGC CGCTTCCACA ACCGCAGCGC GGCGACCTTC GTGCCAACCG CGGCGCTCGC GGCGCAACTC TCGGCGCAGG GTTACCGCAG CGTGGAGGTG ATCTCGCGCG GGGTCGACAC CGCGCTGTAT TCGCCGGCGC GCCGCGACGA GGCCTTGCGC CGTGCCTGGG GCCTACCCCC GGGCGGGCTG GCGGTGATCA GCGTCGGCCG CCTGGCGCCG GAGAAGAACC TCGGCCTCGC GATGCGCGCC TTCGCAGCGA TCCGCCGCCT GCGCCCGGAC GCGCGCATGG TCCTGGTCGG CGACGGCCCC CAGCGTGCGG CGCTGGCGCG CGCCCACCCC GACGCCGTCT TCGTCGGCAT GCGTCACGGC GAGGACCTCG CCGCGCACTA TGCGTCGGCC GATCTGTTCC TGTTCCCCAG TCTCACCGAG ACCTTCGGCA ACGTCACCCT CGAGGCGATG GCGAGCGGCG TGTGCCCGGT GGCCTACGAC TACGCCGCCG CCGCCGAGGT GATCCGCGAC CTCGGCAACG GTGCCAGCGT GGCCTGTGGC GACGAAGAGG GCTTCATCGC GCGTGCCGTA CAGATGGCCG GGGCCGATGC GCTGCGCGCG GAGCTCGCGC GCGCCGCGCG CCGCAGCGCC GAGGCGATCG ATTGGGAGCG GGTGAACGAT CGCTTCGCCG CGGCCCTGCT GCGCGTGTGG CGGGCGGGCA GCGGCCGGCC CCTCGATTTG TCCGAACCCC GCCCGGAGGA GACCTGA
|
Protein sequence | MRTLDYTTED LCRAPRLRIA LVTETWAPEV NGVAMTLGRM VDGLIRRGHG VQLIRPRQRP GETAAHGEGL EEVLARGLRL PRYDGLKLGL PARVRLVREW SRQRPDLVHV ATEGPLGWTA VTAANKLRIP VSSDFHTNFD HYSGHYGMGW LRQPVAAYLR RFHNRSAATF VPTAALAAQL SAQGYRSVEV ISRGVDTALY SPARRDEALR RAWGLPPGGL AVISVGRLAP EKNLGLAMRA FAAIRRLRPD ARMVLVGDGP QRAALARAHP DAVFVGMRHG EDLAAHYASA DLFLFPSLTE TFGNVTLEAM ASGVCPVAYD YAAAAEVIRD LGNGASVACG DEEGFIARAV QMAGADALRA ELARAARRSA EAIDWERVND RFAAALLRVW RAGSGRPLDL SEPRPEET
|
| |