Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3904 |
Symbol | |
ID | 7873552 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 4302505 |
End bp | 4303629 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643700843 |
Product | glycosyl transferase group 1 |
Protein accession | YP_002890866 |
Protein GI | 237654552 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCATCG CCTTCCTCTG CAAACGCCGC TACATGGGCA AGGACGTGAT CCTCGACCGT TACGCCCGGC TCTATGAGAT TCCTCGCCAG CTTGCCCATC TGGACAACGA AGTGGGCGCT TTCTGCTTGG ACTACCACGC AGCGGACACC GACGGTTGCT GGGAACATGA GGCAGCGCCG GGCAGGCTGA GATGGCATTC GCTTTCGGTC GGAAGAACCC GTCTGCCCAG GCTGGCGGCC TACCCTTGGC ATTTGCTGCG GCAACTGCGC GCCTTCAAGC CCGACATCCT GGTGGGTGCC TCCGATATTC CCCACGTGGT GCTGGCACGG TGGCTGGCCA GGCGCTTGCA AGTTCCCTAC GCAATAGACC TCTATGACAA TTTCGAAGGC TTCGGCCAGG CCCGTATTCC CGGCTTCGTG CCGGCACTGC GCCGCGCCGT GCGCGACGCA ACTGTCGTAA CCACGACCAG CGAACCGCTT CGCCAGAAGG TGCTGGCCGA CGGCGCCCGG GGCACCGTCA TCGCCATGCC CAGCAGCGTG GACCTTGCGG TCTTTCACCC CGGCGACAAG GCGCAGGCCC GCCAGGCCCT GAGCCTGCCG CAGGACGGCA AACTGGTCGG CACGGCCGGT GGCCTGTACC GGGAAAAAGG CATCGAGCCA CTGTACGCCG CCTGGCCAGC GCTCGCAGCC AGCCGCCCCG ACGTGCATCT GGTGCTGGCC GGCCCACTGG AAAACGGCTT CGCGCCTCCA CAGGGCGAGC GCGTGCACTA CCTCGGTCAC CTCGCACACG GCCAGATCGC CAACCTGTTT CGTGCGCTGG ATGTGGGCAT CATCTCCATC CTCGACACCC CCTTCGGCCG CTACTGCTTC CCGCAGAAGG CGTACGAAAT GCTTGCCTGC AAGCTACCGG TCGTGGCCAC CGCCATCGGG CAGATGCGCG AAGTGTGCGC CAGCACGCCG CAGGCGCTTT TTGCCCCAGG CGATTCGACG GCACTCACCC GCGCCGTGCT GTGGCAATTG CAGTCGGGTT CCACGCCAGC CGTGCCCATC GCCGACTGGA AGACGCTGAT TGGCAGCATC GAACCCGTGC TGAAGAGGCA GTCAGGAGCG GCAGCAGGGG GATAA
|
Protein sequence | MRIAFLCKRR YMGKDVILDR YARLYEIPRQ LAHLDNEVGA FCLDYHAADT DGCWEHEAAP GRLRWHSLSV GRTRLPRLAA YPWHLLRQLR AFKPDILVGA SDIPHVVLAR WLARRLQVPY AIDLYDNFEG FGQARIPGFV PALRRAVRDA TVVTTTSEPL RQKVLADGAR GTVIAMPSSV DLAVFHPGDK AQARQALSLP QDGKLVGTAG GLYREKGIEP LYAAWPALAA SRPDVHLVLA GPLENGFAPP QGERVHYLGH LAHGQIANLF RALDVGIISI LDTPFGRYCF PQKAYEMLAC KLPVVATAIG QMREVCASTP QALFAPGDST ALTRAVLWQL QSGSTPAVPI ADWKTLIGSI EPVLKRQSGA AAGG
|
| |