Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3032 |
Symbol | |
ID | 7874502 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3282317 |
End bp | 3283492 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643699955 |
Product | tetratricopeptide repeat protein |
Protein accession | YP_002890007 |
Protein GI | 237653693 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2956] Predicted N-acetylglucosaminyl transferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.362688 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAATCG AATACTGGTG GCTGCTCGCG CTGCCGCTGT TCTTTGCGCT CGGCTGGCTG GCCGCGCGCA TCGACATCCG CCAGGTGGTG CAGGAGTCGC GCGCGCTTCC GCGCTCCTAC CTGAGCGGGC TCAATTTCCT GCTCAACGAG CAGCCCGACA AGGCGATCGA CGCCTTCGTC GAGGCGGTGC GCATCGACCC GCAGACCGTC GAGCTGCACT TCGCGCTCGG CAGCCTTTTT CGCCGTCGCG GCGAGACCGA CCGCGCGATC CGCATCCACC AGCTGCTCGT CGACCGCGAG GACATCAGTG ACGAGCATCG CCTGCAGGCG CTCGGCGAGC TTGGCCAGGA TTTTCTCAAG GCCGGCCTGC TGGACCGCGC GGAGGCTGCC TTTCTGCGCC TGCGCGGCAC GCGCGCCAAC GATGTCGCGC TGCGCTATCT GCTCGAGATC TACCAGCAGG AGAAGGACTG GGCCAAGGCG ATCGAGGTCG CCGAGGCGCT GCCTGGCCAT GAAGGCGTGA TGTGGCACAC CGAGGTCGCC AACTTCCATT GCGAGCTCGC TGCCACGGCG CTCGCGAACT CGCGCCACGA CGAGGCGCGG GGCCATCTCG ATCGTGCCTT CGAGGTCAAT CGCCGCTGCG TGCGCGCGAG CCTGCTGCTC GGCGACCTGC ATGCCGCCCA GGGCCGTGAC GAGGAAGCGC TGGAAGCCTG GCAGCGCATC GAAAACCAGG ACCCCAATTA CCTCGCCCTC GTCGCCGAGC GCGTCATGGA CGCCTGCGGG CGCCTCGGTC GCGTGGCTCA GGGGCACCAG CTGTTGCGCG CCTGGCTCGC CGGGCACGCC TCGCTCGACC TGCTCGACGA GCTCTTCCAC TGGGAGCTCG AGCGCGAGGG GCCCAAGGCG GCCTACGAGA TGGTGCGCGA GGAGCTGCGC CGCAACCCCA CGCTGCTCGG GCTCGACAAG CTGCTCGAGG CGGCGGCGCT CAACGCGCCG GCCGAGCAGC GCGCCGACAT CGACCTCATC AAGCAGCTCA TCCACGGTCA CACCCGACGC GTCGCACGCT ATCGCTGCAA CACCTGCGGC TTCAAGGCGC GCCAGTTCCA CTGGCGCTGC CCGGCCTGCG GCGGCTGGGA GACCTATCCA CCGCGGCGCA CCGAAGAGTT CGACCTGACG CCCTGA
|
Protein sequence | MEIEYWWLLA LPLFFALGWL AARIDIRQVV QESRALPRSY LSGLNFLLNE QPDKAIDAFV EAVRIDPQTV ELHFALGSLF RRRGETDRAI RIHQLLVDRE DISDEHRLQA LGELGQDFLK AGLLDRAEAA FLRLRGTRAN DVALRYLLEI YQQEKDWAKA IEVAEALPGH EGVMWHTEVA NFHCELAATA LANSRHDEAR GHLDRAFEVN RRCVRASLLL GDLHAAQGRD EEALEAWQRI ENQDPNYLAL VAERVMDACG RLGRVAQGHQ LLRAWLAGHA SLDLLDELFH WELEREGPKA AYEMVREELR RNPTLLGLDK LLEAAALNAP AEQRADIDLI KQLIHGHTRR VARYRCNTCG FKARQFHWRC PACGGWETYP PRRTEEFDLT P
|
| |