Gene Information |
Locus tag | Tmz1t_2307 |
Symbol | |
ID | 7085294 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2599321 |
End bp | 2600466 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643699328 |
Product | beta-hexosaminidase |
Protein accession | YP_002355942 |
Protein GI | 217970708 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
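The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2599321
end_bp = 2600466

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # prints: 1146 381
```

Under these assumptions the result agrees with the listed Gene Length (1146 bp) and Protein Length (381 aa).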
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0826435 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAATCCAC CCATCCGCCG CCCACGCGGC CCCGTCATGA TCGATGTCGC CGGCACCGCG CTCACCGACG AGGAGCGCGA ACGCCTGCGC GACCCTCTGG TCGGCGGGGT GATCCTGTTC GCGCGCAACT ACACCGGCTC CGAGCAGCTG CGCGCGCTCA CCGCCGAGAT CCGCGGGCTG CGCGACCCGG CGCTGATCAT CGCGGTCGAC CACGAAGGCG GCAGGGTGCA GCGCTTCCGC ACCGACGGCT TCACCCGCCT GCCGTCGATG CGCAGCCTCG GCGCCTTGTG GGCGCAGGAC CATCTGGTGG CGCTCGACGC GGCGCGCGCC ACCGGCGTCG TGCTCGCCGC CGAGCTGCGC GCGCACGGGG TCGACCTGAG CTTCACCCCG GTGCTGGATC TCGACTACGG CTGCTGCCGC GCGATCGGCA ACCGCGCCTT CCATCGCGAT CCGCAGGTGG TCGCGGCGCT CGCGCAGGCG CTGTGCGCCG GCATGGCGGA GGCGGGCATG GGCTGCGTGG GCAAGCACTT CCCCGGCCAC GGCTTCGTCG AGGCCGACTC GCACCACGAC GTGCCGGTGG ACGAGCGCGA CTTCGACACG GTTTGGAACG AGGACATCGC CCCCTACCGC CATCGTCTCG GCCGCCAGCT CGCCGGCGTC ATGCCCGCCC ACGTCGTCTA CCCCAACGCC GACCCCAGCC CCGAACCGCA GCCCGCAGGC TTCTCGCCGT TCTGGCTGAA GGAGGTGCTG CGCGACCGCC TCGGCTTCCA GTGGGTGATC TTCAGCGACG ATCTCAACAT GGAAGGCGCC CGCGTCGCCG GTGACATCGT CGGCCGTGCG AAAGCAGCCT ACGCGGCCGG CTGCGACATG CTGCTGGTGT GCAATCGACC TGACCTCGCG GCCGAGCTGC TGGATCGCTG GGCGCCGGAC CTGGACGCCG GCAACCTGGC CCGACTCGCC GCGATCTTGC CGGACACGGC CAGGCCAGCC TGGCTTGCCG ACCCCTTCGC ACTCGAACTG CACGCCCCCT ACCTCCGGGC CCGCGAGCAC CTCGCGTCCA TTCCCGAGGA CAAGAGCGCC GCGCCAACCA TGACCGCCGC CACCATCGGT GAGCAACGTA CCGAAGTCCT GCGCAAGGAA GGATAA
Protein sequence | MNPPIRRPRG PVMIDVAGTA LTDEERERLR DPLVGGVILF ARNYTGSEQL RALTAEIRGL RDPALIIAVD HEGGRVQRFR TDGFTRLPSM RSLGALWAQD HLVALDAARA TGVVLAAELR AHGVDLSFTP VLDLDYGCCR AIGNRAFHRD PQVVAALAQA LCAGMAEAGM GCVGKHFPGH GFVEADSHHD VPVDERDFDT VWNEDIAPYR HRLGRQLAGV MPAHVVYPNA DPSPEPQPAG FSPFWLKEVL RDRLGFQWVI FSDDLNMEGA RVAGDIVGRA KAAYAAGCDM LLVCNRPDLA AELLDRWAPD LDAGNLARLA AILPDTARPA WLADPFALEL HAPYLRAREH LASIPEDKSA APTMTAATIG EQRTEVLRKE G
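The sequences above are printed in space-separated blocks of ten characters, so the spaces must be removed before any analysis. As a minimal sketch, the helpers below strip the grouping and compute a GC fraction that can be compared with the GC content field of the record. The function names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first three blocks of the gene sequence.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace used to group bases into blocks of ten."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three blocks of the gene sequence, for illustration only.
example = clean_sequence("ATGAATCCAC CCATCCGCCG CCCACGCGGC")
print(len(example), round(gc_fraction(example), 2))  # prints: 30 0.7
```

Passing the full Gene sequence field through the same two functions gives a value to compare against the 71% listed in the record.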