Gene Tmz1t_2307 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_2307 
Symbol 
ID7085294 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp2599321 
End bp2600466 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content71% 
IMG OID643699328 
Productbeta-hexosaminidase 
Protein accessionYP_002355942 
Protein GI217970708 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0826435 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCCAC CCATCCGCCG CCCACGCGGC CCCGTCATGA TCGATGTCGC CGGCACCGCG 
CTCACCGACG AGGAGCGCGA ACGCCTGCGC GACCCTCTGG TCGGCGGGGT GATCCTGTTC
GCGCGCAACT ACACCGGCTC CGAGCAGCTG CGCGCGCTCA CCGCCGAGAT CCGCGGGCTG
CGCGACCCGG CGCTGATCAT CGCGGTCGAC CACGAAGGCG GCAGGGTGCA GCGCTTCCGC
ACCGACGGCT TCACCCGCCT GCCGTCGATG CGCAGCCTCG GCGCCTTGTG GGCGCAGGAC
CATCTGGTGG CGCTCGACGC GGCGCGCGCC ACCGGCGTCG TGCTCGCCGC CGAGCTGCGC
GCGCACGGGG TCGACCTGAG CTTCACCCCG GTGCTGGATC TCGACTACGG CTGCTGCCGC
GCGATCGGCA ACCGCGCCTT CCATCGCGAT CCGCAGGTGG TCGCGGCGCT CGCGCAGGCG
CTGTGCGCCG GCATGGCGGA GGCGGGCATG GGCTGCGTGG GCAAGCACTT CCCCGGCCAC
GGCTTCGTCG AGGCCGACTC GCACCACGAC GTGCCGGTGG ACGAGCGCGA CTTCGACACG
GTTTGGAACG AGGACATCGC CCCCTACCGC CATCGTCTCG GCCGCCAGCT CGCCGGCGTC
ATGCCCGCCC ACGTCGTCTA CCCCAACGCC GACCCCAGCC CCGAACCGCA GCCCGCAGGC
TTCTCGCCGT TCTGGCTGAA GGAGGTGCTG CGCGACCGCC TCGGCTTCCA GTGGGTGATC
TTCAGCGACG ATCTCAACAT GGAAGGCGCC CGCGTCGCCG GTGACATCGT CGGCCGTGCG
AAAGCAGCCT ACGCGGCCGG CTGCGACATG CTGCTGGTGT GCAATCGACC TGACCTCGCG
GCCGAGCTGC TGGATCGCTG GGCGCCGGAC CTGGACGCCG GCAACCTGGC CCGACTCGCC
GCGATCTTGC CGGACACGGC CAGGCCAGCC TGGCTTGCCG ACCCCTTCGC ACTCGAACTG
CACGCCCCCT ACCTCCGGGC CCGCGAGCAC CTCGCGTCCA TTCCCGAGGA CAAGAGCGCC
GCGCCAACCA TGACCGCCGC CACCATCGGT GAGCAACGTA CCGAAGTCCT GCGCAAGGAA
GGATAA
 
Protein sequence
MNPPIRRPRG PVMIDVAGTA LTDEERERLR DPLVGGVILF ARNYTGSEQL RALTAEIRGL 
RDPALIIAVD HEGGRVQRFR TDGFTRLPSM RSLGALWAQD HLVALDAARA TGVVLAAELR
AHGVDLSFTP VLDLDYGCCR AIGNRAFHRD PQVVAALAQA LCAGMAEAGM GCVGKHFPGH
GFVEADSHHD VPVDERDFDT VWNEDIAPYR HRLGRQLAGV MPAHVVYPNA DPSPEPQPAG
FSPFWLKEVL RDRLGFQWVI FSDDLNMEGA RVAGDIVGRA KAAYAAGCDM LLVCNRPDLA
AELLDRWAPD LDAGNLARLA AILPDTARPA WLADPFALEL HAPYLRAREH LASIPEDKSA
APTMTAATIG EQRTEVLRKE G