Gene Dtur_0222 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtur_0222 
Symbol 
ID7082407 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDictyoglomus turgidum DSM 6724 
KingdomBacteria 
Replicon accessionNC_011661 
Strand
Start bp219310 
End bp221025 
Gene Length1716 bp 
Protein Length571 aa 
Translation table11 
GC content34% 
IMG OID643457338 
ProductGlycoside hydrolase, family 20, catalytic core 
Protein accessionYP_002352165 
Protein GI217966659 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3525] N-acetyl-beta-hexosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAAAA TATTTATTGT TCCGGAGCCA AAAAGGTTAG AATTTATGGG AAAATGGCTT 
GAGTTTAAGG GCTTTGAAAA CTTTCCAGAA TTTTTATCTC AAGAATTTCA CATCCCTAAA
GGAAGCTTAA GAATAAGAAA AATTGAAAAA AAAGGAAATG GAATAGAAAT CAAAGAAGAT
GAGGTAATAA TATGGGGAGA CGAAAATATA GCTTATGCTA CTATTCTTCA ACTTCTAATG
CAAAATCCCA ACAAGCTTCC TAAGGTAATA ATAGAAGAAG AATTTTCCTT TAAATTTCGT
GGCTATCATT TAGATATTGC CAGGGGTGGA GTTCCTCACT TAAGAGAATT CAAAAGAATC
TTAAAGTGGC TTTTTATTCT AAAATACAAC TACTTCGCCA TTTATTTTGA AGATCTCTTT
CCTTGGAAAA AATATCCTGA GATAGGGGCC CTAAGGGGAA GACTTACAGA AGAGGAAATA
AGAGAAATCA TTAACTATGG AAGAAGGTTA AACATAGAAG TATTTCCTTC CCTTGAGCTC
TGTGGACATA TGGAAAATAT ATTAGTACTC CCTAATTTTA TGAAATTTAG TGAATGGCAC
AGACCCGATG AAGGCTGTAT AGATGTATCC AATGATGAGG CAAGAAAATT TACCTATGAA
CTCTTAGAAG AAGTAATAAA CTTTTTCCCA TCAAAATATG TCCATATAGG TGGCGATGAA
ACCTGGGCAC TTGGAAGAGG AAGAAGTCTT GACAAAGAAG GAATATTCAA GGGACCAGAG
CTTTTTGAGA TGTATCACAG AAATTTAATC TATAAGGTGA AAGAGAGTGG AAAAATCCCT
ATGGTATGGG GAGATATGCT AACAGGTATG TATTTAAGAG AAGAGGAAAA AGAAAGATGG
AGAATTGTTT TAGAAAGTAA TATTTGGGAT GAAACGGTAA TAGCAAATTG GGATTATACT
CACCTTCCTC AAGACCATTT CCAAAATAAA ATAAATATGT TTGGAAAGAG AAAAGAAAAA
GAGCTTGCAT GTCCAGGACT TTCCAATTGG AATAGATTTT ATCCTAACTT TGACATTGCT
CTTACCAATA TTACTAACTT CTTGATCCCT GCAAGAAAAG AGAAACTTCT TGGTTTTCTA
CTTACTTCTT GGGGAGATGA TGGAGCAGAA TGTTTATACT CCTTCTTAGA TCCTCTTATC
CTTGCCACCA TGGAGATCGC AGAGGGTAAT GGAAATTGGG AAGAAAAATG GCTTGCACTG
AAAAGAGAAG ATAAAGAAGT TCTTGAAGTA AGAAAAACTT TAGGGCAAAA TGATATAGCA
GAAACCATTA AGCATGTGTT TCTTGGAGAT CAAATATATA GATATGCTAC TGAAATATTA
AAAGACAAAG AAAGAAAATC CACTGGAGAT TTTTGGGCTG ACTATTACTT GGGAATAACA
AGTCTTCTTT CTAATAAGGA AAAACTAAAG GATAAATATG AAGAGGTTTT GAATGCCGTC
TCTCATGTAA ATCTACCTGA AGATTTATCC CTTATAAGAG ATATGCTAAA GATCTCCCTA
AATAGAGTTA AGGGAAGATT AAAATTTTCT GATTTTATAA GTTTTGGCAA TAAGTATGCA
GAGCTTTGGC TTTCTGAAAG AAAGAAGGAG AATCTTGAAA AGGTAATCAA TAAAATCTAT
GGGGCAGGAG GAAGAGCAGA CTTAGAAATC TATTAA
 
Protein sequence
MEKIFIVPEP KRLEFMGKWL EFKGFENFPE FLSQEFHIPK GSLRIRKIEK KGNGIEIKED 
EVIIWGDENI AYATILQLLM QNPNKLPKVI IEEEFSFKFR GYHLDIARGG VPHLREFKRI
LKWLFILKYN YFAIYFEDLF PWKKYPEIGA LRGRLTEEEI REIINYGRRL NIEVFPSLEL
CGHMENILVL PNFMKFSEWH RPDEGCIDVS NDEARKFTYE LLEEVINFFP SKYVHIGGDE
TWALGRGRSL DKEGIFKGPE LFEMYHRNLI YKVKESGKIP MVWGDMLTGM YLREEEKERW
RIVLESNIWD ETVIANWDYT HLPQDHFQNK INMFGKRKEK ELACPGLSNW NRFYPNFDIA
LTNITNFLIP ARKEKLLGFL LTSWGDDGAE CLYSFLDPLI LATMEIAEGN GNWEEKWLAL
KREDKEVLEV RKTLGQNDIA ETIKHVFLGD QIYRYATEIL KDKERKSTGD FWADYYLGIT
SLLSNKEKLK DKYEEVLNAV SHVNLPEDLS LIRDMLKISL NRVKGRLKFS DFISFGNKYA
ELWLSERKKE NLEKVINKIY GAGGRADLEI Y