Gene Dtur_0219 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtur_0219 
Symbol 
ID7082404 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDictyoglomus turgidum DSM 6724 
KingdomBacteria 
Replicon accessionNC_011661 
Strand
Start bp215461 
End bp217710 
Gene Length2250 bp 
Protein Length749 aa 
Translation table11 
GC content40% 
IMG OID643457335 
Productglycoside hydrolase family 3 domain protein 
Protein accessionYP_002352162 
Protein GI217966656 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGTGG ATATAAAAAA GCTCATATCT CAAATGACCC TTGAGGAAAA GGCAAGCCTT 
TGTTCTGGGC TTGACTTTTG GCATACAAAA CCCATTGAAA GACTTGGAAT ACCATCTATA
AGAATGTCGG ATGGTCCTCA TGGACTTAGA AAAGAAGAAA CCATGTTTAG TAAGACTGTT
CCTGCTACAT GCTTCCCTAC TGCTGTAACT ATTGCAGCAT CCTGGGATAA ATCCCTTGCT
GAGAAGATGG GAAAGGCTAT TGGCGAAGAA TGCCAGGCTG AAAATGTACA AATACTCCTT
GGACCTGGAG TTAACATAAA AAGATCTCCC CTTTGTGGAA GAAACTTTGA ATACTACTCT
GAAGATCCAC TCCTTGCAGG AGAACTTGCA GCCCATTTTA TTAAGGGAGT ACAAAGTGAG
GGAGTAGGAA CTTCTCTTAA ACATTTTGCA GCAAACAACC AAGAACATAG AAGGCTTACT
GTAAATGCCA TCATTGACGA GAGAACCTTG AGAGAGATTT ATCTTTCTGC CTTTGAGAGG
GCAGTAAAAG AGGCAAAACC ATGGACCGTT ATGTGCTCAT ACAATAGAGT AAATGGGACT
TATGCATCAG AGAATGAATT TCTTCTCACA AAGGTATTGA AAGAAGAATG GGGATTTGAA
GGATTTGTAG TATCCGATTG GGGAGCAGTA AATGATAGGG TAATGGGACT TTCTGCAGGA
CTTGATCTTC AAATGCCTTA TGATGGTGGA TATGGAGACA AAAAGATCAT TGAGGCAGTA
AAAAGTGGAA AAATTCCTGA GGAGGTTCTT GATAGGGCTG TAGAAAGAAT TCTTAGGATT
GTATTCAAGG CAATAGAAAA CAAAAAAGAA AATGCCACCT ATGACAAGGA AACTCATCAT
AAAATTGCAA GAGAGATTGC AAGGGAATGT TTTGTGCTTC TCAAAAATGA AGGGGACATT
CTTCCACTGA AAAAAGAAGG AAAGATTGCA TTAATAGGAG CCTTTGCTAA GAACCCTCAA
ATACAAGGAG GAGGAAGTGC CCATGTAAAT CCAACCATGG TAGATGATGC AGTAGAAGAG
ATAAGAAAGA TGGTAGAAGG AAAGGCAGAG ATTCTTTATG CCGATGGATA TCATATAGAA
AAGGATGAGG TAGATGAAAG ACTTATAGAG GAGGCAAAAG AGGTTGCAAA GGTAGCGGAT
GTGGTGGTAA TTTTTGCAGG ACTTCCCGAA AGATACGAAT CGGAAGGCTT TGACAGGCCT
CACATGAAGA TGCCTGAAAG CCACAATAGG CTAATAGAAG AAATCTCAAA GGTAAATGAA
AATCTTGTGG TGGTACTTAG CAATGGAGCA CCTATAGAGA TGCCATGGTT AGAGAAGCCA
AAGGCAATCC TTGAGACCTA CAGAGGAGGA CAAGCCTGGG GAGGAGCAGT TGCAGATGTT
CTTTTTGGAG TAGTAAGTCC TTCTGGAAAG CTTCCAGAGA CCTTTCCAAA GAAGCTAAGT
GATAATCCGT CTTACTTATT CTTCCCAGGG GAGGATGACC GAGTAGAGTA CAGGGAAGGG
ATATTTGTAG GATATAGGTA TTATGATAAG AAGGAGATGG AAGTATTGTT TCCCTTTGGA
TATGGTCTTT CCTATACTAC CTTTGAGTAT TCAGACTTAA AGCTTGACAA GAAAGAGATG
AGAGATGATG AAGTACTTAA GGTAAGTGTG AAGGTAAAGA ATACGGGGAA GGTAAAAGGT
AAAGAAGTAG TGCAGTTATA TGTGAGGGAT GTGGAAAGTA GCTATATAAG GCCTGAGAAG
GAGCTAAAGG GATTTGAGAA GGTAGAGCTT GAGCCAGGAG AAGAAAAGGA AGTAGTGTTT
TATCTTGATA AGAGGGCTTT TGCCTTCTAT AACATAGACA TAAAGGATTG GTATGTGGAG
GATGGAGATT TTGAGATATT AATTGGGAAG TCATCAAGGG ATATTGTGCT AAGGGATAAA
GTATTTGTTA AGTCTACCAC AAAGATAAAG AGAACTTACC ATATTAATTC TACTATTGGG
GATATTATGA GGGACCCTAT AGCATGGGAG AAGTTTAAGG ATATACTACA GCAGTTTGCA
AGTGCCTTTC CTGCCTTTTC ATCGGAAGAA GCAATCATGA ACTTTGCAGA GATGATGAAA
TACATGCCTC TTCGAAGCCT TATTCACTTT GGTCAAGGAA AAATCACACC AGAAATAGTA
GAAAACCTAC TGAGAGAACT TAATAGCTAA
 
Protein sequence
MSVDIKKLIS QMTLEEKASL CSGLDFWHTK PIERLGIPSI RMSDGPHGLR KEETMFSKTV 
PATCFPTAVT IAASWDKSLA EKMGKAIGEE CQAENVQILL GPGVNIKRSP LCGRNFEYYS
EDPLLAGELA AHFIKGVQSE GVGTSLKHFA ANNQEHRRLT VNAIIDERTL REIYLSAFER
AVKEAKPWTV MCSYNRVNGT YASENEFLLT KVLKEEWGFE GFVVSDWGAV NDRVMGLSAG
LDLQMPYDGG YGDKKIIEAV KSGKIPEEVL DRAVERILRI VFKAIENKKE NATYDKETHH
KIAREIAREC FVLLKNEGDI LPLKKEGKIA LIGAFAKNPQ IQGGGSAHVN PTMVDDAVEE
IRKMVEGKAE ILYADGYHIE KDEVDERLIE EAKEVAKVAD VVVIFAGLPE RYESEGFDRP
HMKMPESHNR LIEEISKVNE NLVVVLSNGA PIEMPWLEKP KAILETYRGG QAWGGAVADV
LFGVVSPSGK LPETFPKKLS DNPSYLFFPG EDDRVEYREG IFVGYRYYDK KEMEVLFPFG
YGLSYTTFEY SDLKLDKKEM RDDEVLKVSV KVKNTGKVKG KEVVQLYVRD VESSYIRPEK
ELKGFEKVEL EPGEEKEVVF YLDKRAFAFY NIDIKDWYVE DGDFEILIGK SSRDIVLRDK
VFVKSTTKIK RTYHINSTIG DIMRDPIAWE KFKDILQQFA SAFPAFSSEE AIMNFAEMMK
YMPLRSLIHF GQGKITPEIV ENLLRELNS