Gene Dtur_0857 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtur_0857 
Symbol 
ID7082810 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDictyoglomus turgidum DSM 6724 
KingdomBacteria 
Replicon accessionNC_011661 
Strand
Start bp882019 
End bp883878 
Gene Length1860 bp 
Protein Length619 aa 
Translation table11 
GC content34% 
IMG OID643457931 
ProductArabinogalactan endo-1,4-beta-galactosidase 
Protein accessionYP_002352752 
Protein GI217967246 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3867] Arabinogalactan endo-1,4-beta-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGAAGA AAAAAGTATT GTTAATATTA TTTTTACTGG GGGTTTTTAT GACAACTTCA 
GGATATGGAG AGAGTAAAGT GTTAGTAAAT CCCATAGAAA ATATATCGAA GGATTTTATT
AAGGGTGTTG ATATATCAAT GGTATATGAG GTTGAGAAAA ATGGAGGAAA ATACTTTGAT
AATGGGGTTC AAAAAGATCC ACTTCAGATA TTAAAAGATC ACGGTGTTAA TTGGGTAAGG
GTTAGAATAT GGAATGATCC ATATGATGAA AAAGGAAATC CCTATGGAGG AGGAAACTGC
GATTATAAAA ATATGACTGA GCTTGCCAAA AGAGCAAAAT CTATTGGTTT AAAAGTGCTT
GTAGATTTTC ATTACAGTGA TTTTTGGGCT GATCCTGGTA AACAGGAAAA ACCAAAAGCA
TGGAAAAATT TAAAAGGTAA GGCTCTTGAG AAAGCAGTTT CTGATTTTAC TTATCAAGTT
GTTAAATACA TGAAGGATAA CAAAGCTCTA CCAGATATGG TACAGATCGG TAATGAAGTA
AACAATGGAT TTTTGTGGCC AGATGGAAAA CTTGTTGGAG ATGATGCAGG GGGTTTTGAA
AATTTTGTCA AACTATTCAA CGCTGGAGTC AATGCTGTAA GAAGAGTAGA CAAGAATATT
AAAATAGCAG TACATCTTGC TGAAGGAGGA AATAGTGCCT TATTTAGATG GTTTTTTGGA
AATGTTTTAA GTTTAAAAAT GGATTTTGAT GTTATTGGAG TATCCTATTA TCCTTATTGG
CATGGTACCA TAGATGAGCT TAGGGAAAAC TTAAACAGTA TTGCTCTTTG GCTAAATAAG
GAAATAGCAA TTTTTGAGAC CGCCTATGCT TGGACCTTAG ATGATGCTGA CGGCCATCCA
AATATCTTTG GCGGAGATTT ATGGAAAATA GGAGGATATA AACCTACAAT ACAGGGGCAA
GCTACAGCCA TAAGGGATAT TATGGATGTG GTGGCTCATA TTCCAAATAA TAAGGGGATA
GGAATATTTT ATTGGGAAGG GTGTTGGATT CCTGTTAAAG GAGCAGGATG GAAACAAGGA
GAGGGTAATC CATGGGAAAA TCAAGCATTA TTTGATTTTA AAGGTAATAC TCTTCCATCC
TTAGATGTTT TTAATCTTGT TTATGGAAAA GAAAAGATAA CTCCTAATCC AATAGAGGTA
TTGTCAGAAG TAAATATAAA AGTTTCTACA GGAGAAATTC CAAATTTGCC TGAAAAAGCT
AAAGTGTTGT TTGATGATGA TTCAATTAGA AGTATTAAAA TTAAGTGGGA TAATATTGAT
CCCAAATTCC TTACAACTCC TGGAGAGTTT AAACTAAGAG GGATTATTGA GGGGATTCAG
AAAGAGATAT ATGCAAATAT TTATGTAAGT GGCCAGAAAA ACTATATACA AAATCCAAGT
TTTGAATCTG GTACCCTTTC TCCTTGGAAG GTTGAAGGAG ATATCTCGGC TGTAAAAGTT
GTAAAAGCAA CCCCACCTCA AAACGCAAAG TCAGGGGATT ATGCTCTTAA TTACTGGCTT
GATAAACCCT TTAAATTTGA ACTATACCAA GTTATTAAAA ATCTTAGTCC TGGTAAATAT
AAAGTAAGTT TTTGGATACA GGGTGGAGGT GGAGAAAATT TAATAAGATT TAAAGTAAGT
GGGTATGGTG GTGAAGATAA ATTTATTGAT ATAATTAATA CTGGTTGGCT AAATTGGAAG
AATCCTACTA TAAGTGATAT TGAAGTTACT ACAGGAGAAA TAAAAATAAG CATAATAGTA
GATGGAAATA CAGGTAATTG GGCTTGGATT GATGATTTTG AATTAAAAGA GCAAGAGTAA
 
Protein sequence
MGKKKVLLIL FLLGVFMTTS GYGESKVLVN PIENISKDFI KGVDISMVYE VEKNGGKYFD 
NGVQKDPLQI LKDHGVNWVR VRIWNDPYDE KGNPYGGGNC DYKNMTELAK RAKSIGLKVL
VDFHYSDFWA DPGKQEKPKA WKNLKGKALE KAVSDFTYQV VKYMKDNKAL PDMVQIGNEV
NNGFLWPDGK LVGDDAGGFE NFVKLFNAGV NAVRRVDKNI KIAVHLAEGG NSALFRWFFG
NVLSLKMDFD VIGVSYYPYW HGTIDELREN LNSIALWLNK EIAIFETAYA WTLDDADGHP
NIFGGDLWKI GGYKPTIQGQ ATAIRDIMDV VAHIPNNKGI GIFYWEGCWI PVKGAGWKQG
EGNPWENQAL FDFKGNTLPS LDVFNLVYGK EKITPNPIEV LSEVNIKVST GEIPNLPEKA
KVLFDDDSIR SIKIKWDNID PKFLTTPGEF KLRGIIEGIQ KEIYANIYVS GQKNYIQNPS
FESGTLSPWK VEGDISAVKV VKATPPQNAK SGDYALNYWL DKPFKFELYQ VIKNLSPGKY
KVSFWIQGGG GENLIRFKVS GYGGEDKFID IINTGWLNWK NPTISDIEVT TGEIKISIIV
DGNTGNWAWI DDFELKEQE