Gene Dtur_1586 details


Gene Information

Locus tag: Dtur_1586
Symbol:
ID: 7082021
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dictyoglomus turgidum DSM 6724
Kingdom: Bacteria
Replicon accession: NC_011661
Strand:
Start bp: 1598566
End bp: 1600203
Gene Length: 1638 bp
Protein Length: 545 aa
Translation table: 11
GC content: 33%
IMG OID: 643458694
Product: Cellulase
Protein accession: YP_002353473
Protein GI: 217967967
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2730] Endoglucanase
TIGRFAM ID:
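
The length fields can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the protein length excludes a single terminal stop codon; the record states neither convention, and the helper names are illustrative only.

```python
# Coordinates copied from the record above.
START_BP = 1598566
END_BP = 1600203


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming one terminal stop codon that is not counted."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1638
print(protein_length(length_bp))  # 545
```

Under these assumptions both values agree with the Gene Length and Protein Length fields.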


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGGAAG TATTAATTAT AGTTTTGTTA ATTCTTATCT GTCTTGGGTC AGGCAAACTG 
TATGCTATCA CCTATAAGGG TAGTTTCCAT TATATAGATC CCCATAAGGC AATTGAAGAA
ATGGATCCTG GTTGGAATCT GGGTAATGCT TTAGATGCCA TACCTGATGA AACTTCATGG
GGAAATCCCA GGACTGAGGA ATATGTTTTT GAGGATATAA AGAAAACTGG TTTCAAAAGT
GTGAGAATTC CTGTTACATG GATTGATCAT ATGGGACCTG CTCCAAATTA TAGGGTTGAT
GAAGATTGGA TGAATAGAGT AGAGGAAGTA GTAAATATGG CTCTAAGGAA GGATCTATAC
GTAATAATTA ATGTACATCA TGACTCTTGG CGGTGGCTGA GTAAGGAGAT GAAATCGGAC
AAAGAAGCCA CGATTGATAA ACTTGAGAAA TTGTGGCTTC AAATAGCAGA CAGATTTAAG
GATTATTCAG AAAAGCTAAT TTTTGAGATT ATTAATGAAC CAACGTATGA AGGATTTTCT
GAGGAAGAGG CAGGTGCTCT GCAAAATGAA GTAAATGAAA GGATTTTAAA GGTGATAAGA
AATTCTGGAG GATATAATGA TAAAAGACTG GTAGTAGTCC CTCCATTGTG GACTGACACA
TATAAAGCAG AAAAATATTT TGTGCCTCCA AAGGATCCTA ATATTATAAT AGGTATACAT
TATTACTCTC CCTGGGATTT TACCGCTAAC TGGTGGGGAA GAAAAAGCTG GGGCACAGAG
AGAGATAAGG AACAGATGGA TAAAGATATA AAAATTGTAA AGGAAAAGTT TCCTACTTAC
GCTTTTATCA TTGGGGAATA TGGGCTTTTT AATGGTAATA AACCTGCTGA ATGGTACTAT
TTTGATAATC TTATTAGAGT TGCTAAGAAG TACAAGATGG CAACTTTTTA CTGGGATAAT
GGGGAGAATT ATGATCGTAG GAATAGGATT TGGAGAGATG AACAAGCAAT AAAAGTTATT
ATAAATGCTT CTTATGGCAA AAGAAATGCT TTTTTAAATC CAGGGGTCTT ATATGTTAGA
GAAAATAAAA TAAAGGACGA GAATATAGAA ATAGAACTAA ATGGAAATTC TTTGATTGGA
ATATATTTAA ATGGAAAAAC TTTACAGCAA GGTAAAGATT ATATTCAAGA GTCTCCAGAC
AAGGTAATTC TTAAAAGGGA ATTTTTAGAA AATATTGTTA AACCACAAAA TTATGGTATA
TTGGCTACCC TTAACTTTAG GTTTACGGAA AGTGCAGACT ATCCTTTAAG TATAATACAG
TATAAAGATC CCATGCTTCT TGACAAACCT TTTAATATTA TAAGGGGGGT TCCTATAGAT
TTAAGATTTA TTATAGCTTT TAATGGTACA AAGCTTTGTG CTATAAAAAT TTTTGATGCA
GAAACAGGAA GACCCATAAG AGATTCATGG ACACCGTATT TAAGAGGATG GGATGATTTT
TCTGTGGTTG ATTCTCAGGT GGTTGTAAAG AAGCACGTTT TTGAGAACCT ATCAAAGTAT
CAGAATATAA AGAATATTAA AATAATTTTT GAATTTTTCC CAGGAATTTC TCTTGAAACA
GTTGTTAAGG TAATTTAA
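
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before any computation. A minimal sketch of that cleanup and of a GC-content calculation follows; the function names are illustrative, and how the database derived its own GC content figure is not stated in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above; paste all lines for the whole-gene value.
first_line = "ATGAAGGAAG TATTAATTAT AGTTTTGTTA ATTCTTATCT GTCTTGGGTC AGGCAAACTG"
print(len(clean_sequence(first_line)))
print(round(gc_content(first_line), 1))
```

Passing the complete sequence to `gc_content` gives a value that can be compared with the GC content field.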
 
Protein sequence
MKEVLIIVLL ILICLGSGKL YAITYKGSFH YIDPHKAIEE MDPGWNLGNA LDAIPDETSW 
GNPRTEEYVF EDIKKTGFKS VRIPVTWIDH MGPAPNYRVD EDWMNRVEEV VNMALRKDLY
VIINVHHDSW RWLSKEMKSD KEATIDKLEK LWLQIADRFK DYSEKLIFEI INEPTYEGFS
EEEAGALQNE VNERILKVIR NSGGYNDKRL VVVPPLWTDT YKAEKYFVPP KDPNIIIGIH
YYSPWDFTAN WWGRKSWGTE RDKEQMDKDI KIVKEKFPTY AFIIGEYGLF NGNKPAEWYY
FDNLIRVAKK YKMATFYWDN GENYDRRNRI WRDEQAIKVI INASYGKRNA FLNPGVLYVR
ENKIKDENIE IELNGNSLIG IYLNGKTLQQ GKDYIQESPD KVILKREFLE NIVKPQNYGI
LATLNFRFTE SADYPLSIIQ YKDPMLLDKP FNIIRGVPID LRFIIAFNGT KLCAIKIFDA
ETGRPIRDSW TPYLRGWDDF SVVDSQVVVK KHVFENLSKY QNIKNIKIIF EFFPGISLET
VVKVI
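
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a hand-rolled translator, not the database's own tool. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons), and it assumes the sequence is given in coding orientation starting at the first base of the start codon.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# These assignments are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw_dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(raw_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence above.
print(translate("ATGAAGGAAG TATTAATTAT AGTTTTGTTA ATTCTTATCT GTCTTGGGTC AGGCAAACTG"))
# MKEVLIIVLLILICLGSGKL
print(translate("GTTGTTAAGG TAATTTAA"))
# VVKVI
```

Both fragments match the start and end of the protein sequence listed above; translating the full gene sequence the same way allows a residue-by-residue comparison.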