Gene Dtur_1495 details


Gene Information

Locus tag: Dtur_1495
Symbol:
ID: 7081930
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dictyoglomus turgidum DSM 6724
Kingdom: Bacteria
Replicon accession: NC_011661
Strand:
Start bp: 1505762
End bp: 1506823
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 36%
IMG OID: 643458604
Product: Cellulase
Protein accession: YP_002353383
Protein GI: 217967877
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1363] Cellulase M and related proteins
TIGRFAM ID:
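
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1505762
end_bp = 1506823
protein_length_aa = 353

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1
codons = gene_length_bp // 3

print(gene_length_bp)      # 1062, matching "Gene Length"
print(gene_length_bp % 3)  # 0, the CDS is a whole number of codons
print(codons - 1)          # 353, matching "Protein Length" once the stop codon is dropped
```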


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTGATC TATTAAATAT GCTCAAAGAA ATCACAGAAG CTCCTGGAGT ATCAGGATAT 
GAGAAAGGGA TAAGAGAAGT CTTAAAAAAA TATCTATCTG AAATTGCAGT ATTAGAAGAG
GATCGTCTTG GAAGCTTGAT ATTTAAAAAA CAAGGCTCTA AAGAAACTCC AAAAGTAATG
CTAGCAGCTC ATATGGATGA AATTGGATTC ATGGTAAAAA GCATAACTTC CAACGGCTTT
ATAAAATTCC TACCTCTTGG AGGCTGGTGG GATCAGGTTC TTTTATCTCA AAGGGTAATT
ATACATACCC AAAATGGTCC CATCGTAGGA GTAATAGGTT CTAAACCCCC TCATATATTA
TCAGAAGAGG AAAGAAAAAA AGTAGTAGAA AAGAAAGATA TGTATATTGA TATTGGAGCA
AATAGCGAGG AAGAAGCCTT AAATTGGGGA GTAAGACCTG GAGACCCTAT AACCCCTTAT
AGTGAATTCC AAGTAATGCA TAACCCTGAT TTTCTCTTAG CTAAGGCATG GGATGATAGG
GTGGGATGTG CCCTTCTTGT GGAAATAATA AAAGAACTGA AAAATATAGA TCATCCTAAC
ACCATATATG GAGCTGCTAC AGTCCAAGAG GAAATAGGAC TAAGAGGAGC AACCACATCC
TCTTTTGTAG TTAATCCTGA TGTCGCCATT ATATTAGAAT CTGATATCGC TACAGATGTA
CCAGGCATAA ACGAAGAAAA GAGGATCTAT TTAGGAAAAG GTCCTTCTAT AATAATTTAC
GATGCTACCA TGATTCCTAA CTCCAACTTA AGAAGAATAT TTATAGAAAC TGCTGAAAAA
CTAAATATAC CAATACAATA TTCTGCTCTT GAAAGAGGAG GAACCGATGG GGGAAGAATT
CACATTCATG CTAAAGGTGT ACCTTCAATA GTAGTTGGCG TGCCTGCAAG ATATATCCAT
TCTCATACCA GTATTATTAA TGTGAAAGAT TTCTTAAATG CCAAAAAACT GATAGTGGAA
GTTATCAAGA GTCTTAATAA GGAAATAGTA GAGAGTCTAT GA
 
Protein sequence
MSDLLNMLKE ITEAPGVSGY EKGIREVLKK YLSEIAVLEE DRLGSLIFKK QGSKETPKVM 
LAAHMDEIGF MVKSITSNGF IKFLPLGGWW DQVLLSQRVI IHTQNGPIVG VIGSKPPHIL
SEEERKKVVE KKDMYIDIGA NSEEEALNWG VRPGDPITPY SEFQVMHNPD FLLAKAWDDR
VGCALLVEII KELKNIDHPN TIYGAATVQE EIGLRGATTS SFVVNPDVAI ILESDIATDV
PGINEEKRIY LGKGPSIIIY DATMIPNSNL RRIFIETAEK LNIPIQYSAL ERGGTDGGRI
HIHAKGVPSI VVGVPARYIH SHTSIINVKD FLNAKKLIVE VIKSLNKEIV ESL
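
The gene and protein sequences above are related by codon translation, and the GC content field is a simple base count. The following is a minimal sketch of both, not the method used to build this record. It assumes the sequences are pasted as strings with the 10-character block spacing shown above. The helpers `clean`, `translate`, and `gc_percent` are hypothetical names introduced here. The codon-to-amino-acid mapping is the standard one, which translation table 11 shares for elongation codons (table 11 differs in its permitted start codons, which this sketch does not model).

```python
from itertools import product

# Standard codon-to-amino-acid mapping in NCBI "TCAG" order.
# Assumption: this is sufficient for table 11, whose elongation
# codons match the standard code; alternative starts are ignored.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGAGTGATC TATTAAATAT GCTCAAAGAA"
print(translate(first_blocks))  # MSDLLNMLKE
```

Passing the full gene sequence to `translate` should reproduce the protein sequence, and `gc_percent` on the same string can be compared against the GC content field.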