Gene Ccel_3243 details

Gene Information

Locus tag: Ccel_3243
Symbol:
ID: 7312467
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 3787580
End bp: 3789829
Gene Length: 2250 bp
Protein Length: 749 aa
Translation table: 11
GC content: 41%
IMG OID: 643610144
Product: glycoside hydrolase family 65 central catalytic
Protein accession: YP_002507512
Protein GI: 220930603
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1554] Trehalose and maltose hydrolases (possible phosphorylases)
TIGRFAM ID:
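
The coordinates and lengths listed above are mutually consistent, which a short calculation can confirm. The sketch below assumes that the start and end positions are 1-based and inclusive and that the gene span includes the stop codon; neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3787580
end_bp = 3789829

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the span includes the terminal stop codon,
# which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # matches the reported Gene Length
print(protein_length, "aa")  # matches the reported Protein Length
```

Under these assumptions the script reproduces the reported 2250 bp and 749 aa.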


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTAAAA AAAATCAAAC ACATGTAGTA GACAATTTAT GGTGTATCGA AGAAAACAGG 
TTTAACCAAG ATACAAACAA GCATTTTGAA GGACTATTTA CTCAAGGCAA TGGCTATATG
CACGTAAGAG GAAGCTTTGA GGAAGGGCTA CGGGGGGCAG CACAAGATGA AGAGTATATG
CGTATGCCTG CAAATGTTAC CCTTGAACAG CCAAGGCATA CAAAAAGTAA ATGGGGGACC
TTTATACCGG GTATTGTGGG AAAGCACCCG CTTCTAAAAG AGGAAATAAT AAATCTTCCA
TATTTTCTGG GTATAGACTT AATCATAGCA GATGAAGAAT TGAATATGGA GGCAAGTACC
ATCAGGGAAT ACAAAAGATG GCTGGATTTG AGGGATGGCT GCCTTTATAG AAGTCTCATA
TGGGAAACGG AACAAGGGAT AGATCTTAAA CTTTGTTATA AACGTTTTGT CAGTATGGAA
GAGAAACATC TTTGTTTGCA GCAGGTTCAG ATAGAGGTGC TCTCCGGCGA GGGGATTCTA
GACATCAGAT GCGGTATAAA TCCGGAAGTC AGAACCAACG GCTTTAATCA TTTCAAAGAA
GTACTGACTG AGGATAATAA CTCTGAATAC ATTTGTACAA AGACTATTAC AGATGGAGGA
AACAGTGTTT TAATGCTGTC TGGTATTAGT GCTTCCGAGA ATATATGCTG GAGCAGGTTA
ACCCAAAACC AAAGAATATT TTTTTCTGGA TCAAGATTAT TAAAGGCCGG AGATTGCCTA
GAGATTCAGA AAGTAACGTC GGTTGTCACC GACAGGGATC TTGAGGAAGG GACTCTTATT
GAAAGAGGAG AGAAATATCT CAAATATTTC TTTAAAGAAG GCTGGGAGAA AATATATGAG
AGACATTCAA GGGCTTGGGA CAGTAAATGG CTTTACAGTG ATATTGAAAT AACCGGGGAT
GATGAAGCAC AGCTTGCCAT TCGTACATAC ATATATCATC TTATCAGGGC AAACTCTGAA
AATGATTCAA GAGTTGCCAT TTGTGCAAAG GGGTACGCCG GAGAGGCATA CTTCGGACAT
TATTTCTGGG ATACGGAAAT AAATATGTTG CCCTTCTTTA TTCATACCAA CCCTCAAGCT
GCAAGAAACT TATTAATGTA TAGATACAAT ACTTTGGAAG GTGCACGGAA GAACGCTCTG
GAGAATGGCT ACGAAGGAGC AAAGTATGCT TGGGAGGCAT CGGTTACGGG AGAGGAGCAA
TGTCCCTGTT GGCAGTATGC AGATCATGAA ATACATATTA CAGCAGATAT TATATACGCC
ATGTACCATT ATGTAAACTC TACGGAAGAC ATTGCTTTTG TCAGAAACTT CGGTGTCGAC
ATGATGGTTG AAACTGCAAG ATATTGGGTG CAAAGGGTTG ACAGAAACAA AGACGGATAT
TACGAGCTGT TAGGTGTCAT GGGTCCTGAC GAGTACCTCC CCATTACAAG GAACAATGCA
TTTACTAACC GTATGGTTAA ATTCAGCTTG GAAAAAACAG CAGACTTACT CCAAAAAATT
AAGCAGGAAG ATGCAAAAGG ATATCTGGAA ATTGAAAACC GACTCGGTTT AAAGGAATTG
GAGATAGAAA AGATCAAGGA AGTCGCAGAA AAGCTAATTC TCCCATATGA CGAAAATACA
GATATCGTTC CGCAGTCGGA GGATTTTGAA AGCTATGCAG ATGTTGACTT TAATGCTATT
TGGAAGGATA GGACCAAGCC CTTTGGAAAC TTTATATCAC AGGAGAAAAA CTACCGTTCC
AAAGCTTTAA AACAGGCTGA TGTTCTTGAA CTAATGCTGC TTTACCCCGA TGATTTCACC
CGGGAACAGT TAACTAATGC CTATGACTAT TACGAACCAA TAACAACCCA TGACTCATCA
CTTTCAGCAT CAGTTCACGG AATTGTGGCT GCATGGATGG GAAGGATGCC GGAAGCGGAA
AAATTCCTTA AAAAGGTTAT GGATATAGAT ATGTCCGAGG AGAAGAAAGG GGCGGCGGAA
GGTATACACA TAGCTAACTG CGGAGGTTTG TGGCAGATGA TTGTATATGG CTTTGCCGGC
CTTAAAAGTG CAATGTGGTG TGATGAAATA CAATTAGCAC CGCACCTTCC TGACAAATGG
ACCAAGCTGG AATTTACACT TGCATGGCAT GGGAAAAGGT ATAGGATTAC TGTTACAAAA
GAAAAGCATG AGATTACTGA ATTATATTGA
 
Protein sequence
MSKKNQTHVV DNLWCIEENR FNQDTNKHFE GLFTQGNGYM HVRGSFEEGL RGAAQDEEYM 
RMPANVTLEQ PRHTKSKWGT FIPGIVGKHP LLKEEIINLP YFLGIDLIIA DEELNMEAST
IREYKRWLDL RDGCLYRSLI WETEQGIDLK LCYKRFVSME EKHLCLQQVQ IEVLSGEGIL
DIRCGINPEV RTNGFNHFKE VLTEDNNSEY ICTKTITDGG NSVLMLSGIS ASENICWSRL
TQNQRIFFSG SRLLKAGDCL EIQKVTSVVT DRDLEEGTLI ERGEKYLKYF FKEGWEKIYE
RHSRAWDSKW LYSDIEITGD DEAQLAIRTY IYHLIRANSE NDSRVAICAK GYAGEAYFGH
YFWDTEINML PFFIHTNPQA ARNLLMYRYN TLEGARKNAL ENGYEGAKYA WEASVTGEEQ
CPCWQYADHE IHITADIIYA MYHYVNSTED IAFVRNFGVD MMVETARYWV QRVDRNKDGY
YELLGVMGPD EYLPITRNNA FTNRMVKFSL EKTADLLQKI KQEDAKGYLE IENRLGLKEL
EIEKIKEVAE KLILPYDENT DIVPQSEDFE SYADVDFNAI WKDRTKPFGN FISQEKNYRS
KALKQADVLE LMLLYPDDFT REQLTNAYDY YEPITTHDSS LSASVHGIVA AWMGRMPEAE
KFLKKVMDID MSEEKKGAAE GIHIANCGGL WQMIVYGFAG LKSAMWCDEI QLAPHLPDKW
TKLEFTLAWH GKRYRITVTK EKHEITELY
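
The relationship between the gene sequence and the protein sequence can be checked by translating the nucleotides codon by codon. The following is a minimal sketch, not the database's own method: the function names `translate` and `gc_content` are hypothetical, and the codon table is the standard one, whose amino acid assignments translation table 11 shares (table 11 differs in the start codons it permits). Only the first and last 30 bases of the gene are used here to keep the example short.

```python
from itertools import product

# Standard codon assignments, bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a nucleotide sequence in frame 1; '*' marks a stop codon."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last 30 bases of the gene sequence, copied from the record.
first_block = "ATGAGTAAAA AAAATCAAAC ACATGTAGTA"
last_block = "GAAAAGCATG AGATTACTGA ATTATATTGA"

print(translate(first_block))  # MSKKNQTHVV
print(translate(last_block))   # EKHEITELY*
```

The two printed fragments agree with the start and end of the protein sequence, with the trailing `*` standing for the TGA stop codon. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content reported in the Gene Information table.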