Gene Ccel_0649 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_0649 
Symbol 
ID7309514 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp750568 
End bp752166 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content37% 
IMG OID643607590 
Productcellulosome protein dockerin type I 
Protein accessionYP_002505010 
Protein GI220928101 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5520] O-Glycosyl hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA TCATTCGCTT ATTAGGTCTA ACTATGGTTT TGATGCTTGT ATTTACAATG 
GTATTACCAT TAAATCTTTA TGCAGCATCA ACTGTTACCG TGGATTGGGG TACCAATTAT
CAAACAATTG ATGGTTTTGG TGTTTCAGAA GCTTTTCATC AGTCAAATAA TATTGCTTTA
TTAGGAGATA CCAAGAAAAA GGAAATTTAT GACTTACTAT TTTCAACTAC AAAGGGGGCA
GGGTTTTCAA TATTCCGTTC TATACTTGGA GACGGAGGAA CATGGGGGAA TGCAACTGAC
GGACCAAATA AGACAATGCA GCCTTCTGAG ACAACTTGGG ACTGGAAAGA ATCAAATGAT
GACCAGATAT CTATGATTAG AGAGATACAG TCCGGCTACG GAATCAATAA AATTCTTTAC
ACTGTATGGA GTCCGCCTGC ATGGATGAAA TCAAACGGGT CAACTTCAAG AGGATATCTA
AAGACCGATA AATATCAAGC ATATGCAACA TATTTAGCAG AGCATATAAA AAACTACAAA
TCAAAATTTG GAATTGATAT TACTCATATA GGGATTTCAA ATGAGCCTAA CCTTGAAACA
GACTATTCTT CATGTACATG GACAGCAGCT CAATTCAAAA CCTTTATGAA GGATTATCTG
GTACCAACTT TTGATAAAGA AGGTATTACT GCAAAAGTTA TTATGGGAGA ACCAATGTCA
TGTACCGAAT CATTTGCAAT TGACTGTTTG AATGATGCCA CAGCATTGAC AAGAACAGAT
ATTGTAGGTT GTCACAATTA TGGATCATCA TACACAACTT TTCCAACCAC TAAGGCAAAG
GGAAAAGGAA TATGGCAGAC AGAAATATCA GACATGAATG GAAACGATAC TACAATAACT
GATGGTTTAA AGTGGTCAAA ACAAATCTTT GATTTTATGA CAATAACTCA GGGAAATGCA
TGGAATTACT GGTGGGGTGC GTGCTATAAA ACATATAATG GAGAAGGTCT CATACAAATG
GACATGAATT CAAAGACCTA TAAAGTTGCT AAAAGACTCT ATACTGTTGG ACAATATTCA
AGATTTATCA GACCGGGATG GCAGAGATTC GCTGCTACTT CGAACCCTGT GTCCAATGTA
TATGTTACCG CATATAAGGA TCCCGCTACA GGAAAATTTG CAATTGTTGC TATGAATGAC
GGTTATACAA ATCAATCAAT TACATATACA TTGAAAGGAT TTACTCCTGA CTCGGTTACT
CCATACACAA CTTCATCAAC CCAAGATTTG GCTGAAGGTA CAAAAATAAC TGTAAGCGGA
GGTAGCTTTA CAGCTAATCT GGCAGCAAAT TCTATAACAA CATTTGTTGG CGGAAGTGAT
GTAAATCCCG GTATCTATGG TGATGTCAAC GGCGACAAAG TTGTTGATGC CATTGACTTT
GCACTTTACA AGCAGTATCT CATAAAGCAG ATTAGCACCT TCCCGTCACC TGACGGAATG
AAGCTTGCTG ATGTAAACGG TGATAACAGT GTTGATGCAA TTGATTTTGC ATTAATCAAG
AAATACTTGC TTGGTTCAAT AACTAAACTT CCGGTTTAA
 
Protein sequence
MKKIIRLLGL TMVLMLVFTM VLPLNLYAAS TVTVDWGTNY QTIDGFGVSE AFHQSNNIAL 
LGDTKKKEIY DLLFSTTKGA GFSIFRSILG DGGTWGNATD GPNKTMQPSE TTWDWKESND
DQISMIREIQ SGYGINKILY TVWSPPAWMK SNGSTSRGYL KTDKYQAYAT YLAEHIKNYK
SKFGIDITHI GISNEPNLET DYSSCTWTAA QFKTFMKDYL VPTFDKEGIT AKVIMGEPMS
CTESFAIDCL NDATALTRTD IVGCHNYGSS YTTFPTTKAK GKGIWQTEIS DMNGNDTTIT
DGLKWSKQIF DFMTITQGNA WNYWWGACYK TYNGEGLIQM DMNSKTYKVA KRLYTVGQYS
RFIRPGWQRF AATSNPVSNV YVTAYKDPAT GKFAIVAMND GYTNQSITYT LKGFTPDSVT
PYTTSSTQDL AEGTKITVSG GSFTANLAAN SITTFVGGSD VNPGIYGDVN GDKVVDAIDF
ALYKQYLIKQ ISTFPSPDGM KLADVNGDNS VDAIDFALIK KYLLGSITKL PV