Gene Ccel_0739 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_0739 
Symbol 
ID7312141 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp860434 
End bp862464 
Gene Length2031 bp 
Protein Length676 aa 
Translation table11 
GC content44% 
IMG OID643607677 
Productcellulosome protein dockerin type I 
Protein accessionYP_002505097 
Protein GI220928188 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value5.88173e-07 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCGAAAGA AAATCTTCAT GTTACTGCTT GCTCTTTTGC AAGTAGCACT TTTGACTATA 
CCTCAACAAC TTGTTACGGC AGCATCGGCA CGCCAGATGG AGTATCTTGA CCGGGGAATC
GTTGCGGTTA AAGTAAGCAA TGGAGTATTC GTAAGCTGGA GAATGCTAGG AACAGATTCA
ACCAGTATCG CCTTCAATCT CTACCGTGAC GGGAAAAAGG TTAATACCAC TCCAATTACC
TCTGGTACAA ATTATGTCGA TACAACAGGA ACAGCCAGCT CAACATACTA TGTTTGTCCT
GTAAAGGATG GTGTTGAGCT GGCAAAATCC GAATCGACAG CAGTATGGGG GCAAAATTAT
TTATCCCTGC CTCTTCAAAT ACCCGCAGGA GGAACAACAA AAAGCGGTGA ATATTACACA
TACTCTGCAA ATGATTGTTC TGTGGGTGAT CTCGATGGGG ATGGCCAGTA TGAGATAATC
CTTAAATGGG ACCCGTCTAA TGCAAAAGAC AACTCACAGA GCGGGTATAC AGGGAATGTA
TACCTTGATG CATACAAACT AAATGGTAAG TTTTTGTGGA GAATTGATCT TGGAAGAAAT
ATCCGTGCAG GTGCTCACTA TACGCAATTT ATGGTTTATG ATCTGGATGG CGATGGAAAA
GCGGAGGTGG CCTGCAAGAC CGCTGATGGG ACAAAAGACG GGATAGGAAA AGTAATCGGG
AACGCAAATG CAGACTATAC AAATTCTAAT GGCTACATTT TATCCGGCCC CGAGTACCTG
ACCATTTTTA ATGGAGAGAC AGGAGCGGCA CTTACTACTA CCGACTATGA TCCTCCAAGA
GGCACTGTAA GTTCCTGGGG AGACAGTTAC GGGAACCGTG TAGACCGTTT CTTGGCTTGT
ATTGCCTACC TTGACGGTGT CCATCCAAGT CTTGTCATGT GCAGAGGCTA TTATACAAGA
GCAGTTCTTG CAGCCTATAA CTGGAGAGAT GGCAAGCTTA CAAAAGTATG GACCTTTGAC
AGTAAAGATA GTGGAAATTC TGCATATGAG GGACAGGGGA ACCACAATCT TAGTGTAGGG
GATGTTGATA GCGACGGAAT GGATGAGATT GTTTACGGAG CTTGTGCAAT AGACCATAAC
GGGAGAGGCC TGTATACTAC AAAGCTCAAT CACGGAGATG CAATGCACTT GTCTGATATG
GATCCGGGCA GACCAGGACT TGAAGTCTGG CAGTGTCATG AAACTGGCTC CAATGCATCT
TCCGGTGAGT ACCGTGATGC AAGAACAGGA CAACTAATAT GGGGACTTGC AGGTACATCA
GACACCGGAA GGGGACTTGC ACTTGATATA GATCCCCGAT ACAAGGGTTT TGAAATGTGG
TCATCAAGCA GTGACGGAGT ATACAATGTA AACGGTACAA AGGTGTCAAC CACAAAGCCC
TCTATGAATT TCGGAGTTTG GTGGGATGGG GATTTAAACC GTGAACTTTT GGATGGAACT
AAGCTCGACA AATGGGATTA TGTCAATAGT ACTCCCATGA GACTGCTTAC TCCTGCTGAT
TGTGCTTCAA ACAACTCAAC AAAAGCAAAT CCTTGCCTTA GTGCGGACAT TTTAGGAGAT
TGGCGAGAGG AAGTTATTTG GAGGACTACG GATAATAAGT ACCTGCGTAT TTATACAACA
ACTGCAGTTA CGAGCAACAG AATTTATACC TTAATGCACG ATCCCCTGTA CAGACTTAGT
ATTGCTTGGC AGAATGTGGC GTACAATCAG CCTCCGCATA CAGGATTTTA TCTTGGTGAT
GGTATGAGTC AGGCACCAAC ACCTAATATT TATATTGCAA AGCCTGCTTC CTCGTTTTTA
CCCGGTGACG TAAACAATGA CGGATCTGTT GATGCCTTGG ATTTTGCCAT TGTTAAAAAA
TACCTATTAG GTCAATCAAC AGAGTTAGAT ACCGGTAAAG CAGACATGAA TTCAGACGGG
GACATCAATG CACTAGACTT GGCATTACTA AAAGCAAGTT TGCTTTCGTA G
 
Protein sequence
MRKKIFMLLL ALLQVALLTI PQQLVTAASA RQMEYLDRGI VAVKVSNGVF VSWRMLGTDS 
TSIAFNLYRD GKKVNTTPIT SGTNYVDTTG TASSTYYVCP VKDGVELAKS ESTAVWGQNY
LSLPLQIPAG GTTKSGEYYT YSANDCSVGD LDGDGQYEII LKWDPSNAKD NSQSGYTGNV
YLDAYKLNGK FLWRIDLGRN IRAGAHYTQF MVYDLDGDGK AEVACKTADG TKDGIGKVIG
NANADYTNSN GYILSGPEYL TIFNGETGAA LTTTDYDPPR GTVSSWGDSY GNRVDRFLAC
IAYLDGVHPS LVMCRGYYTR AVLAAYNWRD GKLTKVWTFD SKDSGNSAYE GQGNHNLSVG
DVDSDGMDEI VYGACAIDHN GRGLYTTKLN HGDAMHLSDM DPGRPGLEVW QCHETGSNAS
SGEYRDARTG QLIWGLAGTS DTGRGLALDI DPRYKGFEMW SSSSDGVYNV NGTKVSTTKP
SMNFGVWWDG DLNRELLDGT KLDKWDYVNS TPMRLLTPAD CASNNSTKAN PCLSADILGD
WREEVIWRTT DNKYLRIYTT TAVTSNRIYT LMHDPLYRLS IAWQNVAYNQ PPHTGFYLGD
GMSQAPTPNI YIAKPASSFL PGDVNNDGSV DALDFAIVKK YLLGQSTELD TGKADMNSDG
DINALDLALL KASLLS