Gene Ccel_1239 details


Gene Information

Locus tag: Ccel_1239
Symbol:
ID: 7312202
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 1526092
End bp: 1529139
Gene Length: 3048 bp
Protein Length: 1015 aa
Translation table: 11
GC content: 43%
IMG OID: 643608160
Product: Carbohydrate binding family 6
Protein accession: YP_002505575
Protein GI: 220928666
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
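
The start and end coordinates, gene length, and protein length in this record agree with one another, and the arithmetic can be checked directly. The following is a minimal sketch in Python. It assumes the coordinates are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length; the record does not state either convention. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1526092, 1529139)
print(length, protein_length(length))  # prints: 3048 1015
```

Under those assumptions the result matches the listed 3048 bp and 1015 aa.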


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000298242
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTGAGAA AGAAAATTTT ATGTATATTT CTTGTGACTG TATTAATGCT AACTATATTA 
CCAATACCTC AACAGACGGT AATGGCTGAT ACAGGGGTAC TTAAGGAACT TAAAGGAACT
GACATCTACA ACGGATTAAG GGGACTGAAT TTTAACGAGG GTTGGAAATT CAATAAGGGG
GATGTAAGCA ACGGCCAGAG TACCGGGTAT AATGACAGCG GCTGGTCAGG TGTTACACTG
CCACATGACT GGAGTATTTA TAACACTTTT AATAAATCCT CAGCAGCGGG TGCAGGAGGA
GGTTATCTGG ATGGAGGAAT CGGATGGTAC AGGAAAACCT TTACCGTACC TTCGGATTAT
ACAGGAAAGA AGGTATTCAT TGAATTCGAC GGAGCGTACA TGAACAGCCA GGTATGGATT
AACGGTACTT TGCTGGGAAC CCGTCCATAT GGCTATTCAT CCTTTGAATA TGATTTAACT
CCATACCTCA ATATAGGTGG AAGTAATGTT ATTGCAGTAA GGCTCAACAA TAACCAACCC
ACTAGCCGCT GGTATTCAGG AAGCGGTATT TATCGGAATG TATGGCTGAC AGTATTAGAT
CCTGTGCATG TGACATACTG CGGAATGTTT GTAACAACAC CCACGGTCAG CAGCAGTTCG
GCCACCGCAA ACGTCAGCAC AAAGGTATTG AATCAGGGAA GTACTGCAAA ATCGGTATCA
TTAAAAACTA CAATTACCGA TGCGGAGGGT AATGTCGTAG CCACAAATAC TTCTTCAGTG
GCTAGTATAG CTGGCAGTGG TAGCAATACC TTCAGTCAGA ATCTGACAGT ATCAAACCCG
CATTTATGGT CTCCCGCTTC ACCATACCTA TATGCTGTTC AAACCCAGGT TATTGTGGAC
GGAAATGTAA CAGATACATA TAGTTCTACA TTGGGAATCA GATATTTCAG CTTTAGCTCC
ACATCAGGCT TTTCACTTAA TGGTGTAAAT ATGAAGATAA ATGGAGTGTG TCTGCATCAT
GATTTGGGTT CTCTTGGTGC GGCTGTTAAC TATCGTGCAA TTGAAAGAGA ACTTCAGATT
ATGAAGGATA TGGGATGTAA TGCAATTCGT ACTTCACATA ACCCGCCAGA CCCGCAAATG
CTGGAAATAT GTGACAGATT AGGTTTGATG GTTATGGATG AAGCATTTGA CTGCTGGGAA
ACAGGAAAAA ATTCTAATGA CTACCATCTG TACTTTAACA ATTGGGCACA GACCGATCTT
CAAGCAATGG TTACAAGGGA TCGTAATCAT CCGTCAATTA TCATGTACAG CATAGGCAAT
GAGATTCCTT CACCAAGTGT AGCTACAGCG ACAAAGCTTA AGAATTGGGT AAAAGATGTT
GATAACACAC GACCAGTCAC ATTGGGAACT TTCGCCGTAA GCATGGGTGA TGCAACTCCG
CAGGCGGTCG CAAGTGTTCT TGATTTAGTG GGCTATAACT ATTTTCCATA CATGTACGAC
GGAGGACATA ACAATCATCC GGAGTGGAAG ATGTTTGGAA GTGAAACGAG TTCAGCGGTC
AGAAGCCGCG GGGTTTATAA AACACCCACC AATAAAAACA TCCTGACTGA CAGTGATAAC
CAGTGTTCCT CTTATGATAA CAGTGTGGTC AGCTGGGGCA ACAGTGCTGA ATCTTCCTAT
AATGAAATTA ATAAAAGAAA TTACATGGCG GGTGAATTCA TATGGACAGG ATTTGACTAT
ATCGGTGAAC CTACACCTTA CGAATGGCCG GCAAAAAGTT CGTATTTTGG AATAGTGGAT
ACATGCGGGT TCCCGAAAGA TATCTACTAT TTCTATCAAA GTAAGTGGAG TACTAAACCG
ATGGTTCACA TACTCCCACA TTGGAACTGG TCTACGGGTA CTAATGTAGA GGTATGGGCA
TACAGTAACT GCGATACTGT AGAACTATTC CTTAATGGGA AGTCACTTGG CTCGAAAAGC
GTTGGAACAG CAGGACATCT TTCATGGAGT GTTCCATGGT CTTCAGGGAC ATTGCGGGCA
AAAGGTACAA AGGGCGGCAC TGTGGTATAC GATGAAGTCG TCACAGCAGG TACTCCTTCA
AAAGTCCTGT TGAAGCCTGA CAGAACTTCT GTTAAAGCAG ACGGTAAGGA TTTGATATAT
ATCGAAACAG ATATTGCAGA CGGCAACAAT GTGACTGTCC CCACAGCTGA CAATACAGTG
AATTTCTCCA TATCAGGTCC CGGTGTAATT GTGGGAGTTG ATAATGGAAA TCCAATAAGT
ACGGAAGCAT ATAAAGGCAG CAGTCGTAAG GCTTTCAACG GTAAGTGTCT GGTAATTGTC
CAGCCTACCA AAGTTAATGG AACAATTGTA GTAACGGCAA GCTCCAACGG ACTATCCTCT
GGAAGTGTTT CCATCGCTTC AACAGGAGGA GCAGAAGCAC CCATCGTATC TGCCTATAAA
AAGATTGAGG CCGAAAACTA CGATAATCAG TCTGGAATCC AGACAGAAGC CTGTTCGGAA
GGCGGACAGG ATGTAGGATT TATTGAAAAC GGGGACTACA CTGTTTACAA CAATGTGGAT
TTCGGTAGCG GTGCCGAGAG CTTTACGGCA AGAGCAGCAA GTGCTACTAG CGGAGGAAAT
ATTGAGATCA GACTTGACAG TCCTAACGGG ACTTTAATTG GAACTTGTCC AGTTGCAGGA
ACAGGTGATT GGCAGACTTA TGCTGATGTA AATTGCAGTG TCAGCGAGGT AAGCGGAAAA
CACGACCTGT ATCTAAAATT CACCGGGGAT AGCGGATATT TATTTAATAT TAACTGGTTT
ACATTTACTG CACGGAATTC AGCAAAATTG GGAGATTTGA ATTCCGATGA CCAAATAGAC
GCGATGGATT TCCAGTTGAT GAAAAAGTAT CTGCTGGGAC TGGGAGAAAT TAAAGATACA
AAGCTGGCCG ATTTGGATGC AAGTGGTACT ATTGATGTTT TAGATCTGAT GCTGCTCAAA
CAGTACCTGC TAGGCACTAT AACTTCTTTT CCGGGGCAGG GTACTTAA
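
The gene sequence above is printed in blocks of 10 bases separated by spaces. To compare it with the listed gene length and GC content, the blocks first need to be joined. The following is a minimal Python sketch with hypothetical helper names. It assumes the sequence contains only the letters A, C, G, and T. The example runs on the first printed line only, so its GC fraction need not equal the 43% reported for the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First printed line of the gene sequence (six blocks of 10 bases).
first_line = "ATGTTGAGAA AGAAAATTTT ATGTATATTT CTTGTGACTG TATTAATGCT AACTATATTA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence text to `clean_sequence` in the same way should give a string whose length can be compared with the Gene Length field.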
 
Protein sequence
MLRKKILCIF LVTVLMLTIL PIPQQTVMAD TGVLKELKGT DIYNGLRGLN FNEGWKFNKG 
DVSNGQSTGY NDSGWSGVTL PHDWSIYNTF NKSSAAGAGG GYLDGGIGWY RKTFTVPSDY
TGKKVFIEFD GAYMNSQVWI NGTLLGTRPY GYSSFEYDLT PYLNIGGSNV IAVRLNNNQP
TSRWYSGSGI YRNVWLTVLD PVHVTYCGMF VTTPTVSSSS ATANVSTKVL NQGSTAKSVS
LKTTITDAEG NVVATNTSSV ASIAGSGSNT FSQNLTVSNP HLWSPASPYL YAVQTQVIVD
GNVTDTYSST LGIRYFSFSS TSGFSLNGVN MKINGVCLHH DLGSLGAAVN YRAIERELQI
MKDMGCNAIR TSHNPPDPQM LEICDRLGLM VMDEAFDCWE TGKNSNDYHL YFNNWAQTDL
QAMVTRDRNH PSIIMYSIGN EIPSPSVATA TKLKNWVKDV DNTRPVTLGT FAVSMGDATP
QAVASVLDLV GYNYFPYMYD GGHNNHPEWK MFGSETSSAV RSRGVYKTPT NKNILTDSDN
QCSSYDNSVV SWGNSAESSY NEINKRNYMA GEFIWTGFDY IGEPTPYEWP AKSSYFGIVD
TCGFPKDIYY FYQSKWSTKP MVHILPHWNW STGTNVEVWA YSNCDTVELF LNGKSLGSKS
VGTAGHLSWS VPWSSGTLRA KGTKGGTVVY DEVVTAGTPS KVLLKPDRTS VKADGKDLIY
IETDIADGNN VTVPTADNTV NFSISGPGVI VGVDNGNPIS TEAYKGSSRK AFNGKCLVIV
QPTKVNGTIV VTASSNGLSS GSVSIASTGG AEAPIVSAYK KIEAENYDNQ SGIQTEACSE
GGQDVGFIEN GDYTVYNNVD FGSGAESFTA RAASATSGGN IEIRLDSPNG TLIGTCPVAG
TGDWQTYADV NCSVSEVSGK HDLYLKFTGD SGYLFNINWF TFTARNSAKL GDLNSDDQID
AMDFQLMKKY LLGLGEIKDT KLADLDASGT IDVLDLMLLK QYLLGTITSF PGQGT
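
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal Python sketch, not the database's own method. Translation table 11 assigns amino acids to codons exactly as the standard genetic code does and differs only in which codons may act as starts, so the sketch uses the standard assignments. The names `CODON_TABLE` and `translate` are hypothetical, and the code assumes the input contains only A, C, G, and T.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments,
# which translation table 11 shares); '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(blocked_dna: str) -> str:
    """Translate a blocked DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(blocked_dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on non-ACGT letters
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence.
print(translate("ATGTTGAGAA AGAAAATTTT ATGTATATTT"))  # prints: MLRKKILCIF
```

The output matches the first block of the protein sequence. The sketch does not handle alternative start codons, which table 11 would read as methionine at the first position; this gene starts with ATG, so that case does not arise here.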