Gene Ccel_1233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1233 
Symbol 
ID7310030 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp1512213 
End bp1514453 
Gene Length2241 bp 
Protein Length746 aa 
Translation table11 
GC content41% 
IMG OID643608154 
ProductCarbohydrate binding family 6 
Protein accessionYP_002505569 
Protein GI220928660 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3507] Beta-xylosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.555313 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAATTTA AAAAGGTCTT CAGTATTTTA TTGCTGGTAT GTCTATTGGT TACACAAACG 
ACTATAGTCG GTGCATGGCA GTCTGATAAT GACAACGGTA CCTACACAAA CCCGATTTTA
AACGCTGATT ATCCTGATAT TTATGCTATC AGGGTGGGAA GTGATTACTA TATGGTCAGC
TCCACTTTCG TATCATTTCC ATCAATACCT ATCCTTCATT CAAAGGATTT AATTAACTGG
GAAATTATAG GATATGTTTC TTCGGATCTT AACGGTACAG GAAAATCTTA TAATCTGACT
AATACATTTG ACGATTACGG ACATGGCTGT TGGGCACCAA GCATTGCCTA CCGTAACGGT
ACATATTATG TGGGAATATA TCAGGCACAG GGGAAGTTCA TCATGTGTAC CGCCACAAAT
CCGGCCGGCC CTTATACAAA AACGGTATAC AATCAAGGAT TTCATGACCC GGCACTTTTT
ATTGATGACG ACGGTACAGG ATATATCGTC TCTGAAGCAA ACGATGTAAA GGTTACGAAG
CTAAGTGCTG ATTATAAGTC GGTAGTGAAC AGTCAGGTGA CTACCCGTAT TGAGGGTGCA
ACAGGAAATT ACCTGGTTGA AGGCACACAT GTTATGAAAA AGAACGGATA TTATTACATA
TTCCGTAATT CAACTCCTCC AGAATCATAT ACGTACTGCC TCAGGTCAAA GAGTATATAT
GGCCCTTATG AAATGAGGAT ACTACTGAAC GGTCCAAGCA TTTCCGGCGG AAGTCAGATT
CACCAGGGGG GACCTATAGA TACTACAACT GGTGAGTGGT GGTTCTATGT ATTTCAGGAT
GGACAGGGAT TGGGCCGCAG AGACATTTTG ATACCTATGC AGTGGCAGAA TGACTGGCCG
ATTCTCGGAG ACCCTTCTAC AGTAAAAAAT GTAAGCATAG CAGGAAAAAC CTATCCTATG
GGTACCGTAC CGGTTACATA TAAAAAACCG GATGTAGGTG CAACATACCC AATCCTTACA
ATTCCGAATT CTGATGAATT CAGTTCAACA ACAATGGGAG TACAGTGGCA GTGGAATCAT
TACCCTGACA ATACAAAGTG GTCTTTAAGC GAGCGACCGG GCTATCTCAG ACTTCACGCT
AAAAATGCAA ATAATCTGTG GAGGGCCACT AACACTCTTA CACAAAGAGT GGAAGGGCCG
GATTGCCAGG GTACAATAGA GCTTGACACT ACCAATATGT TAGATGGGGA TAATGCTGGA
CTATGCCTGC TTAATATTCC GTATGGAACC ATCGGGGTAA GCAAATCCGG CGGAGTAAAA
AGGATAGTTG CCAATATCAA CGCTAAAAAG GATACGGCAG GAACAATTAC AAATGGTCCT
ACACTTACCG GAAATACTGT CTATTTAAGG GCTAAAGCAG ATCTTATCAG TAATAAGGCA
ACTTTTTTCT ATAGCACGGA TAATGTGAAC TTCACTCAGC TTGGAGGAAC ATTAAGTATG
CCCTTTGACT TGGGATTTTT TCAGGGGGAC AAGTTCGGTA TATTCAATTA TACAACTGCT
TCCTCGGGAG GTTATGCAGA TATTAACTGG TTCCGATATT ATACTTCTGC GGGGCCAAAT
ACTCCTATTC CTGACATTAC GTTATCTGCC TTTGAAAAAA TAGAAGCAGA AAGCTACAAC
TCCCAATCGG GAATCCAGAA TGTAACTTGT GATGAAGGAA CCGAGGCTGT AGGCTATACC
GAAAACGGCG ATTACGTTGT ATATAAGAAT GTTGATTTCG GAAGCGGAGC AAATGGTTTT
AACGCCAGAG TATCAAGTGC AACGGTTGGA GGTACGATTG AAATCAGGCT CGACAGTGCC
AATGGTACAT TAATAGGTTC TTGCCCTGTT GCGGGAACGG GAGGGTGGCA AGCTTTTACA
GATGCAAATT GTACCGTAAG TGGTGTAAGT GGTAAACATG ACCTATATTT GAAATTTACA
GGTGAAAGCG GATATCTGTT TAACATCAAC TGGTTTAAAT TCAGTAATAC ATCTGTTATT
TCAGATAAAT TAGGTGATGT TAATTCTGAC GGACAGATTG ATGCCATTGA CTTACAGCTC
ATTAAAAAAT ATATCCTGGG ATTGGGAGAA ATCGAGAACA TAAAAACTGC TGACCTGGAT
GGAAACGGCG ATGTAAATGC AATTGATTTT TCCCTTATGA AACAGTACTT ACTGGGGCTA
ATTACCGAGT TTCCAGGGTG A
 
Protein sequence
MQFKKVFSIL LLVCLLVTQT TIVGAWQSDN DNGTYTNPIL NADYPDIYAI RVGSDYYMVS 
STFVSFPSIP ILHSKDLINW EIIGYVSSDL NGTGKSYNLT NTFDDYGHGC WAPSIAYRNG
TYYVGIYQAQ GKFIMCTATN PAGPYTKTVY NQGFHDPALF IDDDGTGYIV SEANDVKVTK
LSADYKSVVN SQVTTRIEGA TGNYLVEGTH VMKKNGYYYI FRNSTPPESY TYCLRSKSIY
GPYEMRILLN GPSISGGSQI HQGGPIDTTT GEWWFYVFQD GQGLGRRDIL IPMQWQNDWP
ILGDPSTVKN VSIAGKTYPM GTVPVTYKKP DVGATYPILT IPNSDEFSST TMGVQWQWNH
YPDNTKWSLS ERPGYLRLHA KNANNLWRAT NTLTQRVEGP DCQGTIELDT TNMLDGDNAG
LCLLNIPYGT IGVSKSGGVK RIVANINAKK DTAGTITNGP TLTGNTVYLR AKADLISNKA
TFFYSTDNVN FTQLGGTLSM PFDLGFFQGD KFGIFNYTTA SSGGYADINW FRYYTSAGPN
TPIPDITLSA FEKIEAESYN SQSGIQNVTC DEGTEAVGYT ENGDYVVYKN VDFGSGANGF
NARVSSATVG GTIEIRLDSA NGTLIGSCPV AGTGGWQAFT DANCTVSGVS GKHDLYLKFT
GESGYLFNIN WFKFSNTSVI SDKLGDVNSD GQIDAIDLQL IKKYILGLGE IENIKTADLD
GNGDVNAIDF SLMKQYLLGL ITEFPG