Gene Ccel_1231 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1231 
Symbol 
ID7310028 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp1509068 
End bp1510642 
Gene Length1575 bp 
Protein Length524 aa 
Translation table11 
GC content43% 
IMG OID643608152 
ProductCarbohydrate binding family 6 
Protein accessionYP_002505567 
Protein GI220928658 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3507] Beta-xylosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0177163 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAG TAAGCAAAAT TGCAGCTCTC TTATTAGTAC TTACTTTGTT TATTCCAACA 
GTAGGCTATG CAGACAATCC AATAGTGCAG ACTTTATATA CTGCTGACCC TGCCCCTATG
ATATATAACG ATACATGCTA CGTGTACACG GGACATGACG AGGATACACT GGTAAATGAT
TTCTTTACTA TGAATGACTG GAGATGTTAC TCCTCAACTG ATATGGTGAA TTGGACAGAT
AACGGTTCGC CACTGTCATA CTCTTCGTTC AGTTGGGCAA AAGGGGATGC ATGGGCAGGC
CAATGTATTC AAAGAAATGG AAAGTTTTAT TATTATGTTC CTTTGACCCC GAAAACCGGA
GGGACGGCAA TCGGTGTTGC AGTATCAGAT AGCCCTACAG GCCCATTTAA AGATCCCCTT
GGAAAACCAT TAGTTAGTAC TGGCAGCGGC GACATTGACC CGACAGTATA TATCGATGAT
GACGGACAGG CTTATCTGTA CTGGGGCAAT CCAAACCTTT ACTATGTAAA GTTGAATCAG
GATATGGTTT CCTACTCAGG CAGTATTGTA AAGGTACCTT TGACAACTGC AAGTTTCGGA
ACAAGAAGTA AAACCGACAG ACCAACTACA TACGAGGAGG GACCATGGTT TTACAAGCGT
AACAGTTTGT ACTATATGGT ATTTGCAGGC GGCCCCATAT CAGAGCATAT AGGTTATTCC
ACCAGTACCG GGCCTACAGG GCCTTGGACT TATCGTGGTA AAATCATGCC TACCCAGGGC
AGCAGCTTTA CAAATCATCC GGGGGTAGCC GATTTTAAAG GTAATTCCTA TTTCTTCTAT
CACAATGGTG CTTTGCAGGG TGGAGGAGGG TTTCACCGTT CGGTGTGTGT AGAACAATTT
AAATATAATG CTGACGGTAC TTTTCCAACC ATAAATATGA CTACAACCGG CTCTACCCAA
ATCGGCAATC TTAATCCATT TATTAAAACT GAGGCGGAAA CAATCTGCTG GGAATCAGGT
ATTGAAACGG AAAAGTGCAG TGAAGGCGGA ATGAATGTAG CCTTTATAGA AAATGGGGAC
TATATAAAGG TAAAAGGTGT TGATTTTGGT ACGGGTGCAG CAGCCTTTAC TGCCAGAGTT
GCTTCTGCAA CCGACGGCGG GAATCTAGAA CTTCGGCTTG ACAGCCCAAC AGGTAAACTT
GTGGGGACTT GTGCAGTTAC AAGCACAGGA GGATGGCAGA CATGGGTCGA TAAGACCTGT
ACGGTAAGCG GTGCCGAGGG GATACATGAC TTGTACCTGA AATTTACAGG TGGAAGCGGT
TATCTGTTCA ATTTTAACTG GTGGAAGTTT ATCAAAGCTG GGAATACCTC TGTTATTGGA
GATCTCAATG GAGACAAAAG CGTGGATGCG GCAGATTATG CCATGATGAA GAAATATCTT
TTGGGATTAA TTGAAGATTT TCCGGCAGAA AACGATATTG AAGCCGGAGA CTTAAATAAA
GACAGCGTCA TTGACGCACT TGATTTTGCA GTTTTTAAAA AATATCTGCT TGGTACAATT
CCAAGTTTAC CATGA
 
Protein sequence
MKKVSKIAAL LLVLTLFIPT VGYADNPIVQ TLYTADPAPM IYNDTCYVYT GHDEDTLVND 
FFTMNDWRCY SSTDMVNWTD NGSPLSYSSF SWAKGDAWAG QCIQRNGKFY YYVPLTPKTG
GTAIGVAVSD SPTGPFKDPL GKPLVSTGSG DIDPTVYIDD DGQAYLYWGN PNLYYVKLNQ
DMVSYSGSIV KVPLTTASFG TRSKTDRPTT YEEGPWFYKR NSLYYMVFAG GPISEHIGYS
TSTGPTGPWT YRGKIMPTQG SSFTNHPGVA DFKGNSYFFY HNGALQGGGG FHRSVCVEQF
KYNADGTFPT INMTTTGSTQ IGNLNPFIKT EAETICWESG IETEKCSEGG MNVAFIENGD
YIKVKGVDFG TGAAAFTARV ASATDGGNLE LRLDSPTGKL VGTCAVTSTG GWQTWVDKTC
TVSGAEGIHD LYLKFTGGSG YLFNFNWWKF IKAGNTSVIG DLNGDKSVDA ADYAMMKKYL
LGLIEDFPAE NDIEAGDLNK DSVIDALDFA VFKKYLLGTI PSLP