Gene Ccel_1235 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1235 
Symbol 
ID7310032 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp1516222 
End bp1517751 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content40% 
IMG OID643608156 
ProductCarbohydrate binding family 6 
Protein accessionYP_002505571 
Protein GI220928662 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3507] Beta-xylosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00440741 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAG TTTGCAAGAT ATTAGTACTT TTACTTATTT TTACTACGCT TTTTTCATTC 
GTATGTTATG CGGATAACCC GATTGTGCAG ACGTATTACA CTGCTGATCC GGCATCGATG
ATTTATAACG ACACTTGTTA CCTGTATGTC ACACATGACG AGGATACGAC TGTAAATAAC
TTTTATACCA TGAACGAATG GAGATGCTAT TCAACAACAG ATATGGTAAA TTGGAAAGAT
TGCGGTTCAG TACTGTCTTA CAAAACATTC AGCTGGGCAA AAGGAGATGC ATGGGCAGCA
CAGTGTATAC CTAGAAATGG AAAGTTCTAT TTATATGTTA CACTTACTAA CCAATATGGT
GCGAGAACAG TAGGTGTTGC AGTGTCAGAC AGCCCTACAG GACCATTTAA AGACGCTCTT
GGGAAGCCTT TAATTGCAAA TAACGGGGCA CAGGATATTG ATCCTACCGT ATTTATCGAT
AATGACGGGC AAGCCTACTT GTATTGGGGA AATGGCAATG CATACTATGT GAAATTAAAT
CAGGATATGA TTTCCTACTC AGGAAGCATA GTACAAGTAA ACCCAAAGCC GTCAGGGTAT
ATAGAGGGTC CATGGCTTTA TAAACGAAAC AATGTATACT ATTTGGTATA TGCAGGTATG
GGTTCAAATG GCGAAAATAT ACAATATGCT ACAAGCAACA GTCCTACAGG CCCATGGACT
TCAAAGGGTG CTATTATGAA CTCAAAAAAC AGCTTCACAA TTCATCCTGC CATAACCGAC
TTTAAAGGAA AATCATACTT TTTCTATCAT ACAGGTGATT TGCCGGGAGG AGGCAGCTAC
AAACGTTCAG TTTGTGTAGA AGAGTTTAAA TACAATTCTG ACGGTACGAT ACCAACTATC
CCTATAACTA AAACCGGCCC TGCTCAGGTT CAATACCTTG ATCCATTTGT TAAAAACGAG
GCTGAAACAA TCTGCTGGGA GTCCGGAGTT GAAACAGAAA TATGCAGTGA AGGCGGAATG
AATGTAGGCT TTATCGAAAA CGGGGATTAT ATAAAGGTAA AAGGTGTAGA TTTTGGCTCG
GGAGCATCAT CATTTGTTGC CAGGGTTGCT TCAGAAACAA GCGGCGGGAA TATAGAACTA
CGACTTGACA GTCCTACAGG CAAACTGGTT GGAACTTGTT CAGTAAGCGA AACGGGCGGA
TGGCAGACCT GGAGTGATAA GTCCTGTACA GTAAGCGGTG CTGAGGGAGT ACATGACTTA
TATCTGAAAT TCACCGGAGG AAGCGGTTAC CTGTTTAATT TCAATTGGTG GAAGTTCGAA
AAAAGCGGAA CACCTACAAT TGTTGGAGAC CTCAATGGGG ATGACTGTGT AGATGCTGCA
GATTATGCAT TGATGAAGAA ATATATTCTG GGATTAATTA ATGATTTTCC TGTAGATAAC
GACATTGAAG CAGGAGATTT AAATAAGGAC GGAACTATTG ATGCACTAGA TTGTGCCGTT
TTCAAGAAAC TTCTGTTAGG TATTATTTAA
 
Protein sequence
MKKVCKILVL LLIFTTLFSF VCYADNPIVQ TYYTADPASM IYNDTCYLYV THDEDTTVNN 
FYTMNEWRCY STTDMVNWKD CGSVLSYKTF SWAKGDAWAA QCIPRNGKFY LYVTLTNQYG
ARTVGVAVSD SPTGPFKDAL GKPLIANNGA QDIDPTVFID NDGQAYLYWG NGNAYYVKLN
QDMISYSGSI VQVNPKPSGY IEGPWLYKRN NVYYLVYAGM GSNGENIQYA TSNSPTGPWT
SKGAIMNSKN SFTIHPAITD FKGKSYFFYH TGDLPGGGSY KRSVCVEEFK YNSDGTIPTI
PITKTGPAQV QYLDPFVKNE AETICWESGV ETEICSEGGM NVGFIENGDY IKVKGVDFGS
GASSFVARVA SETSGGNIEL RLDSPTGKLV GTCSVSETGG WQTWSDKSCT VSGAEGVHDL
YLKFTGGSGY LFNFNWWKFE KSGTPTIVGD LNGDDCVDAA DYALMKKYIL GLINDFPVDN
DIEAGDLNKD GTIDALDCAV FKKLLLGII