Gene Ccel_1972 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1972 
Symbol 
ID7310686 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp2333201 
End bp2334706 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content39% 
IMG OID643608906 
Productglycoside hydrolase family 43 
Protein accessionYP_002506300 
Protein GI220929391 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3507] Beta-xylosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAAAAA AATTAACCGC TTTATTTATG GTTATCTTTA CCGTTCTTGT ATTAGGCACT 
TCTGTAGCCT TTGCCGAAAT TGGCATAATA AATAACGGCA CAAATTTTGG AGGCGTGCAG
GCACACGGGG GCAGTATCAT CAAGGTAGGC GAAACATATT ATTGGTATGG AGAAAACCGT
GATACAAATA ACCTGTTTGT ATCTGTGAAG GTTTACTCTT CAACTGATCT TGTGAATTGG
ACAGATATGG GTAATGCCTT ATCCAGAACA TCTGCGGCAG AGCTTAACTC CTGCAATATA
GAAAGACCGA AAGTTATGTA TAATGCATCA ACAAACCAAT ATGTAATGTG GATGCATTAC
GAAAACGGCA GGGACTATAC ATTGGCAAGG GCGGCTGTAG CATACAGTAC AAGCCCGACT
GGTCCGTTTA CTTATATTGG AAGTTTTAGA CCCAATAATA ACATGTCAAG AGATTGCACC
ACCTTTGTGG ATACTGATGG TACAGGATAT TTTATCTCTG CTGCAAACGA AAATGCCGAT
TTGGTAGTTT ACAGGCTTAC ATCGGATTAT AAGTCAGTAG CCTCACAAGT GGTAACTCTT
TGGCCGGGAC AATATCGAGA AGCTCCATGT ATGTTTAAAC GTGATAATAT CTATTACCTG
ATAACGTCAG GATGTACCGG GTGGGGTCCA AATCAGCAGA AATATGCTAC TTCCACATCA
ATATCTTCAG GCTGGACAGG TTTAACCAAT CTAGGTAATT CAACATGCTA TGACTCTCAA
GGTGCATGCG TTTTTCCGGT TGGTCGAAAC TTTGTATTAC TTACCGATAG ATGGGCAGGT
GCGTGGGGAG GAAAAGTAAA TGATTCGTCA TACCTGATGA TGCCTTTACT TATTAACGGA
ACCTCAGTAT CTCTAAACTA CACAGATACG GTAAAGGTTG CAGCTGAATT TGGTGCGATT
GCAAACGACT ATCACAGGTA CACCTTCGAA GGAGTAAAGA CAAACGCTGA CCCGTACCAA
TGGAACATAA GTCAACCTGC ATCAACAGAT ATACAAGTAC TTTCTGTAGA TGGAGACAAG
GCTCTTAAAA TGTCAGACAG CACAACATCC GGCTATTGCT ATGCATACAA GGATTTCCCT
GCTAAAAAAG GTATTGTTAC ATTTAATGTA GGTTTTAAAT TTGAGAATAC CGGAAGATGG
GATAGAATCA GACTTTACAA TGGAAATAAC ATAGGAATAG ACATTATAAA TTTTGAAGAA
GGGCTTGGTT GGATTACAGG AACTTCAACA AGAAATGTAT TACAAAAAGT TTCTTCAAAC
ACATGGTATG ATTTAAAAAT TGTTGCCGAT ACTTCTAATC AGAAATTTGA TGTATACATT
AATAATATGC TGATTAAAAC AGGTTGTGCA TTTACAGGTT CGATTACTGA GTTTGACAGA
ATTGCACTGG ATACAGGAAA TAGTTTTACA AATACGGCTG TATATGACGA TATAATAATT
AAGTAA
 
Protein sequence
MIKKLTALFM VIFTVLVLGT SVAFAEIGII NNGTNFGGVQ AHGGSIIKVG ETYYWYGENR 
DTNNLFVSVK VYSSTDLVNW TDMGNALSRT SAAELNSCNI ERPKVMYNAS TNQYVMWMHY
ENGRDYTLAR AAVAYSTSPT GPFTYIGSFR PNNNMSRDCT TFVDTDGTGY FISAANENAD
LVVYRLTSDY KSVASQVVTL WPGQYREAPC MFKRDNIYYL ITSGCTGWGP NQQKYATSTS
ISSGWTGLTN LGNSTCYDSQ GACVFPVGRN FVLLTDRWAG AWGGKVNDSS YLMMPLLING
TSVSLNYTDT VKVAAEFGAI ANDYHRYTFE GVKTNADPYQ WNISQPASTD IQVLSVDGDK
ALKMSDSTTS GYCYAYKDFP AKKGIVTFNV GFKFENTGRW DRIRLYNGNN IGIDIINFEE
GLGWITGTST RNVLQKVSSN TWYDLKIVAD TSNQKFDVYI NNMLIKTGCA FTGSITEFDR
IALDTGNSFT NTAVYDDIII K