Gene Ccel_1223 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1223 
Symbol 
ID7310020 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp1497206 
End bp1498234 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content40% 
IMG OID643608144 
Productperiplasmic binding protein/LacI transcriptional regulator 
Protein accessionYP_002505559 
Protein GI220928650 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.999664 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGA AGTTAATTGC ACTTCTGATG TGTTTTTCAT TGGTAATAGC GGCAGGATGT 
GGAGCTTCCG ATACCAATTC ATCAGATTCC GAATCGTCTG AGTCTGCCCA ATCCACATCA
GCTAATGATT CAGGCAGCAA CAAGTTGATT ACAATAGGCT TCTCACAGGT TGGTGCTGAA
AGTGACTGGC GGGTAGCTAA CACTGCATCA ATGAAATCCG CATTATCCGA AAAAAATGGC
TTCAAATTGA TTTTTGCTGA TGCTCAGCAG AAGCAGGAGA ACCAGATTAA AGCAGTAAGG
GATTTTATAT CACAGGATGT TGACGTTATA GCTATAGCAC CTGTTACAGA AACGGGCTGG
GAAACGGTAT TGGGTGAAGC AAAGGATGCA GATATACCGG TAATTATTGT TGATAGAATG
ATAAAGGTTT CGGATGATTC ACTCTTTAGC TGCTGGGTTG GTTCAGACTT CCAGAAGGAA
GGTGTTAACG CAGCTGAATG GTTAGTTAAC TATATGAAAG AGAAAGGCAA GACCGATAAA
CAAAATGTTG TAGTTCTTCA GGGAACAATA GGATCATCTG CTGAAATAGG CCGTACAAAG
GGTTTTGGTG ATACTATAAA GAAATATGAT AACTTTAATA TACTGGCACA ACAGACTGGA
GAGTTTACTC AGGCAAAAGG CCAGGAAGTA ATGGAATCCT TCTTAAAACA GTACAATGAC
ATCGATGTAG TTATAGCACA GAACGATAAT ATGGCCTTCG GAGCTATTGA TGCTTTAAAA
GCAGCAGGTA AGGCTCCTGG AAAAGATGTA ACAATTGTAT CCTTTGACGC AGTTAAGGCT
GCATTCAAAT CAATGATAGC AGGGGATATG AATGTATCGG TAGAATGTAA TCCTTTACAC
GGGCCTAGAG TAGCTGAACT GGCTAAAAAA CTCATGAACG ATGAAAAAGT TGAAAAGATA
CAGTATGTTG ATGAAAAAGT ATATCCGGCT GAAATAGCTG AAAAAGAACT CCCGAATCGC
CAATATTAA
 
Protein sequence
MKKKLIALLM CFSLVIAAGC GASDTNSSDS ESSESAQSTS ANDSGSNKLI TIGFSQVGAE 
SDWRVANTAS MKSALSEKNG FKLIFADAQQ KQENQIKAVR DFISQDVDVI AIAPVTETGW
ETVLGEAKDA DIPVIIVDRM IKVSDDSLFS CWVGSDFQKE GVNAAEWLVN YMKEKGKTDK
QNVVVLQGTI GSSAEIGRTK GFGDTIKKYD NFNILAQQTG EFTQAKGQEV MESFLKQYND
IDVVIAQNDN MAFGAIDALK AAGKAPGKDV TIVSFDAVKA AFKSMIAGDM NVSVECNPLH
GPRVAELAKK LMNDEKVEKI QYVDEKVYPA EIAEKELPNR QY