Gene Ccel_3238 details

Gene Information

Locus tag: Ccel_3238
Symbol:
ID: 7311817
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 3777685
End bp: 3778878
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 39%
IMG OID: 643610140
Product: galactokinase
Protein accession: YP_002507508
Protein GI: 220930599
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0153] Galactokinase
TIGRFAM ID: [TIGR00131] galactokinase
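
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes 1-based, inclusive coordinates and that the stop codon counts toward the gene length but not the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(3777685, 3778878)
print(length)                     # 1194
print(protein_length_aa(length))  # 397
```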


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.217049
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGAAAA ATTATGATGA GCTAAAGAAA AAGTTTTGCC GTATATACGG TGGTAGTGAA 
GAGGATTTAA GGATATTTTC AGGGCCCGGA AGAGTAAATC TCATCGGAGA ACACATTGAC
TATTGCGGCG GGTTTGTTTT TCCGGCTGCT TTAAGCCTTG ATTCCACTGT GATTGCAAGA
ATAAATAATG ATAATACCCT AAGAGTTGCA GCAACAGATC TGCCTGATAG GGTAGAGGTG
GAACTGGACA AACTGGAAAG TGCAAAGAGT CTGAAATGGG GAAACTATCA GGCAGGAGTA
GCATTTATGC TCCAAGATGC AGGCTATAGG TTGGTGGGGG TAGACATGCT TTTTCACGAC
ACTGTTCCAC TGGGATCAGG ACTTTCATCT TCTGCGGCAA TAGAGTTGGC AACGGCAGTT
ACATTAGTTA CTCTGTCTAA TGAGGTATAT GGAATAACAA AACCAATAGA TATGGTAGAA
ATGGCTGTAC TGGGACAAAG AACCGAAAAT GAATTCTGCG GAGTAAGCTG TGGAATAATG
GATCAGTTCG CATCTGCAAT GGGTAAAAAG GACCATGCTA TTTTGTTAGA TTGCGGAACT
TTGGAATATA AATATTTACC ATTAAAGCTT GATGGTTATA AAATAGTACT TGGAAATACA
AAGAAAAAAC GTGCACTTGG CGAATCAAAA TATAATGAGA GAGTCAGAGA ATGTGCAGAA
GGCTTAAAAA TACTGCAAAA ATATTTGCCG AACAAAAGGA ATTTATGTGA TATAACTGTT
TCCGAATTTG AGCAATACAA GTCAATGATT GAGGATGAAG TAATCAAAAA AAGAGTTACT
CATGTTATCA GCGAAAACGA CAGAGTACTT AGAGCCGCAG AGGCACTAAA GAGAAATGAC
TTAGAAGAGC TGGGAAGGCT TTTGGTAGAG GCAAATGATT CAATCAGGGA TTTATATGAA
GTTACCGGAA AGGAACTTGA CACAATGACT GCCGAAGCTA TGAAGGTTGA GGGAGTTTTA
GGTGCAAGAA TGACTGGTGC CGGATTTGGA GGATGTACAG TAAACATAGT TCCGGAGGAT
AAGGTTGATT TGTTTATTCA GCAAGTTGGC GAGAATTACA AAGAACAAAC TGGTATAACT
CCAGAGTTTT ATGTCAGTGA AATAAGTGAC GGAGCAAGAG AAATCAAGAT TTAA
 
Protein sequence
MQKNYDELKK KFCRIYGGSE EDLRIFSGPG RVNLIGEHID YCGGFVFPAA LSLDSTVIAR 
INNDNTLRVA ATDLPDRVEV ELDKLESAKS LKWGNYQAGV AFMLQDAGYR LVGVDMLFHD
TVPLGSGLSS SAAIELATAV TLVTLSNEVY GITKPIDMVE MAVLGQRTEN EFCGVSCGIM
DQFASAMGKK DHAILLDCGT LEYKYLPLKL DGYKIVLGNT KKKRALGESK YNERVRECAE
GLKILQKYLP NKRNLCDITV SEFEQYKSMI EDEVIKKRVT HVISENDRVL RAAEALKRND
LEELGRLLVE ANDSIRDLYE VTGKELDTMT AEAMKVEGVL GARMTGAGFG GCTVNIVPED
KVDLFIQQVG ENYKEQTGIT PEFYVSEISD GAREIKI
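
The sequences above are printed in blocks of ten characters. The following is a minimal sketch, assuming the blocks are separated only by spaces and newlines, of how one might strip that layout, compute GC content, and translate the gene. Translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as starts), so the standard assignments are used here. All function names are illustrative.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# amino acid assignments and differs only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    This sketch does not special-case alternative start codons
    (e.g. GTG or TTG); the gene listed here starts with ATG.
    """
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGCAGAAAA ATTATGATGA GCTAAAGAAA"
print(translate(clean_sequence(first_blocks)))  # MQKNYDELKK
```

Applied to the full gene sequence, the output of `translate` can be compared against the listed protein sequence, and `gc_fraction` against the listed GC content.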