Gene Ccel_3221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_3221 
Symbol 
ID7311802 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp3757000 
End bp3758295 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content38% 
IMG OID643610123 
Producthexokinase 
Protein accessionYP_002507491 
Protein GI220930582 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG5026] Hexokinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGTCCA AATTAGAAAT AGTACAGGAT GTTATTAATG CTTTTGAGGT TAATAAAGAA 
AGTATGCTGC GTACTGCAAT GCTGTTTAAG GAAACTATGG AAAAAAGTCT GAATGGCGAA
AAAACCTGCC TTAAAATGCT TCCATCATAT ATTGGAAAGC CTACGGGAAA AGAACAGGGT
ACTTTTATGA CCATCGACAT GGGTGGCACT AATTTCAGAT GCACAAAGTA TAAAATTAAC
AATGGCAACT TTGAGAAAGT TGGTGAAATA AAACAGAAGC TAATTAATAA GGAAAAGAAT
TATGACCTTA CAAAGTCAGA TTCAGATGAA AAGCAGCTGT TTGGATTTAT GGCTGAATGC
ATAGGAGAGT TGCTAGAGCC GGAAGAATCT TTATATCTTG GAAACACGTT TTCATTCCCA
TGCAGACAGG AAGGAATCAA TGATGCTTAT CTTATTCAGT GGACCAAGGA AATAACAACT
TCAGGCGTTG TAGGCCAGAA TATTAATAAG CTTCTTGAAC AGTCATTAAA GGAGAAAAAT
ATAAATGTTA AGCCCGTCGC CATACTGAAT GATACCGTAG GTACTTTACT GGTAGCAATG
TACAGTTATC AGACGGCGGA TATAGGATCT ATAATGGGTA CAGGGCATAA CACATGTTAT
CTGGAGAACA ATCATCCTCT TAATGGTCAA AAGATGATTG TAAACATAGA ATCGGGCAAC
TATAATGTGG GACTTCCCGT AACCAAGTAC GATGAGATAA TAGATAAAAA CAGTCAAATA
CCGGGAGCAC AGCTCCTTGA AAAAATGGTT TCCGGTTACT ACATGGGGAG CCTTCTGAAA
GAGGTTTGTA AGGATCTCTA CAAAAATAAT GCATTGTTTA CAAATGAGGA TGTTGATATA
GATGCGTTTT TCAATCAGAA CTTCAACGCA TTGATGGTAG AGAATTTCAT TTTGTATCCT
TCCAATACAA AAGAGCAATA TAAATGTTCC ATTGAAGATG CGGAGATAGT AAAGAGGGTA
TCCGAAGCCA TATTGAAAAG AACAGTGAGA CTGGTAGCGG TATCACACAT GGGAATACTT
TTTCACCAGG AAAACAGCGG TACTTCTGTC AATAATGAGC ATGTAATTGC AATAGACGGA
ACAATATATG AAAAAATGCC CAATGCTCCC CAGCTTATGA AGGAGGCATT CAGGGAGGCA
CTTGGAGATG ACGCATCCAA TATTGAAATA AGACTTGTAA AGGATGGTTC AGGCCTTGGT
GCTGCAATAG CTGCTGCGTT TGCAGTAACA CAATAG
 
Protein sequence
MGSKLEIVQD VINAFEVNKE SMLRTAMLFK ETMEKSLNGE KTCLKMLPSY IGKPTGKEQG 
TFMTIDMGGT NFRCTKYKIN NGNFEKVGEI KQKLINKEKN YDLTKSDSDE KQLFGFMAEC
IGELLEPEES LYLGNTFSFP CRQEGINDAY LIQWTKEITT SGVVGQNINK LLEQSLKEKN
INVKPVAILN DTVGTLLVAM YSYQTADIGS IMGTGHNTCY LENNHPLNGQ KMIVNIESGN
YNVGLPVTKY DEIIDKNSQI PGAQLLEKMV SGYYMGSLLK EVCKDLYKNN ALFTNEDVDI
DAFFNQNFNA LMVENFILYP SNTKEQYKCS IEDAEIVKRV SEAILKRTVR LVAVSHMGIL
FHQENSGTSV NNEHVIAIDG TIYEKMPNAP QLMKEAFREA LGDDASNIEI RLVKDGSGLG
AAIAAAFAVT Q