Gene Ccel_2004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_2004 
Symbol 
ID7310714 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp2366423 
End bp2367685 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content42% 
IMG OID643608938 
Product3,4-dihydroxy-2-butanone 4-phosphate synthase 
Protein accessionYP_002506331 
Protein GI220929422 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0108] 3,4-dihydroxy-2-butanone 4-phosphate synthase
[COG0807] GTP cyclohydrolase II 
TIGRFAM ID[TIGR00505] GTP cyclohydrolase II
[TIGR00506] 3,4-dihydroxy-2-butanone 4-phosphate synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.012993 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTTTA GTTCAATAGA AGAAGCAATA GAGGATATCC GCCAAGGTAA AATTATTATA 
GTAGTAGATG ATGAGGACAG GGAGAACGAG GGAGACCTTC TTATGGCTGC CGAAAAGGCT
ACTCCCGAAA GCATAAATTT TATGGCTACC TATGGTAAGG GCATGATATG CGTTCCTCTG
ACATCAGCTA GAGCGGGAGA GTTGGAACTT TTCCCCATGG TAAGTCACAA TGAGGACCGT
CATGGTACGG CGTTTACGGT AACTGTGGAT CACAGAGATT CTACAACAGG TATTTCAGCT
TTCGAAAGAG CACATACAAT AGTTGAGCTT ACTAATAAAA AAGCACATCC GGGCGATTTT
AAAAGGCCGG GGCATGTATT TCCTCTTACT GCAAGGGACG GAGGAGTTCT TAAGCGTACA
GGACACACTG AAGCCGCAGT TGATCTGGCC CGTATGGCTG GTCTGTATCC CGCTGGTGTA
ATATGTGAAA TAATGAATGA TGACGGAAGG ATGGCAAGGG TTCCACAATT AATGGAGTTT
TCCCAAAAGC ATGGCTTAAA GATAATAACG GTAGCAGGTC TTATTGAATA TCGCAGAAAA
AATGAAAAGT TGATTAAAAG AGCTGCGGAA GCAAAAATGC CCACTGCTTA TGGAGAATTT
AAAATAATTG GTTATGAGAA TACTACCAAT GGGGAGCACC ATGTAGCACT TGTCAAGGGA
GATGTAGCAG GCTCAACAGA CCCTGTTCTG GTCAGAGTAC ATTCCGAATG TCTCACAGGT
GATGCTTTTC ATTCACAAAG GTGTGACTGC GGAGAACAGC TTGAAGCCGC ATTGAGCAGA
ATCAACAATG AAGGAAAAGG GGTTTTGCTT TATATGCGTC AGGAGGGAAG GGGCATCGGT
CTGATAAATA AAATACGTGC ATATGAGCTT CAAGACCAAG GTATGGATAC TGTCGAAGCA
AATATAAAGC TGGGCTTTCC GGCAGATTTG AGAGAATACG GCATAGGTGC TCAAATCTTG
TACGATTTAG GAATAAAGAA AATAAAGCTG CTGACTAACA ACCCCAAAAA ACTGGTTGGG
CTAAATGGGT ATGGTCTGGA GGTAGTCGGA CGAGAATCTA TTCAAATAAA AGAAAATGAA
AAAAATGAAT TTTATCTGAG AACAAAAAAG GAAAAAATGG GCCACTTGTT TGATGGTCTG
AACAATAAAA CAAGCGAGAA AAATACAACA GCACATCAGG AGGAAAATAA AAATGTCGAT
TAA
 
Protein sequence
MNFSSIEEAI EDIRQGKIII VVDDEDRENE GDLLMAAEKA TPESINFMAT YGKGMICVPL 
TSARAGELEL FPMVSHNEDR HGTAFTVTVD HRDSTTGISA FERAHTIVEL TNKKAHPGDF
KRPGHVFPLT ARDGGVLKRT GHTEAAVDLA RMAGLYPAGV ICEIMNDDGR MARVPQLMEF
SQKHGLKIIT VAGLIEYRRK NEKLIKRAAE AKMPTAYGEF KIIGYENTTN GEHHVALVKG
DVAGSTDPVL VRVHSECLTG DAFHSQRCDC GEQLEAALSR INNEGKGVLL YMRQEGRGIG
LINKIRAYEL QDQGMDTVEA NIKLGFPADL REYGIGAQIL YDLGIKKIKL LTNNPKKLVG
LNGYGLEVVG RESIQIKENE KNEFYLRTKK EKMGHLFDGL NNKTSEKNTT AHQEENKNVD