Gene Ccel_0302 details

Gene Information

Locus tag: Ccel_0302
Symbol:
ID: 7309194
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 348359
End bp: 350017
Gene Length: 1659 bp
Protein Length: 552 aa
Translation table: 11
GC content: 43%
IMG OID: 643607232
Product: dihydroxy-acid dehydratase
Protein accession: YP_002504669
Protein GI: 220927760
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
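
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length; neither convention is stated in this record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(348359, 350017)
print(length)                  # 1659
print(protein_length(length))  # 552
```

Both results agree with the Gene Length and Protein Length fields of the record.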


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAAGTG ATATAGTAAA AAAAGGTATA GAAAAAGCAC CCCATAGATC ATTATTTAAG 
GCTATGGGTT ATACCGATGA AGAATTAGAA AGGCCTTTAA TCGGAGTGGC AAACTCAAAA
AGTGAAATAA TACCCGGACA CATACATCTT GATAAATTAA CAGAAGCTGT AAAAGCCGGA
ATCAGGATGG CAGGCGGTAC ACCGATAGAA TTCGGTGCAA TAGGTGTATG CGATGGGATA
GCAATGGGGC ATACAGGGAT GAAATACTCT CTGGCAACTA GAGAACTAAT CGCCGATTCA
TGTGAAGCAA TGAGCAAGGC CCACAGCTTT GATGGAATGG TTTTCATTCC CAACTGTGAC
AAGATTGTAC CAGGCATGCT GATGGCTGCA GCAAGAATAA ATATTCCATC CATAGTTATC
AGTGGCGGGC CAATGCTTTC CCTTAACAGG GATGGTAAAC AGCTTGATCT CAACAGTCTG
TTTGAAGCGG TTGGTTCATA TAAAGCAGGA ACGATGACAA AGGAAGAAGT GGATGATATT
GAAGACCACG CATGTCCTGG CTGCGGTTCA TGCTCGGGGA TGTTTACGGC AAATTCCATG
AACTGCCTTA CAGAAGTCCT CGGTATGGGG CTTACAGGAA ACGGAACAAT ACCTGCCGTG
TACGCAGAGC GTATAAGACT GGCAAAGTAT GCAGGAATGA AAATAATGGA GCTGGTTGAA
AAAGACATTA AACCTTCAGA CATACTCACA AATGAAGCTT TTGAAAATGC ATTAACTGTG
GATATGGCAC TTGGTTGTTC AACAAACTCA GTACTTCATC TTCCTGCTAT TGCAAATGAA
TTAGGAATAG AGATAAACCT AGATATTATT AATGAAATCA GCTCAAGGAC TCCGAATCTG
TGTAAGTTGG CTCCGGCCGG AAAATATCAT ATACAGGATT TATACAGTGC AGGCGGGGTT
CAGGCCGTTA TGAGTGAGCT GGCAAAAAAA GATCTGCTTC ACCTTGATTT AGTTACGGCA
ACAGGTAAAA CTATAAGAGA AAATATTCAG AATGCAAAAG TAAAGGACTA TGAAATAGTT
AAAAGCATAG ATACACCATA CAGTGCTACC GGAGGGATAG CTGTATTAAG GGGTAATATT
GCACCTGATG GAGCAGTAGT CAAAAAGTCG GCTGTAGCTG AAAAGATGCT GATTCACACG
GGGCCTGCAA GAGTATTTGA CAGTGAGGAT GAAGCAATTA CGGCTATCTA TAGCGGGCAG
ATAAATAAAG GTGATGTAGT AATTATACGT TACGAAGGCC CCAAGGGGGG GCCGGGTATG
AGAGAGATGC TTAGCCCTAC ATCCGCTATT GCGGGTATGG GACTGGACAG CGATGTTGCA
CTAATCACAG ACGGTAGGTT TTCAGGTGCA TCCAGAGGTG CATCAATTGG TCATGTATCA
CCTGAGGCAA TGGAGGGCGG CCCAATAGCA CTGGTTCAGG AAGGTGATAT TGTAGATATC
GACATACCTG CAGGACGCAT AAATATTCAG GTAACCAATG AAGAAATGGT AAAGCGTAAA
GAGTCATGGA AAGCTCCAAA GCCCAAGATA ACCACAGGAT ATCTTGGCAG ATATGCCAGA
CTGGTTACCT CTGCAAGTAC AGGAGCAGTC CTAAAGTAA
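
The gene sequence above is printed in blocks of ten bases, so the whitespace has to be removed before the length or GC content can be computed. The following is a minimal sketch of that cleanup; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the last line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Strip spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Sample input: the final line of the gene sequence shown above.
tail = clean_sequence("CTGGTTACCT CTGCAAGTAC AGGAGCAGTC CTAAAGTAA")
print(len(tail))                       # 39
print(round(gc_fraction(tail) * 100))  # GC percentage of this line only
```

Passing the full sequence to the same functions gives values that can be compared with the Gene Length and GC content fields.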
 
Protein sequence
MRSDIVKKGI EKAPHRSLFK AMGYTDEELE RPLIGVANSK SEIIPGHIHL DKLTEAVKAG 
IRMAGGTPIE FGAIGVCDGI AMGHTGMKYS LATRELIADS CEAMSKAHSF DGMVFIPNCD
KIVPGMLMAA ARINIPSIVI SGGPMLSLNR DGKQLDLNSL FEAVGSYKAG TMTKEEVDDI
EDHACPGCGS CSGMFTANSM NCLTEVLGMG LTGNGTIPAV YAERIRLAKY AGMKIMELVE
KDIKPSDILT NEAFENALTV DMALGCSTNS VLHLPAIANE LGIEINLDII NEISSRTPNL
CKLAPAGKYH IQDLYSAGGV QAVMSELAKK DLLHLDLVTA TGKTIRENIQ NAKVKDYEIV
KSIDTPYSAT GGIAVLRGNI APDGAVVKKS AVAEKMLIHT GPARVFDSED EAITAIYSGQ
INKGDVVIIR YEGPKGGPGM REMLSPTSAI AGMGLDSDVA LITDGRFSGA SRGASIGHVS
PEAMEGGPIA LVQEGDIVDI DIPAGRINIQ VTNEEMVKRK ESWKAPKPKI TTGYLGRYAR
LVTSASTGAV LK
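
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is one way to do that, under these assumptions: it uses the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not handle); it stops at the first stop codon; and the name `translate` is illustrative only.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace allowed) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # KeyError on non-ACGT characters
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence shown above.
print(translate("ATGAGAAGTG ATATAGTAAA AAAAGGTATA"))  # MRSDIVKKGI
print(translate("CTAAAGTAA"))                         # LK
```

The two outputs match the first ten and last two residues of the protein sequence listed above.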