Gene Ccel_0695 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_0695 
Symbol 
ID7309554 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp798781 
End bp799971 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content40% 
IMG OID643607634 
Productaldo/keto reductase 
Protein accessionYP_002505054 
Protein GI220928145 
COG category[R] General function prediction only 
COG ID[COG1453] Predicted oxidoreductases of the aldo/keto reductase family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.994792 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTAACAA GGATAAATCA AAAGAACGGA GAAGAACTTT CTATTCTAGG TTTTGGGTGT 
ATGAGGTTTC CCACCAGGGC AGGCGGGATA GACGAACCGA GAGCTATTAG GATGATACGT
TATGCTATAG AAAAGGGAAT TAATTATTTT GATACGGCTT ATATTTACCA TGGAGGGAAA
AGTGAAAGCC TTCTGGGAAA AGCTTTAGCC GGAGGTTTTC GTGAAAAAGT CAAAATAGCT
ACAAAACTGC CTTCTTTTAT GGTAAAGAAT CTTGATAATG CAAAAAAAAT ATTCAATACA
CAATTGGAAC GGCTTCAAAC GGACTATATT GACTATTATC TGCTACATAT GCTTACAGAC
AAAGCAGGCT TCGACAGACT TGCAGATATG GGAGTATTGA CGTGGATGGA AGAGCTTAAA
GAAAAAGGTA CCATAAAAAA TATCGGGTTT TCCTTTCACG GAGCTAAAAT TGAATTCGAA
CAGATTCTCA AGGCGTACCC TTGGGAGTTC TGCCAGATAC AATATAATTA TATGGACGAA
AATAATCAGG CCACAAAGGA CGGATTATTA CTGGCTAATG ATATGGGAAT ACCTGTTATT
GTCATGGAAC CTTTAAGAGG AGGAAAGCTG GTTACAAACC TGCCGGAAGA CGTAATAAAG
GCATTCGCAG AATGCGACCG TGACAGGTCT CCGGCAGAAT GGGCCTTGAG GTGGATTTGG
AATCATCCTC AGGTAAACGT GGTTCTGTCT GGGATGAGCG ATGAGGCACA GGTTGAAGAT
AATATAAGGA TAGCCTCAGA TTCGCATGCC AATTCCCTTA CGGATGAAGA ACTTGGTGTC
TTTGATAATG TCAAAAGGAT ATTACATGAA AGGACAAAAA TACCATGTAC GGCTTGCGGC
TACTGTATGC CGTGTCCTGC AGGAGTTGAC ATACCGGGCT GCTTTTCACA TTATAACGAT
AAGTACCTTA TTAAAGATAA AGGAACAAGA TTTCGGTATT ATCGGAACTT AGGAGCAGTA
GCAGCACAGC CTTCCTATGC TTCACAGTGT AAAGACTGCG GGAAATGTGA AAGTCATTGT
CCTCAAAAAA TAAGTATACG TTCCGAGCTT AAAACTGTCA GTAAGGAAAT GGAAAGTGTA
TTTTATAAAG CAGGAATAGC AATTGCAAGG AAATTTATGA AAATCAAATA G
 
Protein sequence
MLTRINQKNG EELSILGFGC MRFPTRAGGI DEPRAIRMIR YAIEKGINYF DTAYIYHGGK 
SESLLGKALA GGFREKVKIA TKLPSFMVKN LDNAKKIFNT QLERLQTDYI DYYLLHMLTD
KAGFDRLADM GVLTWMEELK EKGTIKNIGF SFHGAKIEFE QILKAYPWEF CQIQYNYMDE
NNQATKDGLL LANDMGIPVI VMEPLRGGKL VTNLPEDVIK AFAECDRDRS PAEWALRWIW
NHPQVNVVLS GMSDEAQVED NIRIASDSHA NSLTDEELGV FDNVKRILHE RTKIPCTACG
YCMPCPAGVD IPGCFSHYND KYLIKDKGTR FRYYRNLGAV AAQPSYASQC KDCGKCESHC
PQKISIRSEL KTVSKEMESV FYKAGIAIAR KFMKIK