Gene Ccel_2304 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_2304 
Symbol 
ID7310981 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp2687178 
End bp2689049 
Gene Length1872 bp 
Protein Length623 aa 
Translation table11 
GC content42% 
IMG OID643609233 
ProductRespiratory-chain NADH dehydrogenase domain 51 kDa subunit 
Protein accessionYP_002506621 
Protein GI220929712 
COG category[C] Energy production and conversion 
COG ID[COG1894] NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit
[COG3411] Ferredoxin
[COG4231] Indolepyruvate ferredoxin oxidoreductase, alpha and beta subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATTA ATAATTTGCA AGATTTAAAA GCCTTTTCTG ACAGGTCGGT ACAGGCAATG 
GACAAACAAA AGAAGAAGGT TCTGGTGTGT GCGGGAACAG GTTGTGTAGC CGGCGGTTCA
TTGGAAATAT ACAACAGAAT CAAAGAGCTT ATAAATGAAA AAGGCCTTCT GGTTGATCTT
GAACTGGATT ATGAAAAGGA AGGCATAGGA GTAAAAAAAA GCGGTTGCCA CGGCTTCTGC
GAAATGGGGC CATTGGTAAG AATAGAACCG GAAAACTACT TGTATTTGAG AGTTCAGATA
GAAGATTGTG AAGAAATAGT TAATAAAACC CTTATAAATA ATGAAGCTGT CGAAAGACTT
ATGTATACAG ATGACGACAA GGTATACAAG GCACAGGAGG ACATTCCCTT CTACCGCAAG
CAGACAAGGA TGGTTCTGAA AAACTGCGGA AGCATCAATG CCGAATCCTT TTCAGAATAT
GTAGCAAAGG GTGGTTATAA GGCATTGGGC AAGGTATTGT TTGAAATGCA GCCTCAGGAG
GTCTGTCAGT CAATACTGGA TTCCAACCTG AGAGGAAGAG GGGGCGGAGG ATATCCGTCA
GGTGCAAAGT GGAAACAGGT ATTGAAACAG GACAGTGAAG TAAAATATGT TGTCTGTAAT
GGTGACGAAG GTGACCCGGG AGCTTTTATG GACAGAAGTA TCATGGAGGG TGACCCTCAC
GGTGTAATAG AAGGCATGAC TATTGCAGGA TATGCAACCA AAAGTACTAT CGGATATATA
TACGTTAGGG CTGAATACCC ACTTGCTGTA GAAAGATTGA AGATAGCTAT TGAAGATGCC
AGAAAACACG GGATGTTGGG CGAAAACATA CTTAGTTCAG GATTTTCATT TGACATAAAC
ATCAACATGG GTGCAGGAGC CTTCGTCTGC GGTGAGGGGA GTGCTTTAAC TGCTTCAATA
GAAGGCGAAA GAGGAATGCC CAGAGTAAAG CCTCCAAGAA CCGTTGAAAA AGGACTTTGG
GAAAAGCCGA CTGTACTTAA CAATGTAGAA ACATATGCAA ACGTTCCGTT GATTATAAAT
AACGGCAGTG AATGGTACAA AAGTATCGGA CCCGAGAATA GTCCGGGAAC AAAGGCATTT
GCCATAACAG GAAACGTTAA CCATACCGGA TTGATAGAAG TCCCGATGGG AACTACCTTG
AGAGAAGTAA TATTTGATAT CGGTGGCGGA ATAAAGAACG GCAAAAAATT CAAAGCAGTT
CAGATAGGGG GCCCTTCAGG AGGGTGTCTC ACAGAAGAGC ATTTGGATTT GCCACTGGAC
TTTGATTCTC TTAAAAGAGT TGGAGCAATG ATTGGGTCAG GTGGACTCGT TGTAATGGAT
GAAGACACCT GTATGGTTGA AGTTGCAAGG TTCTTTATGA ACTTTACACA GAACGAATCC
TGCGGAAAAT GCGTGCCATG CCGTGAGGGA ACCAAGAGAA TGCTTGAAAT CCTTGAAAAA
ATAGTAAACG GAAAAGGTAG TAAAGAAGAC TTAGATTTGC TTGAGGAACT GGCTGATACC
ATAAGCAGTA CGGCGTTATG CGGACTGGGC AAGTCGGCAG CAAGTCCTGT AGTTAGTACA
TTAAAATATT TCAGGGACGA ATACTTGGAA CACGTTGTAG ATAAGAAGTG CAAAACACAT
ACATGTAAGT CTTTAGCTTC TATAGTAATA GAAAAGGAAA AATGCAAGGG CTGTAGCAAA
TGTGCCAGAA TATGTCCAGT TCAGGCAATC GAGGGAAAGA TTAAGGAGCC GTATACAGTT
AATCAGTCAA AATGTATCAA GTGCGGCGCT TGTCTGGAAG TATGTCCATT TGCTGCAATA
AAGGAGGCGT AA
 
Protein sequence
MKINNLQDLK AFSDRSVQAM DKQKKKVLVC AGTGCVAGGS LEIYNRIKEL INEKGLLVDL 
ELDYEKEGIG VKKSGCHGFC EMGPLVRIEP ENYLYLRVQI EDCEEIVNKT LINNEAVERL
MYTDDDKVYK AQEDIPFYRK QTRMVLKNCG SINAESFSEY VAKGGYKALG KVLFEMQPQE
VCQSILDSNL RGRGGGGYPS GAKWKQVLKQ DSEVKYVVCN GDEGDPGAFM DRSIMEGDPH
GVIEGMTIAG YATKSTIGYI YVRAEYPLAV ERLKIAIEDA RKHGMLGENI LSSGFSFDIN
INMGAGAFVC GEGSALTASI EGERGMPRVK PPRTVEKGLW EKPTVLNNVE TYANVPLIIN
NGSEWYKSIG PENSPGTKAF AITGNVNHTG LIEVPMGTTL REVIFDIGGG IKNGKKFKAV
QIGGPSGGCL TEEHLDLPLD FDSLKRVGAM IGSGGLVVMD EDTCMVEVAR FFMNFTQNES
CGKCVPCREG TKRMLEILEK IVNGKGSKED LDLLEELADT ISSTALCGLG KSAASPVVST
LKYFRDEYLE HVVDKKCKTH TCKSLASIVI EKEKCKGCSK CARICPVQAI EGKIKEPYTV
NQSKCIKCGA CLEVCPFAAI KEA