Gene Ccel_2233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_2233 
Symbol 
ID7310918 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp2612296 
End bp2614089 
Gene Length1794 bp 
Protein Length597 aa 
Translation table11 
GC content41% 
IMG OID643609165 
ProductRespiratory-chain NADH dehydrogenase domain 51 kDa subunit 
Protein accessionYP_002506555 
Protein GI220929646 
COG category[C] Energy production and conversion 
COG ID[COG1894] NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit
[COG3411] Ferredoxin
[COG4231] Indolepyruvate ferredoxin oxidoreductase, alpha and beta subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAATTAT ATAGAGCACA TGTGCTGGTC TGTGCAGGTA CGGGTTGTAC ATCATCAAAT 
TCCTTAAAGA TAATAGATGA AATGGAATCA TTACTTGTAA GCAACAGGCT GGACAGTGAA
GTTCAAATAG TAAAAACCGG TTGTTTCGGT CTTTGTGCTG AGGGACCGAT AGTGGTTGTA
TATCCTGAAG GAGCAATGTA CACAAGAGTT GAGATTTCTG ATGTAAAAGA GATAGTCGAG
GAACATCTTC TGAAAGGACG TATAGTCAAG CGTTTATTAG CCGGCGAAAA GGAAGCAGAT
GACATTTCCA AGTCTCTTGA AGGTGTTGAT TTCTTTAATA GGCAGATGCG TATTGCATTA
AGAAACTGCG GACGAATAAA CCCTGAGGAT ATCAACGAGT ACATAGCTTT TGACGGATAC
AAGGCCCTTG AAAAAGTATT GACTGAAATG ACTCCAGAAG CTGTTATAGA TACAATGAAG
AAGTCAGGCC TCAGAGGAAG AGGGGGCGGA GGCTTCCCTA CAGGCGTGAA GTGGGAGTTT
GCAGCAAAAC AGACAGCAGA TCAGAAATAT GTTTGCTGTA ATGCAGACGA AGGTGATCCT
GGAGCATTTA TGGACAGAAG TGTTCTTGAA GGTGACCCTC ACTCAGTAAT AGAAGCAATG
TCAATAGCGG GTTTTGCTAT CGGAGCAACC CAAGGTTTTA TATATGTAAG AGCTGAGTAT
CCTATAGCAG TTAAAAGATT GCAGTTAGCA ATTGATGAGG CTAAAGAATA CGGAATTTTA
GGTGACAATG TACTGGGAAC AGGACATAAA TTCAATTTGG AAATAAGACT TGGTGCGGGA
GCATTTGTTT GTGGTGAAGA AACTGCACTT ATGACGTCAA TAGAAGGCCA CAGAGGCGAA
CCAAGACCAA GACCTCCATT CCCGGCAGTA AAGGGTTTAT GGGAAAAACC TACAATTCTT
AACAATGTAG AAACCTATGC AAATGTACCT GTTATTATTT TGAAGGGTGC AGAATGGTTT
TCAGGGATAG GAACTGAAAA GAGTAAGGGA ACAAAGGTAT TTGCTCTGGG AGGAAAGATA
AACAATACCG GATTGGTTGA GGTTCCTATG GGTACTACCC TGAGAGAAGT AGTTTATGAC
ATAGGCGGCG GAATACCTAA CGGCAAGAAG TTTAAGGCAG CACAAACAGG AGGGCCTTCC
GGCGGTTGTA TTCCTGCTTC TCATCTGGAT ACACCTATTG ACTATGATTC GCTTATTGCA
CTTGGATCTA TGATGGGTTC CGGCGGACTT ATAGTAATGG ACGAGGACAA CTGTATGGTT
GATATTGCCA AATTCTTCCT TGAATTTACA GTTGATGAAT CCTGCGGAAA ATGTCCTCCA
TGTCGTATTG GAACAAAACG TATGCTTGAA ATATTGGAAA GAATAACTGA AGGTAAGGGT
GAAGCCGGAG ATATTGAAAA GCTTGAATTG TTGGCTAAAA ATATTAAGGC ATCTGCTCTG
TGCGGACTTG GACAGACTGC TCCAAACCCG ATTCTGAGTA CGTTGAAATA TTTCAGAGAC
GAGTATGAGG CTCACGTATT TGACAAAAAA TGTCCTGCGG GTGTTTGTAA ATCAATGATG
AAATATACTG TTGATGCAAG CAAGTGTAAG AGCTGCGGAA TCTGTGCTAA AGCCTGTCCA
ATGGGATGTA TAAAGGGAGA AAAGAAGGTT CCTTACGTTA TTGATAATTC AAAATGTGCT
AAATGCGGTG TTTGTATAGA AAAATGTCCA TTCAAGGCAA TTTCAAAGGG ATAA
 
Protein sequence
MQLYRAHVLV CAGTGCTSSN SLKIIDEMES LLVSNRLDSE VQIVKTGCFG LCAEGPIVVV 
YPEGAMYTRV EISDVKEIVE EHLLKGRIVK RLLAGEKEAD DISKSLEGVD FFNRQMRIAL
RNCGRINPED INEYIAFDGY KALEKVLTEM TPEAVIDTMK KSGLRGRGGG GFPTGVKWEF
AAKQTADQKY VCCNADEGDP GAFMDRSVLE GDPHSVIEAM SIAGFAIGAT QGFIYVRAEY
PIAVKRLQLA IDEAKEYGIL GDNVLGTGHK FNLEIRLGAG AFVCGEETAL MTSIEGHRGE
PRPRPPFPAV KGLWEKPTIL NNVETYANVP VIILKGAEWF SGIGTEKSKG TKVFALGGKI
NNTGLVEVPM GTTLREVVYD IGGGIPNGKK FKAAQTGGPS GGCIPASHLD TPIDYDSLIA
LGSMMGSGGL IVMDEDNCMV DIAKFFLEFT VDESCGKCPP CRIGTKRMLE ILERITEGKG
EAGDIEKLEL LAKNIKASAL CGLGQTAPNP ILSTLKYFRD EYEAHVFDKK CPAGVCKSMM
KYTVDASKCK SCGICAKACP MGCIKGEKKV PYVIDNSKCA KCGVCIEKCP FKAISKG