Gene Ccel_0438 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_0438 
Symbol 
ID7309320 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp501546 
End bp503537 
Gene Length1992 bp 
Protein Length663 aa 
Translation table11 
GC content42% 
IMG OID643607368 
Productcarbon-monoxide dehydrogenase, catalytic subunit 
Protein accessionYP_002504800 
Protein GI220927891 
COG category[C] Energy production and conversion 
COG ID[COG1151] 6Fe-6S prismane cluster-containing protein 
TIGRFAM ID[TIGR01702] carbon-monoxide dehydrogenase, catalytic subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value5.30778e-07 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTTTA GATATCATCA CACAAAATTT GATCATTCAG AAAATCACCA TCATAACGAT 
TCAGGCTGCA ATGATTATGC AACTGCCGTA GCTGAATACA GAAAAAGTTT CGCTTCGAAG
AAGGAGGTTC TGGAACAGAC TCCGGACCCT GCCGTAAAAG CTATGCTTCT ATATATGGAG
GAGAAAGGAT GCGAGACTGT TTTCGACAGA TTTGACGCTC AGAAGCCACA GTGCGGCTTC
GGTCTTGCAG GAGTCTGCTG CAGAATCTGC AACATGGGGC CATGTAAAAT TACGAAAAAA
AGTCCTAAAG GTGTTTGTGG AGCTGATGCA GACGTTATTG TAGCACGAAA TATACTCAGG
AGTGTCGCTG CAGGTGCTGC AGAGCATGGT GCACGCGGAC GTGAAAGTAT GCTGGCATTG
AAGTATGCTT CTGAAGGCAG AATCAAATTT CCTATTGAGG GAGAAAACAA GGTGTTGGCC
ACAGCTAAAG CCTTTGGACT TGACACTGAG AATAAAAGCA TTAAGGAACT TGCAGGGTTA
ATTGCCGATA TTCTATTGGA GGACTTATCC AGAACTATTC CCGGGCCTCA CCATACACTT
AATGCATTTG CTTCAAAAGA ACGAATTAAT GTTTGGAGTA ATCTTGATAT TATTCCCATC
AGTCCTTACC ACGAAGTTTT CGAGAGTCTT AACAGAACAG GTACAGGAAA CGACAGTGAT
TGGGAAAATA TTATGAAACA GATGCTGCGT ACAGGTGTAG CCTTTGCATG GTCTTGTGTT
TTGGGTTCAT CAATCGCAAT GGACAGTTTG TTCGGACTAC CTGTAAGAAG TACTTCTAAG
GTTAATATCG GTGCTTTGGA AAAGGGTTAT GTTAACATAG GAATTCACGG CCACTCTCCG
GTTTTGGTAA GTGAAATTGT AAAGCTTGGT AATTCAGATG AATTTCAAGA ACTTGCAAGG
CAAAACGGTG CTTTGGGAAT TAAGTTTTAT GGTATTTGCT GTTCAGGACT TTCTGCTATG
TACAGATATG GCGGTGTTAT ACCCCTTTCT AATGCTATCG GAGCTGAGCT GGTTCTGGGA
ACTGGTGCTT TGGATTTATG GGTAGCTGAC GTTCAGGATG TTTTCCCTTC TATCATGGAG
GTTGCAAGCT GTTTTAAAAC AACTGTGGTT ACCACGAGTG ACTCGGCCAG ACTACCGGGA
GCCGAGCATT ACGGCTTTGA TCACCACCAC TCTAATATTG ACCAGACTAG CGAGCTTGCA
ACGAAGATAC TGAACAGGGC TATTGAGAGC TTTACCCAGA GAAGAGAGGT TCCAGTTTAT
ATTCCGCCAT ATGAAATTGA AGCGGAAGTC GGATTTTCCG TTGAATATAT TAACAGGCAT
TTTGGAAGTG TTAAGCCCAT TGCAGAAGCA TTAAAAAATG GTGAAATCCT TGGGATTGTT
AACCTGGTTG GGTGTAACAA CCCTCGTATT GTCTATGAAA AGGCCATAGT TGAGCTTACA
GACATCCTTT TAGAAAATAA TGTTCTGGTA TTAACCAACG GGTGTGCTTC CTTTCCTCTC
ATGAAACTTG GATATTGCTC TACTGCTGCT TTGGAGCGTA CAGGTGATAA TTTGAAGGGT
TTTTTAAAGG ATCTTCCTCC TGTATGGCAT ATGGGCGAAT GTTTGGATAA TGCAAGAGCA
TCCGCACTTT TTAAGGCTGT TGCTGAGACT TCTGATATTT ACATCAAGGA TATGCCTTAT
GCTTTTGCTA GTCCGGAATG GTCTAACGAA AAAGGTATAT GTGCAGCGTT GAGTTTCAGA
TTGTTGGGTA TTGATTCTTA TCACTGTGTA TATGCTCCAA CTCAAGGTTC AGATAATGTT
ACAGAATTTA TGAGTAATGG AACTAAAGAT ATTCTGGGTT CAAGGATGAT CGTCAATGTC
AATCACATCG AATTGGCAAA TGAGATTGTA AATGACCTTA AAGTACAGAG AAAAGCTTTA
GGCTGGGATT AG
 
Protein sequence
MNFRYHHTKF DHSENHHHND SGCNDYATAV AEYRKSFASK KEVLEQTPDP AVKAMLLYME 
EKGCETVFDR FDAQKPQCGF GLAGVCCRIC NMGPCKITKK SPKGVCGADA DVIVARNILR
SVAAGAAEHG ARGRESMLAL KYASEGRIKF PIEGENKVLA TAKAFGLDTE NKSIKELAGL
IADILLEDLS RTIPGPHHTL NAFASKERIN VWSNLDIIPI SPYHEVFESL NRTGTGNDSD
WENIMKQMLR TGVAFAWSCV LGSSIAMDSL FGLPVRSTSK VNIGALEKGY VNIGIHGHSP
VLVSEIVKLG NSDEFQELAR QNGALGIKFY GICCSGLSAM YRYGGVIPLS NAIGAELVLG
TGALDLWVAD VQDVFPSIME VASCFKTTVV TTSDSARLPG AEHYGFDHHH SNIDQTSELA
TKILNRAIES FTQRREVPVY IPPYEIEAEV GFSVEYINRH FGSVKPIAEA LKNGEILGIV
NLVGCNNPRI VYEKAIVELT DILLENNVLV LTNGCASFPL MKLGYCSTAA LERTGDNLKG
FLKDLPPVWH MGECLDNARA SALFKAVAET SDIYIKDMPY AFASPEWSNE KGICAALSFR
LLGIDSYHCV YAPTQGSDNV TEFMSNGTKD ILGSRMIVNV NHIELANEIV NDLKVQRKAL
GWD