Gene Ccel_2232 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_2232 
Symbol 
ID7310917 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp2610529 
End bp2612277 
Gene Length1749 bp 
Protein Length582 aa 
Translation table11 
GC content42% 
IMG OID643609164 
Producthydrogenase, Fe-only 
Protein accessionYP_002506554 
Protein GI220929645 
COG category[R] General function prediction only 
COG ID[COG4624] Iron only hydrogenase large subunit, C-terminal domain 
TIGRFAM ID[TIGR02512] hydrogenases, Fe-only 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAACTG TAAACATTAC AATAGACGGT AAGAAACTTC AGGTTGAACA AGGCATAACA 
ATTTTAGAGG CAGCCAGACA GGCTAATATA AAGATACCTA CACTTTGCTT CCTCAAGGAT
ATCAATGAAA TAGGTGCATG TAGGATGTGT CTGGTTGAAA TAAAGGGTGC GAGAGCGTTG
CAAGCATCTT GCGTATATCC TGTTGCAGAA GGACTTGAAA TTTATACTCA GAGTCCGGAT
GTCAGAGAGG CAAGAAAGGT AACCTTGGAG CTTATCCTTT CCAACCACGA CAAGAAATGT
CTTACCTGTG TAAGAAGCAA AAACTGCGAG CTGCAAAACT TGGCTGAGGA ATTAAATATA
AAAGATATAA GATTTGAAGG TGCATCAATT GATCTTCCTT TAGATGATTT TTCACCTTCA
ATTGTCAGAG ACCCAAATAA ATGCGTGCTG TGCAAACGCT GTGTAAGTAT GTGTAAAAAC
ATACAGACAG TTTCTGTTAT CAGTGCAGCT GAGAGAGGTT TCAAATCAAC GATTTCCTGT
GCATTTGACA GGTCATTGGA AGAGGTACCG TGTACAATGT GCGGACAGTG TATCAGTGTT
TGTCCTGTAG GAGCTTTAAG AGAGAAAGAT GACACTGACA AGGTATGGTC TGCTTTGGCT
GACAAGGAAC TTCATGTTGT AGTACAGACA GCTCCTGCTG TTCGTGTTGC TCTGGGTGAG
GAGTTTGGGC TTCCTATAGG GACCAGAGTT ACGGGGAAAA TGGCTGCTGC CTTGAACCAC
CTGGGTTTTG CCAAAGTATT TGACACAGAT ACTGCTGCTG ATCTTACAAT TATGGAAGAA
GGCACCGAGC TCCTAAACAG AATCAAAAAC GGTGGAAAGC TTCCTGTTAT AACTTCTTGT
AGTCCGGGAT GGATAAAGTT CTGTGAGCAC AACTACCCTG AATTCCTTGA AAACCTATCA
TCTTGTAAAT CACCACATGA AATGTTCGGT GCGGTGCTGA AAACTTACTA TGCTGAAAAG
ATGGGTATCG ACCCTAAAAA AATATTTGTA GTGTCCGTAA TGCCATGTAC CGCAAAGAAG
TTTGAAGCAC AAAGACCAGA GCTTTCCGCA ACAGGCTTGC CTGATGTTGA CGTAGTTATA
ACTACCAGAG AGCTTGCAAG AATGATAAAG GAAGCAGGTA TCGATTTTAA TAATCTTGAG
GACATGGATT TTGACGACCC AATGGGTAAC GCAACAGGAG CCGGCGTAAT ATTCGGTGCA
ACCGGGGGAG TTATGGAAGC AGCTCTCAGA ACAGTATCTG AGATAGTTGC AGGAAAATCC
TTTGAAGATA TTGAATATAC TGCTGTAAGA GGTATAGAGG GTATCAAGGA AGCAACAGTT
GCTATAGGTG ACATGAAGGT TAAAGCGGCT GTAGCAAATG GTCTCGGCAA CGCAAGGAAG
CTCCTTGACA GTATAAAAGC AGGAGAAGCA GCATATGACT TCGTTGAAAT AATGGCTTGT
CCGGGCGGTT GTGTAAACGG CGGAGGACAG CCAATACAAC CTTCTTCTGT AAGAAGCTGG
ACTGACTTGC GTACTGAACG TGCAAAGGCA ATATATGAAG AAGATGTAAG TCTTCCAATT
AGAAAGTCAC ATGAAAACCC AGTAATCAAA GAAATGTATG ATAAATATTT CGGAGAGCCG
GGAAGCCATA AGGCACATGA GATTTTACAC ACACATTATG CTGCAAGGGA AAACTACCCT
GTAAAATAG
 
Protein sequence
MSTVNITIDG KKLQVEQGIT ILEAARQANI KIPTLCFLKD INEIGACRMC LVEIKGARAL 
QASCVYPVAE GLEIYTQSPD VREARKVTLE LILSNHDKKC LTCVRSKNCE LQNLAEELNI
KDIRFEGASI DLPLDDFSPS IVRDPNKCVL CKRCVSMCKN IQTVSVISAA ERGFKSTISC
AFDRSLEEVP CTMCGQCISV CPVGALREKD DTDKVWSALA DKELHVVVQT APAVRVALGE
EFGLPIGTRV TGKMAAALNH LGFAKVFDTD TAADLTIMEE GTELLNRIKN GGKLPVITSC
SPGWIKFCEH NYPEFLENLS SCKSPHEMFG AVLKTYYAEK MGIDPKKIFV VSVMPCTAKK
FEAQRPELSA TGLPDVDVVI TTRELARMIK EAGIDFNNLE DMDFDDPMGN ATGAGVIFGA
TGGVMEAALR TVSEIVAGKS FEDIEYTAVR GIEGIKEATV AIGDMKVKAA VANGLGNARK
LLDSIKAGEA AYDFVEIMAC PGGCVNGGGQ PIQPSSVRSW TDLRTERAKA IYEEDVSLPI
RKSHENPVIK EMYDKYFGEP GSHKAHEILH THYAARENYP VK