Gene CPR_2579 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_2579 
Symbol 
ID4205092 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2808956 
End bp2810428 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content32% 
IMG OID642567129 
Product[Fe] hydrogenase 
Protein accessionYP_699826 
Protein GI110802004 
COG category[R] General function prediction only 
COG ID[COG4624] Iron only hydrogenase large subunit, C-terminal domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTATAA AGGATGCTAA CAAGCAATAT ATAAAGTTTG ACACTGCTGT TCAAGTTCTT 
AAATATGAGG TTTTAAAAAG GATAGCAGAG AAAGAATTTG ATGGTACTTT AGAAAAAGAA
AAGCTTAATA TAGCAAAAGA AATTGTGGAT GATTTAAAAC CTAATGTAAG ATGTTGCATA
TATAAGGAGA GGGCTATAGT TGAAGAAAGA ATTAAACTAG CTTTAGGTGG ACATGAAAAT
AGAGAAAACA TGATAGAAGT AATAGACATA GCCTGTGATG AATGTCCTGT TAATAGATTT
ATAGTCACTG ATGCATGTAG AGGATGCCTA GCTAAGAAAT GTAGAGATAG TTGTAATTTT
GAAGCTATAA ATTTTGATAA TAGAAAATGC AAAATTGATT ATGAAAAATG TAAGGAATGT
GGAAAATGTA AAGAGGTTTG TCCTTACAAT GCTATAGCTG AGGTAAAAAG ACCTTGCATG
AGAGCATGTA TTCCAAAGGC CTTATCTTAT GATGTTGATA GTAAAAAAGC TGTCATAGAT
GATTCAAAAT GTATTCAATG TGGAGCCTGT GTAGTTGATT GTCCTTTTGG AGCTATAATG
GACAAATCCT ATTTAGTAGA TGTTATAAGA TTACTTAAAG ATGAGAAGAA AGTTTATGCC
ATTGTGGCGC CAGCTATATC ATCTCAATTT AATCATAGTA AGATTGGTAA AGTGATTACT
GCTATAAAAA AATTAGGATT TGAAGATGTG TTTGAAGCAG CTTTAGGAGC AGATTTGGTG
GCTGTTCATG AATGTAATGA ATTTAAAGAA AAAGGTGAAT TAGACTTCAT GACAACAAGT
TGTTGCCCTG CTTTTGTTTC TTATATAGAA AAAAATTATC CAGAGCTTAA GGAGTATATA
TCTAATACTG TATCTCCTAT GGTAGCTATG GCAAGGTTAA TAAAATCTCA AAATAAAGAT
GTTAAAACTG TATTCATAGG ACCTTGTATT GCAAAGAAAA CAGAAGCTAA GAGAAATGAG
GTTAGTGGAG ATGTTGATTA TGTATTGACC TTTGAGGAGC TTTTAGCTTT ACTTGACTCT
AGAAATATTA AAATTGACGA ATGTAAAGAA AGTGATACTA AGCATGGTTC ATTTTATGGA
AGACTTTTTG CTAGAAGTGG AGGAGTTACC GAATCAGTTA AACATCTTAT AGATAGCGAA
GGGATAAAAG TAGATTTTAG ACCTATACTT GGAGATGGAA TTAAGGATTG CGACATAAAA
CTTAGATTAG CAAAACTAAA AAGAGCACAG GGAAACTTTT TAGAAGGAAT GGCTTGCAAA
GGTGGATGTA TAAATGGGCC AGGCTCCTTA AATCATGATA TTAAGAATAG CAAAAAAGTT
GATAAATATG GCGAATTATC CTCTTCTGAG AAGATAAAAG ATACTTTAGT TGATATTAAA
TTTGAGGATT TAAATTTATC TAAAAATGAG TAA
 
Protein sequence
MAIKDANKQY IKFDTAVQVL KYEVLKRIAE KEFDGTLEKE KLNIAKEIVD DLKPNVRCCI 
YKERAIVEER IKLALGGHEN RENMIEVIDI ACDECPVNRF IVTDACRGCL AKKCRDSCNF
EAINFDNRKC KIDYEKCKEC GKCKEVCPYN AIAEVKRPCM RACIPKALSY DVDSKKAVID
DSKCIQCGAC VVDCPFGAIM DKSYLVDVIR LLKDEKKVYA IVAPAISSQF NHSKIGKVIT
AIKKLGFEDV FEAALGADLV AVHECNEFKE KGELDFMTTS CCPAFVSYIE KNYPELKEYI
SNTVSPMVAM ARLIKSQNKD VKTVFIGPCI AKKTEAKRNE VSGDVDYVLT FEELLALLDS
RNIKIDECKE SDTKHGSFYG RLFARSGGVT ESVKHLIDSE GIKVDFRPIL GDGIKDCDIK
LRLAKLKRAQ GNFLEGMACK GGCINGPGSL NHDIKNSKKV DKYGELSSSE KIKDTLVDIK
FEDLNLSKNE