Gene CPR_2341 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_2341 
SymbolhydA 
ID4206336 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2568433 
End bp2570151 
Gene Length1719 bp 
Protein Length572 aa 
Translation table11 
GC content33% 
IMG OID642566891 
Productiron hydrogenase 
Protein accessionYP_699606 
Protein GI110803368 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG1034] NADH dehydrogenase/NADH:ubiquinone oxidoreductase 75 kD subunit (chain G)
[COG4624] Iron only hydrogenase large subunit, C-terminal domain 
TIGRFAM ID[TIGR02512] hydrogenases, Fe-only 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAAA TAATAATCAA TGATAAGACT ATCGAATTTG ATGGTGACAA GACGATACTT 
GACTTAGCAA GAGAGAATGG GTTTGACATT CCAGTGCTTT GTGAGTTAAA AAATTGCGGA
AACAAAGGAC AATGTGGAGT TTGTCTTGTT GAGCAAGAAG GAAATGATAG ACTATTAAGA
TCTTGTGCAA TAAAAGCTAA AGATGGAATG GTTATTAAAA CTGATAGTGA AAAAGTACTA
GAAGCTAGAA AAGAAAGAGT TGCAGAACTT TTAGATGAGC ATGAGTTCAA ATGTGGACCA
TGTAAAAGAA GAGAGAATTG TGAATTCTTA AAACTTGTAA TTAAAACAAA GGCAAGAGCT
CACAAACCAT TTGTAGTTGC AGATAAATCA GAATATGTAG ATGACAGAAG TAAATCAATA
GTTTTAGATA GAAGTAAATG TGTTAAATGT GGTAGATGTG TTGCTGCATG TAGAACAAGA
ACTGCTACTA ACTCAATAAA ATTCCACAGA ATTGATGGAG TAAGATTAGT AGGACCAGAG
GAATTAAAAT GTTTCGACGA TACAAATTGT TTATTATGTG GACAATGTGT AGCTGCTTGT
CCAGTTGATG CATTATCAGA AAAATCACAT ATTGAAAGAG TTCAAGATGC ATTAAATGAT
CCAGAAAAGC ATGTAATAGT TGCTATAGCA CCAGCTGTAA GAACATCAAT GGGTGAATTA
TTCAAAATGG GTTACGGCCA AGATGTTACT GGAAAATTAT ATACTGCTTT AAGAAAACTA
GGTTTTGATA AAGTATTTGA TATAAACTTC GGTGCTGATA TGACAATAAT GGAAGAAGCT
ACTGAGCTTA TAGAAAGAAT AAAAAATAAT GGGCCTTTCC CAATGTTAAC TTCATGTTGC
CCATCATGGG TTAGAGAAGT TGAAAACTAC TTCCCAGAAT TAGTAGAAAA TCTTTCAACA
GCTAAGTCAC CACAACAAAT ATTTGGTTCA GCATCAAAAA CTTATTATCC TCAAGTTGCT
GATATAGATC CTAAAAAAGT ATTTACAGTA ACTGTAATGC CTTGTACTTC TAAAAAATTC
GAAGCTGATA GACCTGAAAT GGAAAACGAA GGAATAAGAA ATATAGATGC AGTTATAACA
ACTAGAGAGT TAGCTAGAAT GATAAAGGCT GCTAAAATAG ATTTTGCTAA ATTAGAAGAC
AGTGAAGTAG ATCCAGCTAT GGGTGAGTAT ACTGGAGCAG GTGTTATATT TGGAGCTACT
GGTGGAGTTA TGGAAGCTGC TTTAAGAACA GCTAAAGATT TCATGGAAAA TGATAACTTA
GACAATGTAG ATTATGAAGC TGTTAGAGGA TTAGCTGGAA TAAAAGAAGC TGAAGTAGAA
ATAGCAGGAA ATGAATATAA ATTAGCTGTT GTAAATGGAG CTGCTAATGT ATTTGAATTA
GTTAAGTCTG GTAAAATAAA TGATTACCAC TTCATCGAAG TAATGGCATG TCCAGGTGGA
TGTGTTAACG GTGGGGGACA ACCACATATC TCAGCTGAAG ATAGTGATAA AATTGATATT
AGAGAAGTAA GAGCTTCTGT TCTTTACAAT CAAGATAAGA ATTTAGAGAA GAGAAAATCA
CATCAAAACT CAGCTTTATT AAAAATGTAT GAAAACTACA TGGGTAAACC AGGTCATGGA
AGAGCTCATG AGTTATTACA CATGAAATAT AAAAAATAA
 
Protein sequence
MNKIIINDKT IEFDGDKTIL DLARENGFDI PVLCELKNCG NKGQCGVCLV EQEGNDRLLR 
SCAIKAKDGM VIKTDSEKVL EARKERVAEL LDEHEFKCGP CKRRENCEFL KLVIKTKARA
HKPFVVADKS EYVDDRSKSI VLDRSKCVKC GRCVAACRTR TATNSIKFHR IDGVRLVGPE
ELKCFDDTNC LLCGQCVAAC PVDALSEKSH IERVQDALND PEKHVIVAIA PAVRTSMGEL
FKMGYGQDVT GKLYTALRKL GFDKVFDINF GADMTIMEEA TELIERIKNN GPFPMLTSCC
PSWVREVENY FPELVENLST AKSPQQIFGS ASKTYYPQVA DIDPKKVFTV TVMPCTSKKF
EADRPEMENE GIRNIDAVIT TRELARMIKA AKIDFAKLED SEVDPAMGEY TGAGVIFGAT
GGVMEAALRT AKDFMENDNL DNVDYEAVRG LAGIKEAEVE IAGNEYKLAV VNGAANVFEL
VKSGKINDYH FIEVMACPGG CVNGGGQPHI SAEDSDKIDI REVRASVLYN QDKNLEKRKS
HQNSALLKMY ENYMGKPGHG RAHELLHMKY KK