Gene CPF_2655 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_2655 
SymbolhydA 
ID4202993 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2926027 
End bp2927745 
Gene Length1719 bp 
Protein Length572 aa 
Translation table11 
GC content34% 
IMG OID638083521 
Productiron hydrogenase 
Protein accessionYP_697035 
Protein GI110799871 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG1034] NADH dehydrogenase/NADH:ubiquinone oxidoreductase 75 kD subunit (chain G)
[COG4624] Iron only hydrogenase large subunit, C-terminal domain 
TIGRFAM ID[TIGR02512] hydrogenases, Fe-only 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAAA TAATAATCAA TGATAAGACT ATCGAATTTG ATGGTGATAA GACGATACTT 
GACTTAGCAA GAGAGAATGG ATTTGACATT CCAGTGCTTT GTGAGTTAAA AAATTGCGGA
AACAAAGGAC AATGTGGAGT TTGTCTTGTT GAGCAAGAAG GAAATGATAG ACTATTAAGA
TCTTGTGCAA TAAAAGCTAA AGATGGAATG GTTATTAAAA CTGATAGTGA AAAAGTACTA
GAAGCTAGAA AAGAAAGAGT TGCAGAACTT TTAGATGAGC ATGAGTTCAA ATGTGGACCA
TGTAAAAGAA GAGAGAATTG TGAATTCTTA AAACTTGTAA TTAAAACAAA GGCAAGAGCT
CACAAACCAT TTGTAGTTGC AGATAAATCA GAATATGTAG ATGACAGAAG TAAATCAATA
GTTTTAGATA GAAGTAAATG TGTTAAATGT GGTAGATGTG TTGCGGCATG CAGAACAAGA
ACTGCTACTA ACTCAATAAA ATTCCACAGA ATTGATGGAG TAAGATTAGT AGGACCAGAA
GAATTAAAAT GTTTCGACGA TACAAATTGT TTATTATGTG GACAATGTAT AGCTGCTTGC
CCAGTTGATG CATTATCAGA AAAATCACAT ATTGAAAGAG TTCAAGAGGC ATTAAATGAT
CCAGAAAAGC ATGTAATAGT TGCTATGGCA CCAGCTGTAA GAACATCAAT GGGTGAATTA
TTCAAGATGG GTTATGGACA AGATGTTACT GGAAAATTAT ATACTGCTTT AAGAGAATTA
GGTTTTGATA AAGTATTTGA TATAAACTTC GGTGCTGATA TGACAATAAT GGAAGAAGCT
ACTGAGCTTA TAGAAAGAAT AAAAAATAAC GGACCTTTCC CAATGTTAAC TTCATGTTGT
CCATCATGGG TTAGAGAAGT TGAAAACTAC TTCCCAGAAT TAGTAGAAAA TCTTTCATCA
GCTAAATCAC CACAACAAAT ATTTGGTGCA GCATCAAAAA CTTATTATCC ACAAGTTGCT
GATATAGATC CTAAAAAAGT ATTTACAGTA ACTGTAATGC CTTGTACTTC TAAAAAATTC
GAGGCTGATA GACCTGAAAT GGAAAACGAA GGAATAAGAA ATATAGATGC AGTTATAACA
ACTAGAGAGT TAGCTAGAAT GATAAAAGCT GCTAAAATAG ATTTTGCTAA ATTAGAAGAT
GGTGAAGTGG ATCCAGCTAT GGGTGAGTAC ACTGGAGCAG GTGTTATATT TGGAGCTACT
GGTGGAGTTA TGGAAGCTGC TTTAAGAACA GCTAAAGATT TCATGGAAAA TGACAACTTA
GATAATGTAG ATTACGAAGC TGTTAGAGGA TTAGCTGGAA TAAAAGAAGC TGAAGTAGAA
ATAGCAGGAA ATGAATATAA ATTAGCTGTT GTAAGTGGAG CTGCTAATGT ATTTGAACTA
GTTAAGTCTG GTAAAATAAA TGACTACCAC TTCATCGAAG TAATGGCATG TCCTGGTGGA
TGTGTTAATG GTGGAGGACA ACCACACATC TCAGCTGAAG ATAGTGATAA AATGGATATT
AGAGAAGTAA GAGCTTCTGT TCTTTACAAT CAAGATAAGA ATTTAGAGAA GAGAAAATCA
CATCAAAACT CAGCTTTATT AAAAATGTAT GAAAGCTACA TGGGTAAACC AGGTCATGGA
AGAGCTCATG AGTTATTACA CATGAAATAT AAAAAATAA
 
Protein sequence
MNKIIINDKT IEFDGDKTIL DLARENGFDI PVLCELKNCG NKGQCGVCLV EQEGNDRLLR 
SCAIKAKDGM VIKTDSEKVL EARKERVAEL LDEHEFKCGP CKRRENCEFL KLVIKTKARA
HKPFVVADKS EYVDDRSKSI VLDRSKCVKC GRCVAACRTR TATNSIKFHR IDGVRLVGPE
ELKCFDDTNC LLCGQCIAAC PVDALSEKSH IERVQEALND PEKHVIVAMA PAVRTSMGEL
FKMGYGQDVT GKLYTALREL GFDKVFDINF GADMTIMEEA TELIERIKNN GPFPMLTSCC
PSWVREVENY FPELVENLSS AKSPQQIFGA ASKTYYPQVA DIDPKKVFTV TVMPCTSKKF
EADRPEMENE GIRNIDAVIT TRELARMIKA AKIDFAKLED GEVDPAMGEY TGAGVIFGAT
GGVMEAALRT AKDFMENDNL DNVDYEAVRG LAGIKEAEVE IAGNEYKLAV VSGAANVFEL
VKSGKINDYH FIEVMACPGG CVNGGGQPHI SAEDSDKMDI REVRASVLYN QDKNLEKRKS
HQNSALLKMY ESYMGKPGHG RAHELLHMKY KK