Gene CPF_1076 details


Gene Information

Locus tag: CPF_1076
Symbol:
ID: 4203697
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 1226670
End bp: 1228760
Gene Length: 2091 bp
Protein Length: 696 aa
Translation table: 11
GC content: 32%
IMG OID: 638081957
Product: [Fe] hydrogenase
Protein accession: YP_695522
Protein GI: 110799819
COG category: [C] Energy production and conversion; [R] General function prediction only
COG ID: [COG1592] Rubrerythrin; [COG4624] Iron only hydrogenase large subunit, C-terminal domain
TIGRFAM ID: [TIGR02512] hydrogenases, Fe-only
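
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1226670
END_BP = 1228760
GENE_LENGTH_BP = 2091
PROTEIN_LENGTH_AA = 696


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming exactly one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 2091
print(expected_protein_length(GENE_LENGTH_BP))  # 696
```

Under these assumptions both computed values agree with the listed Gene Length and Protein Length.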


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.723256
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGTAAAC ATTTAGCTAC GGATATACGT GTAGCTATTG AGAAGGATAA TCCATCTATT 
TGTAGAGATG AAGAGAAATG TATAAAATGT GGTTCATGCA AAAATATATG CACAGATTAT
ATAGGAGTAA ATGGTCATTA TTCTTTGGAG AAAACTAATG ATACTGCTGT TTGTATAAAT
TGTGGTCAAT GTGCGAATGT ATGCCCAACA TCTAGTATTA CAGAGGTTTT TGATTATAAA
AAGGTACAAG ATGCGATAAG TGATAAAGAT AAAATAGTCA TAGTTTCTAC GTCGCCTGCT
GTTAGAGTAT CTTTAGGTGA AGAATTTAAT ATGAATGATG GAAGTTTTGT TCAAGGTAAA
ATGATAGCAT TACTTCGTAA GTTAGGTTTT GATTATGTTC TTGATACTAA TTTCGCTGCT
GATATGACTA TAGTTGAGGA GGCAAGTGAA TTAGTAGAGC GTATAACTAA AAATAATAAA
CCTCTACCTC AATTTACTAG TTGTTGTCCG GCTTGGGTAA AATATGCTGA AACCTTTCAT
CCAGAAATCT TAGAGCATAT TTCAACTTCA AAGAGTCCTA TAGGTATGCA AGGACCTACA
ATAAAGACTT ATTTTGCAAA GAAGATGGGA ATTGATCCTT CTAAGATTGT AAATGTAGCC
TTAACGCCAT GTACAGCTAA AAAGTTTGAA ATCAAAAGAG AAGAAATGAA TGCATCAGGT
AGATATTATG GCATAGAAGA TATGCGTGAT ATGGACTATG TTATAACAAC TAGGGAAGTA
GCTATTTGGG CTAAAGAAAA AGAAATTGAT TTTAATTCTT TAGAGGATAG TAACTTTGAT
AAATTAATGG GAGAAGCTTC AGGGGCTGGT GTTATATTTG CTAATACAGG TGGAGTTATG
GAAGCTGCTT TAAGAACAGC ATACAAATAT ATTACCAAAG AAGATCCACC TAAAGATTTT
TATGATTTAG AGTCTGTAAG AGGTATTGAA GGTGTTCGTG AAGCTAGTTT TAAAATAAAT
GACTTAGAAA TAAACATAGC TGTTATTTAT GGAACTGAAA ATGCTTTAAA GTTTATTTCA
AAGATAAAAA ATTCGGATAA GAAATATCAC TTTATTGAGG TTATGACTTG TCCAGGCGGA
TGCATAGGTG GAGGGGGACA ACCTAAAGAT AAAAAATTTG AGGGTGATAA ACTTAGAGAA
AAACGTATAG ATGGATTATA TAAAAGAGAT TCAAGCATGA AATTAAGAGT TAGCTATGAA
AATGAAGAGA TTAAAAAGTT ATATGAGGAG TTTTATGAAA AACCTTTAAG CAATCTTGCT
GAGGAAATGT TACATACAAC TTATTTTGAT AGAAGTAAAG ATTTAGGAGG AGAAAAAATG
AGTGAATCAG TAAAATATCG TTGTACAGTT TGTGGATATA TACAAGAAGG AGAATTACCA
GAAGGATTTA TATGTCCAGT TTGCAAGAGT CCAGCTTCAG CCTTTGTAAA AGTTGAAGAG
AAATCAGAGG TAGATGATAA AGATTCTAAT AAATCAGAAT CATCTAGTGA AAGTAGTAAT
AAATATAAAG GAACAAAAAC AGAAAAGAAT TTAATGGATG CATTAGCAGG GGAGTCCATA
GCTAGAAATA AATATACATT CTTTGCAGAG GTAGCCAAGA ATGAAGGATA TGAACAAATA
TATGAACTAT TCTTAAAAAC AGCTGGCAAT GAAAGAGAAC ATGCTAAACT ATGGTTTAAG
GAGTTAGGAT ATTTAGGAGA TACTAAAGAA AATTTATTAC ATGGGGCAGA AGGAGAGCAT
TATGAATGGT CAGATATGTA TGCTAGATTT GCAAAAGAGG CTGAAGAAGA GGGATTTTTT
GATTTAGCAG AAAAATTTGT AGAGGTTGCT AAAATAGAAA AATCTCATGA AGATAGATAT
AGAAAGCTTT TAAACAATAT AAAGATGAAA AAGGTATTTG AAAAATCAGA AGAAACTATG
TGGGAATGCT TAAATTGTGG ATATCTTGTT ATTGGTAAAA AGGCTCCAGA GGTTTGCCCA
GTATGTAATT ATGCCAAAGG ATTTTTTGAA GTAAGAGCTG AAAATTATTA A
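
The gene sequence is laid out in space-separated blocks of ten bases, so it needs to be joined before any computation. The sketch below is one possible way to do that and to compute a GC fraction; the helper names are hypothetical, and the example input is only the first two blocks of the sequence, so its GC fraction is not expected to match the 32% reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence, used as a small illustration.
first_blocks = "ATGGGTAAAC ATTTAGCTAC"
seq = clean_sequence(first_blocks)
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.35
```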
 
Protein sequence
MGKHLATDIR VAIEKDNPSI CRDEEKCIKC GSCKNICTDY IGVNGHYSLE KTNDTAVCIN 
CGQCANVCPT SSITEVFDYK KVQDAISDKD KIVIVSTSPA VRVSLGEEFN MNDGSFVQGK
MIALLRKLGF DYVLDTNFAA DMTIVEEASE LVERITKNNK PLPQFTSCCP AWVKYAETFH
PEILEHISTS KSPIGMQGPT IKTYFAKKMG IDPSKIVNVA LTPCTAKKFE IKREEMNASG
RYYGIEDMRD MDYVITTREV AIWAKEKEID FNSLEDSNFD KLMGEASGAG VIFANTGGVM
EAALRTAYKY ITKEDPPKDF YDLESVRGIE GVREASFKIN DLEINIAVIY GTENALKFIS
KIKNSDKKYH FIEVMTCPGG CIGGGGQPKD KKFEGDKLRE KRIDGLYKRD SSMKLRVSYE
NEEIKKLYEE FYEKPLSNLA EEMLHTTYFD RSKDLGGEKM SESVKYRCTV CGYIQEGELP
EGFICPVCKS PASAFVKVEE KSEVDDKDSN KSESSSESSN KYKGTKTEKN LMDALAGESI
ARNKYTFFAE VAKNEGYEQI YELFLKTAGN EREHAKLWFK ELGYLGDTKE NLLHGAEGEH
YEWSDMYARF AKEAEEEGFF DLAEKFVEVA KIEKSHEDRY RKLLNNIKMK KVFEKSEETM
WECLNCGYLV IGKKAPEVCP VCNYAKGFFE VRAENY
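
The protein sequence corresponds to a translation of the gene sequence under translation table 11. As a minimal sketch, the code below builds a codon table from the compact NCBI-style amino acid string; translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, which this sketch does not model. The `translate` helper is hypothetical, and the inputs are short excerpts from the start and end of the gene sequence.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is ignored; trailing bases that do not fill a codon are dropped.
    """
    seq = "".join(text.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence.
print(translate("ATGGGTAAAC ATTTAGCTAC GGATATACGT"))  # MGKHLATDIR
# Last 21 bases of the gene sequence, including the TAA stop codon.
print(translate("GTAAGAGCTG AAAATTATTA A"))           # VRAENY
```

Both excerpts reproduce the first ten and last six residues of the listed protein sequence.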