Gene P9211_17021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_17021 
SymbolhemF 
ID5730086 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp1529251 
End bp1530378 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content42% 
IMG OID641286084 
Productcoproporphyrinogen III oxidase 
Protein accessionYP_001551587 
Protein GI159904243 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0408] Coproporphyrinogen III oxidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.18622 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.770424 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTCATAA TTTGCAATCT GAATTCAATA ATCCTAAATG GGTATTCTGA TGTCCTAGCC 
TCAACTAGAA AATACAGATA TAGCCCAGTG TCATTGGAAC ACCTTTCTCA GCCTCCTGCA
AATTCGAGAC AGAGGGCCAA AGAACTTGTA CTTTCGCTAC AAGATCAAAT ATGTAATGGC
CTTGAAGCAG TAGATGGTGA AGGAACTTTC AAGGAAGAGA CATGGGAAAG ACCTGAAGGA
GGCGGGGGCA GGTCAAGAGT AATGAGTGAA GGAAGAGTTC TTGAACAAGG AGGGGTCAAC
TTCTCTGAAG TACAAGGACA GGAACTCCCC CCATCAATAA TTAATCAACG ACCTGAAGCC
AAAGGGCATC CCTGGTTTGC AACTGGGACT TCTATGGTCC TTCATCCAAA AAATCCCTAT
ATTCCAACTG TTCATCTCAA TTACCGTTAT TTCGAAGCTG GTCCAGTTTG GTGGTTTGGC
GGTGGTGCAG ATCTAACACC TTACTATCCA TACTTAAGCG ATACCAAGCA TTTCCACAAA
ACTCTCCAGC AAGCTTGTGA TTCCATAAAT CCCTTACTGC ATAAGGTTTT CAAGCCATGG
TGTGATGAAT ATTTCTTTCT AAAGCACAGG AATGAAACAA GAGGCGTAGG TGGTATTTTC
TTTGACTACC AAGATGGATC AGGAAATTTA TATAAGGGTC AAGACCCTAA AGGGCCCGCT
GCAAAAATTG CAAATGAACT AGGCAAGCAT CCTATGAATT GGGAGGAACT CTTTGCATTG
GCAAAAGCAT GCGGGAATGC TTTTCTACCT TCTTATATCC CAATCATCGA AAAGCGACAG
AATCAATCAT TTACGGAGCG AGAAAGACAA TTTCAGCTGT ATAGACGAGG AAGATATGTT
GAATTTAATT TGGTATGGGA TAGAGGAACA ATTTTCGGTC TTCAGACAAA CGGTAGAACG
GAGTCAATAC TAATGTCTCT CCCTCCTTTA GCAAGATGGG AATATGGCTA CAAGGCAGAG
AAGGGATCAA GAGAAGAACT ACTCACTAAT GTGTTTACTA AGCCTCAAGA ATGGTTTAAC
GATAAGACTT TGGAAGAAAA ATGTCATCCG TTAGAAGCTG TGGATTAA
 
Protein sequence
MVIICNLNSI ILNGYSDVLA STRKYRYSPV SLEHLSQPPA NSRQRAKELV LSLQDQICNG 
LEAVDGEGTF KEETWERPEG GGGRSRVMSE GRVLEQGGVN FSEVQGQELP PSIINQRPEA
KGHPWFATGT SMVLHPKNPY IPTVHLNYRY FEAGPVWWFG GGADLTPYYP YLSDTKHFHK
TLQQACDSIN PLLHKVFKPW CDEYFFLKHR NETRGVGGIF FDYQDGSGNL YKGQDPKGPA
AKIANELGKH PMNWEELFAL AKACGNAFLP SYIPIIEKRQ NQSFTERERQ FQLYRRGRYV
EFNLVWDRGT IFGLQTNGRT ESILMSLPPL ARWEYGYKAE KGSREELLTN VFTKPQEWFN
DKTLEEKCHP LEAVD