Gene P9303_22671 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_22671 
SymbolhemF 
ID4777397 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp2001390 
End bp2002463 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content53% 
IMG OID640087785 
Productcoproporphyrinogen III oxidase 
Protein accessionYP_001018267 
Protein GI124023960 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0408] Coproporphyrinogen III oxidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.364873 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTCCTA CTTTGATTGG CAAGGTTATG GGTGCTTCTG AGAATGTTGG CCAAGGTCCT 
CCTCCTCACT CGCGCGAACG TGTACGTGAA TTGGTACTCG GATTGCAAGA TGAGATCAGT
AATGAACTGG AGAGTCTTGA TGGTGGCCAA TCTTTTAGAA CTGATAGTTG GGAGCGGCCT
GAAGGGGGTG GTGGGCGATC CAAGGTGATG CGTGAGGGCC GAGTTTTTGA ACAGGGCGGC
GTTAATTTCT CTGAGGTGCA CGGCGAGGAG TTGCCTCCGT CGATTCTGAA TCAGCGACCT
GAGGCAAAGG GGCATCCCTG GTTCGCTACC GGCACTTCGA TGGTGCTACA CCCGCGCAAT
CCCTATGTGC CTACGATCCA CCTTAATTAC CGCTATTTCG AGGCGGGGCC GGTGTGGTGG
TTTGGCGGTG GCGCTGACCT CACGCCGTTT TATCCCTACC TGGAAGATGC CCGCCATTTT
CATCGCGTTC ACAAGCAGGC TTGCGATACG GTTGGACCTG AGCTCCATAA GGTCTTTAAA
CCTTGGTGTG ACGAATATTT CTATCTGAAG CACCGTGGTG AGACCCGTGG TGTGGGTGGG
ATTTTTTACG ACTACCAGGA TGGATCTGGA GTGCTTTACA AAGGTCAAAA CCCTGAGGGT
CCAGCTGCAC AGGTCTCACG GGAGTTAGGG CCTCATCCGA AGAGCTGGGA ACAGTTATTT
GAGCTGGCCA AGGCTTGTGG GAAGGCTTTC TTGCCGGCTT ATGTGCCGAT TGTGGAGAAA
CGTCAGCAGC AGGCCTATGG CGATCGAGAA CGTCAATTCC AGTTGTATCG CCGTGGGCGA
TATGCGGAGT TCAATCTGGT CTGGGATCGG GGCACGATTT TCGGATTGCA AACCAATGGC
CGAACGGAGT CGATCTTGAT GTCTTTGCCA CCACTGGCTC GTTGGGAGTA TGGATATGCC
GCACCAGCTG ATTCAAGGGA GGCTTTGCTC ACTGATTTGT TTACTCGACC TCAGAATTGG
TTTGAGGATT CGACGTTGGA TGAGCGTTGT CGACCACACC AGGCGGTGGA TTAG
 
Protein sequence
MVPTLIGKVM GASENVGQGP PPHSRERVRE LVLGLQDEIS NELESLDGGQ SFRTDSWERP 
EGGGGRSKVM REGRVFEQGG VNFSEVHGEE LPPSILNQRP EAKGHPWFAT GTSMVLHPRN
PYVPTIHLNY RYFEAGPVWW FGGGADLTPF YPYLEDARHF HRVHKQACDT VGPELHKVFK
PWCDEYFYLK HRGETRGVGG IFYDYQDGSG VLYKGQNPEG PAAQVSRELG PHPKSWEQLF
ELAKACGKAF LPAYVPIVEK RQQQAYGDRE RQFQLYRRGR YAEFNLVWDR GTIFGLQTNG
RTESILMSLP PLARWEYGYA APADSREALL TDLFTRPQNW FEDSTLDERC RPHQAVD