Gene A9601_17891 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_17891 
SymbolhemF 
ID4718523 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp1523724 
End bp1524752 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content34% 
IMG OID640079519 
Productcoproporphyrinogen III oxidase 
Protein accessionYP_001010179 
Protein GI123969321 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0408] Coproporphyrinogen III oxidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTGAAAG AACCTCCTAA AAACTCGAGA GAAAAAACTA AAAATCTCTT ATTAACTCTA 
CAAGACAAAA TTTGTTCAGG ACTTGAAAAT GTAGATGGAA AAGGAAAATT TACAGAGGAA
TCCTGGCTAA GAGACGAAGG TGGCGGTGGA AGATCTAGAG TATTGAAAAA TGGTTCTATT
TTTGAGCAAG CAGGCGTAAA TTTCTCGGAA GTACAAGGAA AAGAATTACC TCAATCTATA
ATCTCTCAAA GGCCCGAAGC AAAAGGTCAT GAATGGTTTG CTACGGGAAC TTCTATGGTT
TTGCATCCTA AGAATCCCTA TATTCCCACA GTTCATCTGA ATTATCGATA TTTCGAAGCT
GGTCCTGTTT GGTGGTTTGG AGGAGGTGCA GACTTAACCC CTTTTTATCC TTATCTTTCT
GATGTAAGGA ATTTTCATAA TGAGCATAAA AAAGCTTGTG AGAAAGTTGA TCAAGATTTG
CATAAAGTTT TCAAACCATG GTGTGATGAA TATTTCTTCT TGAAACATAG AAATGAATCT
AGAGGTATAG GAGGTATTTT TTATGATTAT CAAGATGGTT CAGGAAATAT TTATAGAGGA
AATAATAAAA ATGGAGAAGC ATCAAAAGCT TCACAAAATG TTGGCAGATC TAATTTAAAT
TGGGATAATT TATTTTCTTT AGCAGAAAAC TGTGGGCAGG CATTCCTACC TTCATATTTG
CCCATTATTG AAAAAAGAGC TTCTCAAAAA TATTCACCGA AAGAAAGAGA ATTCCAGCTA
TATCGAAGAG GTAGATATGT CGAATTCAAT TTAGTTTGGG ATAGAGGGAC GATTTTTGGG
CTACAAACAA ATGGCAGAAC TGAATCTATA TTAATGTCCT TACCGCCTTT AGCTAGATGG
GAATATGGAT ATAAAGCTAA AAATGGCTCT CGAGAGGAAT TTCTCACATC AATTTTTACA
AAACCCCAAG ATTGGTTTAA TGATAAAGAG TTAGAAAAAT TCTGTATGGA GAATAATATT
TTTGATTAA
 
Protein sequence
MLKEPPKNSR EKTKNLLLTL QDKICSGLEN VDGKGKFTEE SWLRDEGGGG RSRVLKNGSI 
FEQAGVNFSE VQGKELPQSI ISQRPEAKGH EWFATGTSMV LHPKNPYIPT VHLNYRYFEA
GPVWWFGGGA DLTPFYPYLS DVRNFHNEHK KACEKVDQDL HKVFKPWCDE YFFLKHRNES
RGIGGIFYDY QDGSGNIYRG NNKNGEASKA SQNVGRSNLN WDNLFSLAEN CGQAFLPSYL
PIIEKRASQK YSPKEREFQL YRRGRYVEFN LVWDRGTIFG LQTNGRTESI LMSLPPLARW
EYGYKAKNGS REEFLTSIFT KPQDWFNDKE LEKFCMENNI FD