Gene P9301_14971 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_14971 
SymbolhemN 
ID4912834 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp1266897 
End bp1268120 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content33% 
IMG OID640161092 
Productputative oxygen-independent coproporphyrinogen III oxidase 
Protein accessionYP_001091721 
Protein GI126696835 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases 
TIGRFAM ID[TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.90779 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAGT TTCCAAGAAG TGCTTATGTG CACATTCCTT TTTGCCACAG AAGGTGTTTT 
TATTGTGATT TTGCAGTTAT TCCATTAGGA AACGAAGTTG AAACTTTAAA AGGTTATGGA
AGCAAAACAG TTCAGGAGTA TTTGCAATTT TTATATAAAG AAATATTGTC AATTAAGCAT
AAATCACCTC TATCGACAAT TTATATAGGA GGTGGTACAC CATCAATCTT AGATCCTGCC
CAAATCAAAG AATTGGTTAA TCTTTTTAAA GAAAATTATG GAATTGACTA TGGTGCTGAA
ATTACTATGG AGGTTGATCC AGCAAGTTTT ACTCAAGATG ATCTTTTCGG ATTCATAAAT
GCTGGGATAA ATAGATTTAG TCTTGGAGTA CAAAGTTTTA ATAATCAGAT ACTTCAAAAG
TCGGGAAGGC GTCATTTGAA AGAAGATGCA GTGAAATCTT GTTTATGGTT GAAGAGAGAA
TATGATAACG GGTTAATAAA AAGCTGGAGT CTAGATTTAA TTCAAAACTT GCCACTAAGT
GGATTTAAGG AATGGCAAGA TGACTTAAAA AAAGCAATTA CATTTTCACC ACCTCATCTA
TCTATTTACG ATTTAAATAT TGAAGATGGC ACTGTTTTTA AAAAATTAAT TAATTTAGGC
AAATTAAAAC TTCCAAGTGA TGAAGAAGCT TTTAGAAATA GTGAATCAAC ACATTTAATT
TTAAAAAAAT CAGGGTATTC AAGATATGAA ATCTCAAACT ATTGCCTTCC GCGACACCAA
TCGAGACATA ATAGAGTTTA TTGGAGTGGT TTAGGCTGGT GGAGTTTTGG TCAAGGTTCC
ACTAGTTCAC CTTGGGGAGA AAAGTTTACT CGACCAAGAG TTAGTAAAGA ATATAAAGAA
TGGGTAACAA GACAAGACGA ATTTAATTTA GATTCATCCC TAACTAATAA GGGGTTTGTT
TATAAAGAAC TTGATGAGAA AATAATGTTG GGATTAAGAC TTAAAGAGGG TGTAGATATC
AAAGAAGTGT TTAAAGAGCA AAACTGGGAA AACAAAAAAT TGGAAAGCAA CTTCAGTAAA
TTGGTTGAAG AATGGGAGAG ATTTCTTGAA AGTGGACTAC TAGTAAGAAA GGGGAATAGA
TTCTTTTTAA GTGAGCCTAA TGGCATGGAA CTTAGTAATC AAATTCTTGT CTCTATGTTT
AAGTGGTGGG ATGAGATTAA CTAA
 
Protein sequence
MNKFPRSAYV HIPFCHRRCF YCDFAVIPLG NEVETLKGYG SKTVQEYLQF LYKEILSIKH 
KSPLSTIYIG GGTPSILDPA QIKELVNLFK ENYGIDYGAE ITMEVDPASF TQDDLFGFIN
AGINRFSLGV QSFNNQILQK SGRRHLKEDA VKSCLWLKRE YDNGLIKSWS LDLIQNLPLS
GFKEWQDDLK KAITFSPPHL SIYDLNIEDG TVFKKLINLG KLKLPSDEEA FRNSESTHLI
LKKSGYSRYE ISNYCLPRHQ SRHNRVYWSG LGWWSFGQGS TSSPWGEKFT RPRVSKEYKE
WVTRQDEFNL DSSLTNKGFV YKELDEKIML GLRLKEGVDI KEVFKEQNWE NKKLESNFSK
LVEEWERFLE SGLLVRKGNR FFLSEPNGME LSNQILVSMF KWWDEIN