Gene A9601_15101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_15101 
SymbolhemN 
ID4718234 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp1293792 
End bp1295015 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content32% 
IMG OID640079234 
Productputative oxygen-independent coproporphyrinogen III oxidase 
Protein accessionYP_001009900 
Protein GI123969042 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases 
TIGRFAM ID[TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.574259 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAGT TTCCAAGAAG TGCTTATGTG CACATTCCTT TTTGCCACAG AAGATGTTTT 
TATTGTGATT TTGCAGTTAT TCCATTAGGA AACAAAGTTG AAACTTTAAA AGGTTATGGA
AGCAAAACTG TTCAAGAGTA TTTGCAATTT TTATTTAAAG AAATATTGTC AATTAAACAT
AAATCACCTC TATCGACAAT TTATATAGGA GGTGGTACAC CATCAATTTT AGATCCCAGC
CAAATCAAAG AATTAATTGA TCTTTTTAAA GAAAATTATG GCATTGACTA TGGTGCTGAA
ATCACTATGG AGATTGATCC AGCTAGTTTT ACTCAAGATG ATCTTTTTGG ATTCATAAAT
GCTGGGATAA ATAGATTTAG TCTCGGAGTA CAAAGTTTTA ATAATCAGGT ACTTCAAAAG
TCGGGAAGGC GTCATTTGAA AGAAGATGCA GAAAAATCTT GTTTCTGGTT GAAGAGAGAA
TATGATTCTG GGTTAATAAA AAGCTGGAGT TTAGATTTAA TACAAAACTT GCCACTTAGT
GGATTTAAAG AATGGCAAGA TGACTTAAAA AAAGCAATAA CATTTTCACC GCCGCATCTA
TCTATTTACG ATTTAAATAT TGAAAATGGC ACTGTTTTTA AGAAATTAGT TAATTTAGGC
AAATTAAAAC TCCCAAGTGA TGAAGAAGCT GTTAGAAATA GTGAATCAAC ACATTTAATT
TTAAAAAACT TAGGGTATTC AAGATATGAA ATCTCAAACT ATTGCCTTCC GGGACATCAA
TCGAGACACA ATAGAGTTTA TTGGAGTGGT TTAGGCTGGT GGGGTTTTGG TCAAGGCTCC
ACTAGTTCAC CTTGGGGGGA AAAATTAACT AGACCAAGAG TTAGTAAAGA ATATAAAGAA
TGGGTAATTA GACAATACGA ATTTAATTTA GATTCATCCT TAACTAATAA GGATTTTGTC
TACAAAGAAC TTGATGAGAA AATAATGTTG GGATTAAGAC TCAAAGAGGG TTTAGATATC
AAAAAAGTGT TTCAAGAACA AAACTGGGAG AACAAAAAAT TTGAAAGCAA CTTTAGTAAA
TTGCTCAAAG AATGGGAAAG GTTTCTTGAA AGTGGACTTT TAGTAAGAAA GGGTTATAGA
TTCTTTTTAA GTGAGCCTAA TGGCATGGAA CTAAGCAATC AAGTTCTTGT TTCTATGTTT
AAGTGGTGGG ATGAGATTAA TTAA
 
Protein sequence
MNKFPRSAYV HIPFCHRRCF YCDFAVIPLG NKVETLKGYG SKTVQEYLQF LFKEILSIKH 
KSPLSTIYIG GGTPSILDPS QIKELIDLFK ENYGIDYGAE ITMEIDPASF TQDDLFGFIN
AGINRFSLGV QSFNNQVLQK SGRRHLKEDA EKSCFWLKRE YDSGLIKSWS LDLIQNLPLS
GFKEWQDDLK KAITFSPPHL SIYDLNIENG TVFKKLVNLG KLKLPSDEEA VRNSESTHLI
LKNLGYSRYE ISNYCLPGHQ SRHNRVYWSG LGWWGFGQGS TSSPWGEKLT RPRVSKEYKE
WVIRQYEFNL DSSLTNKDFV YKELDEKIML GLRLKEGLDI KKVFQEQNWE NKKFESNFSK
LLKEWERFLE SGLLVRKGYR FFLSEPNGME LSNQVLVSMF KWWDEIN