Gene P9303_19991 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_19991 
SymbolhemN 
ID4776655 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1757163 
End bp1758473 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content56% 
IMG OID640087513 
Productputative oxygen-independent coproporphyrinogen III oxidase 
Protein accessionYP_001018006 
Protein GI124023699 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases 
TIGRFAM ID[TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.206025 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGAAG GTCTTGCTGT GTTGCCACCT CGCAGTGCCT ATCTGCACAT TCCTTTCTGC 
CATCGGCGTT GCTTTTATTG CGATTTCGCT GTCGTGCCGC TTGGCGATCA TGCCAATGGG
GCTAAGGGCT CAGGTAGCGC CTCGATTCAG TCTTATCTGC AGCTATTGCA ACGGGAGATT
GCGCTTGTCA AGCCTGGGCC AACGCTGGCC ACGGTGTACA TCGGTGGCGG AACACCATCT
CTGCTCAGCT CAGCTCAGAT TGGGGCTCTG TTGGATCAGC TACGGCAACG GTTTGGCGTT
CAGCTTGGTG CAGAAATCAC ACTGGAAATG GATCCAGCTA GTTTCGATCA GGCTTACCTA
GCGGCCGTAT TAGCGGCTGG TGTCAACAGG GTGAGCTTGG GGGGGCAGAG TTTCGATGAT
GCAGTGCTCG AGACGCTTGG GCGCCGTCAT CGTCGCCACC ATTTACTGGA GGCGTGCGGA
TGGTTGCATC AGGCTCATCA GTGCGGAGAG CTGAAAAGTT GGAGTTTGGA TCTCATCCAG
AACCTGCCAG GACAGGAGTT GGTGGCTTGG AAGCAACAGC TTGTTGAGGC CATTGATACT
GGTTCACCTC ATCTTTCGAT CTACGATTTA TCTGTTGAAC CAGGCACGGT ATTTGCTTGG
CGTCAGCGGC GGGGGGAGTT GGATTTGCCC GATGATGATT TAGCAGCTGA GCAGATGCAG
ACCACCAGTG TCTTGCTTCG TCAGGCAGGA TTTGGCCGCT ATGAAATCTC CAATTACGCC
TTGCCAGGGC ACGCCTCGCG CCACAACCGC GTGTATTGGA GTGGGGCAGG GTGGTGGGCG
TTTGGTCAGG GTGCGACCAG TGCGCCCTGG GGCGAGAGGC TGGCTCGTCC GCGCACAAGG
GATGGTTACT GCAACTGGAT TGAGGTTCAG GAGGTAGAAG GACTGGATTC CTCTCTCGTT
GCGGCTCAGG CAAGACCCTT ACCTCTGGAT GAACAGTTGT TGGTGGGTTT GCGTTGTCGT
GAGGGAGTTG ATCTTGAGGC CCTCAGTAGG GCTTGGGGCT GGACTCATGA GCAGTGCAAC
GCTCTATTGC CGTCGTTGCA GGTGCGCTGG CAGGCGGCGT TGGATCGGGG CTGGTTAGAG
CTGCATGGAC GACGTTGGCA GCTCAGTGAT CCCGAGGGGA TGGCCATCAG CAATCAGGTT
CTGGTTGAGA TGTTGTTGTG GTGGCAGTCT CTGCCTGCTG ATGCAGTTGC TTCACCCAAC
CTTGAAGGGC TTCTACGCAC AGCTGGCGAC CTTGGATCAA TGGCGGACTG A
 
Protein sequence
MREGLAVLPP RSAYLHIPFC HRRCFYCDFA VVPLGDHANG AKGSGSASIQ SYLQLLQREI 
ALVKPGPTLA TVYIGGGTPS LLSSAQIGAL LDQLRQRFGV QLGAEITLEM DPASFDQAYL
AAVLAAGVNR VSLGGQSFDD AVLETLGRRH RRHHLLEACG WLHQAHQCGE LKSWSLDLIQ
NLPGQELVAW KQQLVEAIDT GSPHLSIYDL SVEPGTVFAW RQRRGELDLP DDDLAAEQMQ
TTSVLLRQAG FGRYEISNYA LPGHASRHNR VYWSGAGWWA FGQGATSAPW GERLARPRTR
DGYCNWIEVQ EVEGLDSSLV AAQARPLPLD EQLLVGLRCR EGVDLEALSR AWGWTHEQCN
ALLPSLQVRW QAALDRGWLE LHGRRWQLSD PEGMAISNQV LVEMLLWWQS LPADAVASPN
LEGLLRTAGD LGSMAD