Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_19991 |
Symbol | hemN |
ID | 4776655 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1757163 |
End bp | 1758473 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640087513 |
Product | putative oxygen-independent coproporphyrinogen III oxidase |
Protein accession | YP_001018006 |
Protein GI | 124023699 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases |
TIGRFAM ID | [TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.206025 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAGAAG GTCTTGCTGT GTTGCCACCT CGCAGTGCCT ATCTGCACAT TCCTTTCTGC CATCGGCGTT GCTTTTATTG CGATTTCGCT GTCGTGCCGC TTGGCGATCA TGCCAATGGG GCTAAGGGCT CAGGTAGCGC CTCGATTCAG TCTTATCTGC AGCTATTGCA ACGGGAGATT GCGCTTGTCA AGCCTGGGCC AACGCTGGCC ACGGTGTACA TCGGTGGCGG AACACCATCT CTGCTCAGCT CAGCTCAGAT TGGGGCTCTG TTGGATCAGC TACGGCAACG GTTTGGCGTT CAGCTTGGTG CAGAAATCAC ACTGGAAATG GATCCAGCTA GTTTCGATCA GGCTTACCTA GCGGCCGTAT TAGCGGCTGG TGTCAACAGG GTGAGCTTGG GGGGGCAGAG TTTCGATGAT GCAGTGCTCG AGACGCTTGG GCGCCGTCAT CGTCGCCACC ATTTACTGGA GGCGTGCGGA TGGTTGCATC AGGCTCATCA GTGCGGAGAG CTGAAAAGTT GGAGTTTGGA TCTCATCCAG AACCTGCCAG GACAGGAGTT GGTGGCTTGG AAGCAACAGC TTGTTGAGGC CATTGATACT GGTTCACCTC ATCTTTCGAT CTACGATTTA TCTGTTGAAC CAGGCACGGT ATTTGCTTGG CGTCAGCGGC GGGGGGAGTT GGATTTGCCC GATGATGATT TAGCAGCTGA GCAGATGCAG ACCACCAGTG TCTTGCTTCG TCAGGCAGGA TTTGGCCGCT ATGAAATCTC CAATTACGCC TTGCCAGGGC ACGCCTCGCG CCACAACCGC GTGTATTGGA GTGGGGCAGG GTGGTGGGCG TTTGGTCAGG GTGCGACCAG TGCGCCCTGG GGCGAGAGGC TGGCTCGTCC GCGCACAAGG GATGGTTACT GCAACTGGAT TGAGGTTCAG GAGGTAGAAG GACTGGATTC CTCTCTCGTT GCGGCTCAGG CAAGACCCTT ACCTCTGGAT GAACAGTTGT TGGTGGGTTT GCGTTGTCGT GAGGGAGTTG ATCTTGAGGC CCTCAGTAGG GCTTGGGGCT GGACTCATGA GCAGTGCAAC GCTCTATTGC CGTCGTTGCA GGTGCGCTGG CAGGCGGCGT TGGATCGGGG CTGGTTAGAG CTGCATGGAC GACGTTGGCA GCTCAGTGAT CCCGAGGGGA TGGCCATCAG CAATCAGGTT CTGGTTGAGA TGTTGTTGTG GTGGCAGTCT CTGCCTGCTG ATGCAGTTGC TTCACCCAAC CTTGAAGGGC TTCTACGCAC AGCTGGCGAC CTTGGATCAA TGGCGGACTG A
|
Protein sequence | MREGLAVLPP RSAYLHIPFC HRRCFYCDFA VVPLGDHANG AKGSGSASIQ SYLQLLQREI ALVKPGPTLA TVYIGGGTPS LLSSAQIGAL LDQLRQRFGV QLGAEITLEM DPASFDQAYL AAVLAAGVNR VSLGGQSFDD AVLETLGRRH RRHHLLEACG WLHQAHQCGE LKSWSLDLIQ NLPGQELVAW KQQLVEAIDT GSPHLSIYDL SVEPGTVFAW RQRRGELDLP DDDLAAEQMQ TTSVLLRQAG FGRYEISNYA LPGHASRHNR VYWSGAGWWA FGQGATSAPW GERLARPRTR DGYCNWIEVQ EVEGLDSSLV AAQARPLPLD EQLLVGLRCR EGVDLEALSR AWGWTHEQCN ALLPSLQVRW QAALDRGWLE LHGRRWQLSD PEGMAISNQV LVEMLLWWQS LPADAVASPN LEGLLRTAGD LGSMAD
|
| |