Gene NATL1_17311 details

Gene Information

Locus tag: NATL1_17311
Symbol: hemN
ID: 4781146
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 1417590
End bp: 1418822
Gene Length: 1233 bp
Protein Length: 410 aa
Translation table: 11
GC content: 37%
IMG OID: 640085018
Product: putative oxygen-independent coproporphyrinogen III oxidase
Protein accession: YP_001015551
Protein GI: 124026436
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases
TIGRFAM ID: [TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase
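
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the CDS is complete and ends in a single stop codon. The function names are hypothetical and not part of the record.

```python
# Values copied from the record above.
START_BP = 1417590
END_BP = 1418822
GENE_LENGTH_BP = 1233
PROTEIN_LENGTH_AA = 410


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes a complete CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1233
    print(expected_protein_length(GENE_LENGTH_BP))  # 410
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields.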


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.24625
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTAGCCC CACCAAGAAG TGCATATTTG CATATACCCT TTTGCCATAA AAGATGTTTT 
TACTGCGATT TTTCGATTAT CCCTTTAGGT GATAGCGCTC AAGCTCCAGA TAGTCCAGGG
ATAACTTCCA TTAACGCATA TTTGGATTTA CTTCATAGAG AGATTTCAAT TTCCCCTAAA
GGTCCTGCAT TATCAACAAT TTATTTGGGC GGTGGAACAC CTTCCTTATT AAAAAAAAAT
GAGGTGGGTG ATTTGTTGCA AAATCTTCAA AGAAAATTTG GATTTCAAGA TGGTGCAGAG
ATAACTATGG AGGTTGATCC AGCAACATTT TTTGAAAACG ACCTGCATGG ATACATAGGG
ATTGGAATTA ATAGATTTAG CTTGGGTGCG CAGGCATTTG ATGACAGTAC TTTGGCTTCA
ATTGGTAGAA AACATAATCG CTCACAATTA ATTGAGGCTT GTGATTGGGT AAATAATTTA
TTTAAAAAAG GAATGCTAAG AAGCTGGAGT CTTGATTTGA TTCAAAACCT TCCAGGATTA
AATTTGTCCA AGTGGATTAA AGAGTTGGAA CAAGCGGTTC GTACGGAGGC CCCTCATTTG
TCAATTTATG ATTTAACTAT CGAACCAGAT ACTGTTTTTG GAAGACTACA TAAAAAAGGA
AAGTTGAATA TCCCAATTGA TTCTGAATCT CAAAAAATAG ATTTTGAAAC CACTAGATTA
CTAAAAAGTA GAGGTTTCGC TAGATACGAG ATTTCAAGTT ATTCATTGCC TGGTCATGCA
TCTAGACATA ATCGCATGTA TTGGAGTGGT TCTGGTTGGT GGGGCTTTGG GATGGGGTCA
ACAAGTGCTC CTTGGGGGGA AAGATTCTCT AGACCAAGAA CAATTGCTGG TTATAAAAAA
TGGCTTGAAC AGCAAGAGAG TCAGTCATTA GAGAAAACTT TGTCTATTGA AAAGTCAAAA
TCAATGCCAT TGGATGAACT TCTGATGATT GGTCTTAGAA GACGGGAGGG TATTCATCTG
GAGGAACTTG CTAAAAATGC TGGATGGACA CAAAAAAAAT GTGATAAGAA TTTAAAATCA
CTTGAGAAAT TTTGGCTAAA TTCTATAAAT GAAGGATTTC TATTAAGACA CAATGGTAGA
TATTTTTTAA GTGATCCTAA GGGGTTTCAA ATTAGCAATC AGATTTTGAT TCAGATGTTT
TCGTGGTGGG ATTCACTTGA TCGAGATCAG TAG
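
The gene sequence above is printed in blocks of ten bases, so the whitespace has to be removed before the length or GC content can be recomputed. The following is a minimal sketch of how a value like the GC content field could be reproduced. The helper names are hypothetical, and the database's exact rounding rule is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a blocked sequence listing and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence above, used only as a small sample.
    first_line = "ATGGTAGCCC CACCAAGAAG TGCATATTTG CATATACCCT TTTGCCATAA AAGATGTTTT"
    seq = clean_sequence(first_line)
    print(len(seq), round(gc_percent(seq), 1))
```

Passing the full listing through clean_sequence and gc_percent gives the figure to compare against the reported GC content.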
 
Protein sequence
MVAPPRSAYL HIPFCHKRCF YCDFSIIPLG DSAQAPDSPG ITSINAYLDL LHREISISPK 
GPALSTIYLG GGTPSLLKKN EVGDLLQNLQ RKFGFQDGAE ITMEVDPATF FENDLHGYIG
IGINRFSLGA QAFDDSTLAS IGRKHNRSQL IEACDWVNNL FKKGMLRSWS LDLIQNLPGL
NLSKWIKELE QAVRTEAPHL SIYDLTIEPD TVFGRLHKKG KLNIPIDSES QKIDFETTRL
LKSRGFARYE ISSYSLPGHA SRHNRMYWSG SGWWGFGMGS TSAPWGERFS RPRTIAGYKK
WLEQQESQSL EKTLSIEKSK SMPLDELLMI GLRRREGIHL EELAKNAGWT QKKCDKNLKS
LEKFWLNSIN EGFLLRHNGR YFLSDPKGFQ ISNQILIQMF SWWDSLDRDQ
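
The protein sequence can be re-derived from the gene sequence by codon translation. The sketch below uses the standard codon assignments, which translation table 11 shares for elongation codons; table 11 differs in the start codons it permits, which does not matter here because this gene begins with ATG. The translate function is a hypothetical helper, not the tool used to build this record, and it does not handle ambiguous bases such as N.

```python
from itertools import product

BASES = "TCAG"
# Amino acids listed in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate("ATGGTAGCCC CACCAAGAAG TGCATATTTG"))  # MVAPPRSAYL
```

The printed residues match the first ten residues of the protein sequence above.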