Gene NATL1_20281 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_20281 
SymbolhemF 
ID4779705 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1674087 
End bp1675133 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content40% 
IMG OID640085321 
Productcoproporphyrinogen III oxidase 
Protein accessionYP_001015848 
Protein GI124026733 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0408] Coproporphyrinogen III oxidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCCTCCT CACAAGATAA AGCCAATTTA CCGGCAAACA ATTCAAGAGC TCGAGCAAAA 
AAGCTTGTTT TAGAGCTTCA AGATGAAATT TGCGCTGGCT TAGAGACTAT TGATGGTGAA
GGCAAATTCC TAGAAGAATC TTGGGAGAGA CCAGAGGGTG GTGGTGGACG CTCAAGAGTC
TTGAAAGATG GAAAAATCTT TGAGCAGGGA GGGGTGAATT TTTCTGAAGT ACATGGCAAT
GAACTTCCAC CCTCAATCAT TAGCCAAAGA CCAGAAGCAA AAGGTCACTC TTGGTTTGCG
ACAGGAACCT CAATGGTTCT TCACCCCAAA AATCCTTACA TACCTACTGT TCATCTTAAT
TACAGATATT TCGAAGCAGG TCCCGTTTGG TGGTTTGGGG GAGGGGCTGA TTTAACACCT
TTCTATCCAT ATCTATCTGA CACGCGTCAC TTTCACTCAT GCCATAAAAA TGCATGTGAC
ACAATTGACA AAGATCTACA TAAGGTTTTC AAACCTTGGT GTGACGAATA TTTCTTTCTA
AAACATAGAA ATGAGACTAG AGGAGTTGGA GGTATCTTTT ATGACTATCA AGACGGATCC
GGCTTGCTTT ATAAAGGACA AAATGCCAAT GGGAAGGCCT CTAAAATCGC AAAAGAGCTA
GGGGAATATT CTTTGAATTG GGAAAATCTT TTTTCACTTG CCAAAGCATG CGGGCAGGCA
TTTCTGCCCT CCTATGAGCC AATAATAAAA AAGCGAAAAA ATCAAAGCTT TTCAACCAAA
GAACGAGACT TTCAACTTTA TAGGCGAGGT AGATATGCTG AGTTCAATTT GGTATGGGAT
AGAGGAACCA TTTTTGGATT ACAAACGAAT GGAAGAACGG AGTCAATATT GATGTCATTA
CCTCCCTTAG CAAGATGGGA GTATGGATAT AAGCCAGAAG AAAATTCACG TGAGGCTCTA
TTAACAGATT TATTTACTAA GCCTCAAGAT TGGTTTACAG ATAAATCTCT GGAAAAGAGG
TGTTTAACTC ATCAAGCATT GGATTAG
 
Protein sequence
MSSSQDKANL PANNSRARAK KLVLELQDEI CAGLETIDGE GKFLEESWER PEGGGGRSRV 
LKDGKIFEQG GVNFSEVHGN ELPPSIISQR PEAKGHSWFA TGTSMVLHPK NPYIPTVHLN
YRYFEAGPVW WFGGGADLTP FYPYLSDTRH FHSCHKNACD TIDKDLHKVF KPWCDEYFFL
KHRNETRGVG GIFYDYQDGS GLLYKGQNAN GKASKIAKEL GEYSLNWENL FSLAKACGQA
FLPSYEPIIK KRKNQSFSTK ERDFQLYRRG RYAEFNLVWD RGTIFGLQTN GRTESILMSL
PPLARWEYGY KPEENSREAL LTDLFTKPQD WFTDKSLEKR CLTHQALD