Gene NATL1_06391 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_06391 
SymbolhemE 
ID4779337 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp579581 
End bp580639 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content41% 
IMG OID640083917 
Producturoporphyrinogen decarboxylase 
Protein accessionYP_001014466 
Protein GI124025350 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0407] Uroporphyrinogen-III decarboxylase 
TIGRFAM ID[TIGR01464] uroporphyrinogen decarboxylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.401207 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.983268 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGAAA CTACTCCTTT ACTACTCCGT GCAGCTCGCG GAGAACATGT TGAAAGGCCC 
CCAGTTTGGA TGATGAGACA AGCGGGGAGA TACATGAAGG TATATCGCGA CCTTCGTGAT
AATCATCCAA GCTTCAGGGA AAGATCCGAA AACCCCGATC TTTCTTATGA AATTTCAATG
CAACCTTTTA CAGCTTTTCA ACCAGATGGA GTGATACTTT TTTCAGATAT CTTGACTCCT
CTACCTGGGA TGGGAATTAA CTTTGACATC GTTGAAAGCA AAGGACCCTT AATAAATGAC
CCAATAAGAA GCCTCAAACA GGTTAAAGAC CTTAAACCCC TTCAACCAGA AGAAAGCATG
TCTTTTGTTG GGGAAGTCCT TGGAAGGCTA AGGGAAAGCG TTGGGAACAA GGCTGCAGTT
CTTGGGTTTG TAGGTGCTCC TTGGACTCTT GCTGCATATG TTGTAGAGGG AAAAAGCAGC
AAAAATTATG CAGTTATAAA GGCGATGGCA TTCCAAGAGC CAGAACTACT GCATCAACTT
TTAAATCACT TTGCAGAATC AATTGCAAAC TATTTATCCT ATCAAATTCA ATCTGGGGCC
CAAGTAGTTC AAATGTTTGA TTCATGGGCA GGACAATTAA GTCCACAAGA TTATGACGAG
TTTGCTGCGC CTTATCAACA AAAAGTAGTC AATTTAGTAA AAGAAAAACA TCCAGATACA
CCTATGATTT TATACATCTC AGGTAGCGCG GGAGTTCTTG AAAGGATGGG ACAAACCGGA
GTAGATATAG TCTCTCTAGA TTGGACTGTT GACATGGCAG ATGGACTAAA AAGGCTGCCT
CAATCAGTTG GAGTCCAAGG AAATGTTGAT CCAGGACTTT TGTTTGGTAC TCCTGATGCG
ATCAGATCAA GAATTGTTGA TGTCGTCAAA AAAGCGAAAG GTAGAAAACA TATTCTTAAC
CTTGGTCATG GAATACTTCC AGGGACGCCA GAAGAAAATG CAAGAGTATT TTTCGAGGCT
GGTAAAAATG TGAATGAACT CATAAAAGTT TCATCTTGA
 
Protein sequence
MNETTPLLLR AARGEHVERP PVWMMRQAGR YMKVYRDLRD NHPSFRERSE NPDLSYEISM 
QPFTAFQPDG VILFSDILTP LPGMGINFDI VESKGPLIND PIRSLKQVKD LKPLQPEESM
SFVGEVLGRL RESVGNKAAV LGFVGAPWTL AAYVVEGKSS KNYAVIKAMA FQEPELLHQL
LNHFAESIAN YLSYQIQSGA QVVQMFDSWA GQLSPQDYDE FAAPYQQKVV NLVKEKHPDT
PMILYISGSA GVLERMGQTG VDIVSLDWTV DMADGLKRLP QSVGVQGNVD PGLLFGTPDA
IRSRIVDVVK KAKGRKHILN LGHGILPGTP EENARVFFEA GKNVNELIKV SS