Gene A9601_02351 details


Gene Information

Locus tag: A9601_02351
Symbol: hemB
ID: 4716919
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 218928
End bp: 219929
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 35%
IMG OID: 640077934
Product: delta-aminolevulinic acid dehydratase
Protein accession: YP_001008630
Protein GI: 123967772
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0113] Delta-aminolevulinic acid dehydratase
TIGRFAM ID:
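
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; neither convention is stated in the record itself, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 218928
end_bp = 219929
reported_gene_length = 1002      # bp
reported_protein_length = 333    # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Number of residues, ASSUMING one terminal stop codon in the CDS."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print("gene length matches:", computed_gene_length == reported_gene_length)
print("protein length matches:", computed_protein_length == reported_protein_length)
```

Under those assumptions, both computed values agree with the reported Gene Length and Protein Length fields.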


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATTCGA TTATTCGTCC AAGAAGATTA AGAAGAACTG AGTCAATAAG AGAAATGGTT 
AGAGAAAACC ATTTGGCGGC ATCGGACTTT ATCTATCCAT TATTTATTCA TGAGAAAGAT
TTTAAAGAGG AAATTTCCGC AATGCCCGGA ACTTATAGAT GGGATATTGA TGGTTTACTA
AAGGAGGTTA CTAGGGCATG GGAATTGGGA ATTAGATGTG TGGTTCTTTT CCCAAAAATT
AATGATAGCT TAAAGACTGA AGATGGAGCA GAATGTTTTA ATGAGGACGG TTTAATACCT
AAAGCTATTC GAATATTAAA AAAAGAGATT CCAGAAATGG CAATAATGAC AGATGTTGCC
TTGGACCCTT ACTCCTGTGA TGGACATGAT GGCTTAGTTG ATGAAACTGG AAAAATATTG
AATGATGAAA CGATTGAAAT TTTAAAAAAA CAAGCTTTAA CTCAAGCTAG AGCTGGAGCA
GATTTTATTG GCCCTAGTGA CATGATGGAT GGGAGAGTTG GAGCAATTAG GACTGCTCTT
GATAGTGAAG GATTTAGTGA TGTAGGTATT ATTAGTTATA CAGCTAAATA TTCATCTGCT
TATTATGGTC CGTTTAGAAC TGCTTTAGAT TCGGCTCCTA GAGAAAATAG TAAGAAAGTA
ATTCCAGACA ATAAGTCTAC ATATCAAATG GACCCTGCCA ATTCAAAAGA GGCTTTAATT
GAATCTGCAT TGGATCAGTA TGAAGGAGCT GATATTTTGA TGGTAAAACC AGGAATTTCA
TATTTGGATA TTGTTTATAG AATAAGCACA TTTTCAAATA AGCCCATAGC TGCATACAAC
GTTAGTGGGG AGTATTCCAT GGTAAAGTCT GCTGCTATGA AGAACTGGAT TAACGAAAAA
GATATTGTTT TAGAGACATT GCTTAGTTTT AAAAGAGCAG GAGCAAAATT AATACTCACT
TATCATGCTT GTGATGCATC TCAATGGTTG CAGGATACTT AA
 
Protein sequence
MNSIIRPRRL RRTESIREMV RENHLAASDF IYPLFIHEKD FKEEISAMPG TYRWDIDGLL 
KEVTRAWELG IRCVVLFPKI NDSLKTEDGA ECFNEDGLIP KAIRILKKEI PEMAIMTDVA
LDPYSCDGHD GLVDETGKIL NDETIEILKK QALTQARAGA DFIGPSDMMD GRVGAIRTAL
DSEGFSDVGI ISYTAKYSSA YYGPFRTALD SAPRENSKKV IPDNKSTYQM DPANSKEALI
ESALDQYEGA DILMVKPGIS YLDIVYRIST FSNKPIAAYN VSGEYSMVKS AAMKNWINEK
DIVLETLLSF KRAGAKLILT YHACDASQWL QDT
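
The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, assuming the sequence is given 5' to 3' on the coding strand. It uses the standard codon assignments, which translation table 11 shares (table 11 differs in its permitted start codons, and this gene starts with ATG). Only the first and last sequence blocks are translated here; the names `translate`, `first_block`, and `last_block` are illustrative.

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 0, ignoring whitespace, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence as printed in this record.
# The last line starts at nucleotide 961, so it is in frame.
first_block = "ATGAATTCGA TTATTCGTCC AAGAAGATTA AGAAGAACTG AGTCAATAAG AGAAATGGTT"
last_block = "TATCATGCTT GTGATGCATC TCAATGGTTG CAGGATACTT AA"

print(translate(first_block))  # compare with the start of the protein sequence
print(translate(last_block))   # compare with the end of the protein sequence
```

Passing the full gene sequence to `translate` in the same way would allow the whole 333-residue protein to be compared.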