Gene P9301_02361 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_02361 
SymbolhemB 
ID4911902 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp219896 
End bp220897 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content35% 
IMG OID640159802 
Productdelta-aminolevulinic acid dehydratase 
Protein accessionYP_001090460 
Protein GI126695574 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0113] Delta-aminolevulinic acid dehydratase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTCGA TTATTCGTCC TAGAAGATTA AGAAGAACTG AAGCAATTAG AGAAATGGTT 
AGAGAAAACC ATCTAATGGC ATCTGACTTT ATATATCCAT TATTTATTCA TGAAAAAGAC
TTTAAAGAGG AAATTTCAGC AATGCCCGGA ACTTATAGAT GGGATATTAA TGGCTTAATA
AAGGAGGTTA CTAGGGCATG GCAATTAGGA ATAAGGTGTG TAGTTCTTTT CCCAAAAGTT
AACGATAGTT TAAAGACTGA AGATGGAGCA GAATGTTTTA ATGAAGACGG TTTAATTCCT
AGGGCTATTC GAATCTTAAA AAAAGAGATT CCAGAAATGG CAATAATGAC AGATGTTGCC
TTGGATCCAT ACTCTTGTGA TGGTCATGAT GGATTAGTTG ATGAAACTGG AAAAATATTG
AACGACGAAA CAATTGAAAT TTTAAAAAAA CAAGCTTTAA CACAAGCAAG AGCTGGAGCG
GATTTTATTG GTCCTAGTGA CATGATGGAT GGGAGAGTTG GAGCAATCAG AGCTGCTCTC
GATAGCCAAG GATTTAGTGA TGTAGGTATT ATTAGTTATA CAGCAAAATA TTCATCTGCT
TATTATGGAC CATTTAGAAC TGCTTTAGAT TCGGCTCCTA GAGAAAATAG TAAGAAAATA
ATTCCAGACA ATAAGTCTAC ATATCAAATG GACCCTGCGA ATTCTAAAGA GGCTTTGATT
GAATCTGCAT TGGATCAGTA TGAAGGAGCT GATATTTTGA TGGTAAAACC AGGAATTTCA
TACTTGGATA TTGTTTATAG ATTAAGCACT TTTTCAAATA AACCTATAGC TGCATACAAC
GTTAGTGGGG AGTATTCCAT GGTAAAGTCT GCTGCTATGA AGAACTGGAT TAACGAGAAA
GATATTGTAT TAGAAACATT GCTTAGTTTT AAAAGAGCAG GAGCAAAGTT AATACTAACT
TATCATGCTT GTGATGCATC TCAATGGTTG CAGGATACTT AA
 
Protein sequence
MNSIIRPRRL RRTEAIREMV RENHLMASDF IYPLFIHEKD FKEEISAMPG TYRWDINGLI 
KEVTRAWQLG IRCVVLFPKV NDSLKTEDGA ECFNEDGLIP RAIRILKKEI PEMAIMTDVA
LDPYSCDGHD GLVDETGKIL NDETIEILKK QALTQARAGA DFIGPSDMMD GRVGAIRAAL
DSQGFSDVGI ISYTAKYSSA YYGPFRTALD SAPRENSKKI IPDNKSTYQM DPANSKEALI
ESALDQYEGA DILMVKPGIS YLDIVYRLST FSNKPIAAYN VSGEYSMVKS AAMKNWINEK
DIVLETLLSF KRAGAKLILT YHACDASQWL QDT