Gene P9211_02361 details

Gene Information

Locus tag: P9211_02361
Symbol: hemB
ID: 5731638
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 227496
End bp: 228506
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 41%
IMG OID: 641284580
Product: delta-aminolevulinic acid dehydratase
Protein accession: YP_001550121
Protein GI: 159902777
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0113] Delta-aminolevulinic acid dehydratase
TIGRFAM ID:
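
The Start bp, End bp, Gene Length, and Protein Length fields are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(227496, 228506)
print(length_bp)                  # 1011
print(protein_length(length_bp))  # 336
```

Both results agree with the Gene Length and Protein Length fields in the record.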


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.984887
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGTTAA CTTATCGTCC TCGACGTCTG CGTAGAACAG CTTCGTTAAG AAGCTTGGTT 
AGGGAAAATA TTTTAACTGC ATCTGATTTT ATCTACCCTC TATTTATTCA TGAAGGTCAA
GATGTTCAGC CAATTGGTGC GATGCCAGGA GCCAATCGTT GGAGTTTAGA TGCTCTTGTT
GGCGAGGTAG AAAGGGCTTG GGAACTAGGT GTTAAATGTG TAGTGCTTTT CCCAAAAGTG
TCTGAAGAAC TTAAATCTGA GGATGGCGCT GAGTGTTTTA ATGCAAATGG CTTAATCCCC
AAAGCGATTA GCCGTTTAAA GCAAGAGCTT CCTGAGATGA CGATAATGAC TGATGTTGCA
CTGGATCCAT ATTCAAGTGA TGGCCACGAT GGGATCGTTA GTTCTGATGG AGTTGTATTA
AATGATGAAA CTGTTGATAG TCTTTGCAAG CAAGCAATTG TACAGGCTCA AGCAGGTGCT
GATCTAATTG GTCCCAGCGA TATGATGGAT GGCCGTGTTG GTGCTATTCG TGAATCTTTA
GATGATGAGG GTTTTGAGCA TGTAGGAATT ATTAGTTATA CGGCTAAATA TTCTTCCGCT
TATTACGGGC CTTTTCGTGA GGCATTAGAT TCGGCCCCAA AGGTAATTAA TAAAAAACCT
ATTCCTAAAA ATAAAAATAG TTATCAAATG GATCCTGCTA ACTCACGTGA AGCAATTACA
GAAGCTCAAC TTGATGAGCA AGAAGGGGCT GATATTTTGA TGGTGAAGCC AGGCTTAGCG
TATCTGGACA TTATCTATCG TTTACGACAA GAATCTGAAT TGCCAATTGC TGCTTACAAC
GTTAGTGGAG AGTATGCAAT GGTTAAGGCT GCTGCGGAAA AATCATGGAT AGATGAGAAA
TCGGTAGTTC TTGAAACGCT ATTGAGTTTT AAAAGAGCAG GTGCAGACTT AATACTTACT
TATCATGCAT GTGATGCTGC ACATTGGTTG AGAGATGGAT TATTTGACTA A
 
Protein sequence
MELTYRPRRL RRTASLRSLV RENILTASDF IYPLFIHEGQ DVQPIGAMPG ANRWSLDALV 
GEVERAWELG VKCVVLFPKV SEELKSEDGA ECFNANGLIP KAISRLKQEL PEMTIMTDVA
LDPYSSDGHD GIVSSDGVVL NDETVDSLCK QAIVQAQAGA DLIGPSDMMD GRVGAIRESL
DDEGFEHVGI ISYTAKYSSA YYGPFREALD SAPKVINKKP IPKNKNSYQM DPANSREAIT
EAQLDEQEGA DILMVKPGLA YLDIIYRLRQ ESELPIAAYN VSGEYAMVKA AAEKSWIDEK
SVVLETLLSF KRAGADLILT YHACDAAHWL RDGLFD
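
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The sketch below shows one way to do that and then translate the DNA, using only the first line of the gene sequence to keep the example short. It assumes the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard genetic code (the tables differ in their permitted start codons). The names `join_blocks`, `gc_fraction`, and `translate` are hypothetical helpers, and the sketch does not handle alternative start codons, which a full table 11 translator would read as methionine at position 1.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence in this record.
first_line = "ATGGAGTTAA CTTATCGTCC TCGACGTCTG CGTAGAACAG CTTCGTTAAG AAGCTTGGTT"
dna = join_blocks(first_line)
print(len(dna))        # 60
print(translate(dna))  # MELTYRPRRLRRTASLRSLV
```

The 20 residues produced match the first two blocks of the protein sequence listed above. Passing the full joined gene sequence to `gc_fraction` would give a value to compare against the GC content field.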