Gene P9303_03881 details


Gene Information

Locus tag: P9303_03881
Symbol: hemB
ID: 4776800
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 390435
End bp: 391436
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 55%
IMG OID: 640085891
Product: delta-aminolevulinic acid dehydratase
Protein accession: YP_001016405
Protein GI: 124022098
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0113] Delta-aminolevulinic acid dehydratase
TIGRFAM ID:
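
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon; both assumptions fit the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(390435, 391436)
print(length)                  # 1002
print(protein_length(length))  # 333
```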


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0840908
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATCTCA CTTATCGCCC TCGTCGGCTG CGTCGTACTC CTGCCCTGCG TTCCATGGTG 
CGGGAGCACA GCCTCACTGC TGCTGACTTT ATCTATCCCT TGTTTGTCCA TGAGGGAGCT
GATGTGGAGC CGATCGGTGC CATGCCAGGA ACCAATCGCT GGAGCCTGGA TCGCTTAATG
GGAGAAGTTC AACGGGCCTG GGGTCTGGGC ATTCGTTGTG TGGTGCTTTT CCCGAAAGTC
GCTGAGGGCC TCAAAACTGA AGATGGGGCG GAATGTTTCA GTGAGCATGG GTTGATTCCG
CGGGCCATCA CGCAGCTCAA GCATGAGCTG CCGGAAATGA CAGTGATGAC CGATGTGGCA
TTGGATCCCT ATTCCTGTGA TGGTCATGAC GGCATCGTGA GCGCCGAGGG TGTGGTGCTG
AACGACGAGA CGATTGAACA ACTCTGTCGT CAGGCTGTTG TTCAGGCTCG TGCTGGTGCT
GACCTGATTG GACCCAGCGA CATGATGGAT GGTCGCGTCG GTGCGATCCG TGAAGCTCTT
GACGACGAGG GTTTTGAGCA TGTGGGGATT ATTAGCTATA CAGCGAAATA CTCTTCGGCG
TACTACGGCC CGTTTCGCGA GGCGCTTGAT TCAGCCCCGC GCGTGGCTGG TGGCAAGCCA
ATTCCTAAAG ACAAGAGCAC GTATCAAATG GATCCAGCCA ATGCTCGTGA GGCGATTACT
GAGGCTCAAC TCGATGAGCA GGAGGGTGCA GATATTCTCA TGGTGAAGCC TGGCTTGGCC
TATCTCGACA TCATTCATCG TTTAAGCGAA GAATCGGAGT TACCCATCGC TGCTTACAAC
GTGAGTGGCG AGTATTCGAT GGTGAAGGCT GCTGCTGAAA GGGGTTGGCT TGATGAGCGG
GCGGTGGTCT TAGAAACCTT GTTGAGTTTC AAGCGAGCTG GTGCCGATTT GATCCTCACC
TACCACGCTT GTGATGCAGC CGCTTGGTTA CGGCAGGGAT GA
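
The GC content field in the record can in principle be recomputed from the gene sequence. The sketch below shows one way to do so, assuming the sequence is pasted exactly as laid out above (blocks of 10 bases separated by spaces and line breaks). The function name is hypothetical, and only the first row is used as sample input; the full sequence would be passed the same way.

```python
def gc_percent(blocked_dna: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    dna = "".join(blocked_dna.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First row of the gene sequence only; paste all rows for the whole-gene value.
first_row = "ATGGATCTCA CTTATCGCCC TCGTCGGCTG CGTCGTACTC CTGCCCTGCG TTCCATGGTG"
print(round(gc_percent(first_row), 1))
```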
 
Protein sequence
MDLTYRPRRL RRTPALRSMV REHSLTAADF IYPLFVHEGA DVEPIGAMPG TNRWSLDRLM 
GEVQRAWGLG IRCVVLFPKV AEGLKTEDGA ECFSEHGLIP RAITQLKHEL PEMTVMTDVA
LDPYSCDGHD GIVSAEGVVL NDETIEQLCR QAVVQARAGA DLIGPSDMMD GRVGAIREAL
DDEGFEHVGI ISYTAKYSSA YYGPFREALD SAPRVAGGKP IPKDKSTYQM DPANAREAIT
EAQLDEQEGA DILMVKPGLA YLDIIHRLSE ESELPIAAYN VSGEYSMVKA AAERGWLDER
AVVLETLLSF KRAGADLILT YHACDAAAWL RQG
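
The protein sequence corresponds to the gene sequence translated codon by codon. A minimal sketch of that translation follows. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in the permitted start codons; alternative starts are not handled here, which is acceptable for this gene because it begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order (same as the standard code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    dna = "".join(blocked_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence; it yields the first 20 residues of the protein.
first_row = "ATGGATCTCA CTTATCGCCC TCGTCGGCTG CGTCGTACTC CTGCCCTGCG TTCCATGGTG"
print(translate(first_row))  # MDLTYRPRRLRRTPALRSMV
```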