Gene Mpe_A3002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A3002 
Symbol 
ID4784691 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3191806 
End bp3192759 
Gene Length954 bp 
Protein Length317 aa 
Translation table11 
GC content75% 
IMG OID640091573 
Producthydroxymethylbilane synthase 
Protein accessionYP_001022190 
Protein GI124268186 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0181] Porphobilinogen deaminase 
TIGRFAM ID[TIGR00212] porphobilinogen deaminase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00423817 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGCGA GCACACAAGC CTGGGTCATC GCGACCCGCG AGAGCCGGTT GGCGCTGTGG 
CAGGCCGAGC ATGTGCGTGC CCTGCTCGGC TCGCGCCTGG TCGAGCCGGT CGAGCTGCTG
GGGATGACGA CGCGCGGCGA CCAGATCCTG GACCGCACGC TCAGCAAGGT CGGTGGCAAG
GGCCTGTTCG TCAAGGAACT CGAGACCGCG CTGGAGGCCG GTGACGCCCA CCTCGCCGTG
CATTCGCTGA AGGACGTGCC GATGGACCTG CCGGCCGGCT TCGTGCTGGC CGCGGTGCTG
GAGCGCGAGG ACCCGCGCGA CGCCTGGGTC TCCCCACGCT ACGCCGACCT GGCCGCGCTG
CCGGCCGGCG CGGTGGTCGG CACCTCGAGC CTGCGGCGGC TCAGCCAGTT GCGGGCGCGC
CGGCCCGACC TGCGCATCGA GCCGCTGCGC GGCAACCTCG ACACCCGCCT GCGCAAGCTC
GACGAGGGTC AGTACGACGC CATCGTGCTG GCCGCCGCCG GCCTGAAGCG CCTGGGCCTG
GCCGAGCGCA TCCGCAGCGT GTTCGAGGCC GACGCGATGA TCCCCGCTGC GGGCCAGGGG
GCGCTCGGCA TCGAGCTGCG GGCCGATGCG CCCGAGCGCC ATCCGGCGCT GTGGGCCGCC
CTGCGGGCGC TGACGCACGA GCCGAGCTGG CTGGCGGTGC ATGCGGAGCG CGCGGTCTCG
CGCGCACTGG GCGGCAGCTG CAGCATGCCG CTGGCGGCGC ATGCGCAATG GCAGGCCGAC
GGCCGGCTGG TCTTGCGGGC GGCGCTCGGC AGCGTGGCCG AGGCCGCGCC CGCGCTGGTG
CACGCCGAGG CCGGCGCGGC TGTGGCCGAC ACCGCCGCGG CCGAGGCGCT GGGCCTCGCG
GTGGCGCAGC AGTTGCGCCA GCGCGGTGGC GACGCGCTGC TGGCGGCGCT CTGA
 
Protein sequence
MEASTQAWVI ATRESRLALW QAEHVRALLG SRLVEPVELL GMTTRGDQIL DRTLSKVGGK 
GLFVKELETA LEAGDAHLAV HSLKDVPMDL PAGFVLAAVL EREDPRDAWV SPRYADLAAL
PAGAVVGTSS LRRLSQLRAR RPDLRIEPLR GNLDTRLRKL DEGQYDAIVL AAAGLKRLGL
AERIRSVFEA DAMIPAAGQG ALGIELRADA PERHPALWAA LRALTHEPSW LAVHAERAVS
RALGGSCSMP LAAHAQWQAD GRLVLRAALG SVAEAAPALV HAEAGAAVAD TAAAEALGLA
VAQQLRQRGG DALLAAL