Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A3002 |
Symbol | |
ID | 4784691 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 3191806 |
End bp | 3192759 |
Gene Length | 954 bp |
Protein Length | 317 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 640091573 |
Product | hydroxymethylbilane synthase |
Protein accession | YP_001022190 |
Protein GI | 124268186 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0181] Porphobilinogen deaminase |
TIGRFAM ID | [TIGR00212] porphobilinogen deaminase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00423817 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAGCGA GCACACAAGC CTGGGTCATC GCGACCCGCG AGAGCCGGTT GGCGCTGTGG CAGGCCGAGC ATGTGCGTGC CCTGCTCGGC TCGCGCCTGG TCGAGCCGGT CGAGCTGCTG GGGATGACGA CGCGCGGCGA CCAGATCCTG GACCGCACGC TCAGCAAGGT CGGTGGCAAG GGCCTGTTCG TCAAGGAACT CGAGACCGCG CTGGAGGCCG GTGACGCCCA CCTCGCCGTG CATTCGCTGA AGGACGTGCC GATGGACCTG CCGGCCGGCT TCGTGCTGGC CGCGGTGCTG GAGCGCGAGG ACCCGCGCGA CGCCTGGGTC TCCCCACGCT ACGCCGACCT GGCCGCGCTG CCGGCCGGCG CGGTGGTCGG CACCTCGAGC CTGCGGCGGC TCAGCCAGTT GCGGGCGCGC CGGCCCGACC TGCGCATCGA GCCGCTGCGC GGCAACCTCG ACACCCGCCT GCGCAAGCTC GACGAGGGTC AGTACGACGC CATCGTGCTG GCCGCCGCCG GCCTGAAGCG CCTGGGCCTG GCCGAGCGCA TCCGCAGCGT GTTCGAGGCC GACGCGATGA TCCCCGCTGC GGGCCAGGGG GCGCTCGGCA TCGAGCTGCG GGCCGATGCG CCCGAGCGCC ATCCGGCGCT GTGGGCCGCC CTGCGGGCGC TGACGCACGA GCCGAGCTGG CTGGCGGTGC ATGCGGAGCG CGCGGTCTCG CGCGCACTGG GCGGCAGCTG CAGCATGCCG CTGGCGGCGC ATGCGCAATG GCAGGCCGAC GGCCGGCTGG TCTTGCGGGC GGCGCTCGGC AGCGTGGCCG AGGCCGCGCC CGCGCTGGTG CACGCCGAGG CCGGCGCGGC TGTGGCCGAC ACCGCCGCGG CCGAGGCGCT GGGCCTCGCG GTGGCGCAGC AGTTGCGCCA GCGCGGTGGC GACGCGCTGC TGGCGGCGCT CTGA
|
Protein sequence | MEASTQAWVI ATRESRLALW QAEHVRALLG SRLVEPVELL GMTTRGDQIL DRTLSKVGGK GLFVKELETA LEAGDAHLAV HSLKDVPMDL PAGFVLAAVL EREDPRDAWV SPRYADLAAL PAGAVVGTSS LRRLSQLRAR RPDLRIEPLR GNLDTRLRKL DEGQYDAIVL AAAGLKRLGL AERIRSVFEA DAMIPAAGQG ALGIELRADA PERHPALWAA LRALTHEPSW LAVHAERAVS RALGGSCSMP LAAHAQWQAD GRLVLRAALG SVAEAAPALV HAEAGAAVAD TAAAEALGLA VAQQLRQRGG DALLAAL
|
| |