Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A0876 |
Symbol | |
ID | 4787199 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 916066 |
End bp | 917397 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640089437 |
Product | homogentisate 1,2-dioxygenase |
Protein accession | YP_001020073 |
Protein GI | 124266069 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3508] Homogentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR01015] homogentisate 1,2-dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0480084 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCGGCG AGGACGGGAC TTCGATGCAC CACGCTCCCC TGCGCTACCA GAGCGGCTTC GCCAACCACT TCGAGAGCGA GGCGCTGCCC GGTGCGCTGC CGGTGGGCCG CAACTCGCCG CAGCGCTGCC CCTACGGTCT GTACGCCGAG CAGTTCAGCG CCACCGCCTT CACCGCGCCG CGCGCCGACA ACCGCCGCAG CTGGCTCTAC CGCATCCGCC CGGCCGCGAT GCACGCACCG TTCGAGCCGT ACGACGACGG CGGCCGGTTG GTCAGCGACT TCTCCACGCT CGCCACGCCG CCCGATCCGC TGCGCTGGAA CCCGTTGCCG CTGCCGGCGG CGCCGACCGA CTTCGTCGAC GGCCTCGTGA CCTGGGCCGG CCACGGCGAT GCGGGCGTGC AGGCCGGCGC CGCGGTCCAT CTCTATGCCG CCGACCGCTC GATGGAGCAG CGCAGCTTCT GCAGTGCCGA CGGCGAGCTG CTGATCGTGC CCCAGCTCGG CCGCCACCGC TTCGTCACCG AGCTCGGCGT GCTGGAGGTC GAGCCGCAGG AGATCGTCGT CATCCCGCGC GGCCTGCGCT TTCGCGTCGA GCTGCCCGAC GGCGCGGGCC GCGGCTACGT GTGCGAGAAC CACGGCGCGC CGTTTCGCCT GCCCGACCTG GGGCCGATCG GCGCCAACGG CCTGGCGCAT GCGCGCGACT TCCTCGCGCC GGTGGCGGCC TACGAGGACA TCGACGGGCC GCACCAGCTG GTCACCAAGT TCATGGGCCG GCTGTGGTCG GCGGCGATGG ATCACTCGCC GCTCGACGTG GTGGCCTGGC ACGGCAACTG CGCACCGTAC AAGTACGACC TGCGGCGCTT CAATGCCATC GGCTCGATCA GCCACGACCA CCCCGATCCG TCGATCTTCC TGGTGCTGCA TGCGGCCTCC GACACGCCGG GCACCAGCGC CATCGACTTC GTGGTCTTCC CGCCGCGCAT CCTGGCGATG CAGGACACCT TCCGGCCGCC CTGGTTCCAC CGCAACGTCG CCAGCGAGTT CATGGGCCTG ATCCACGGCG TGTACGACGC CAAGGCCGAA GGCTTCCTGC CCGGCGGCGC GAGCCTGCAC AACTGCATGA CCGGCCACGG CCCCGACGCC GAGACCTTCG AGAAGGCGAG CCGTGCCGAC CTGTCGCAGC CCGACGTGAT CCGCGACACC ATGGCCTTCA TGTTCGAGGC GCGCCACGTC TGGCGCCCGA CACCCCGGGC GCTGGCCTCG CCGCTGCGGC AGGCCGACTA CGCGCGCTGC TGGCAGGGCC TGCGCCGGCA CTTCGATCCC GCACGGCGCT GA
|
Protein sequence | MLGEDGTSMH HAPLRYQSGF ANHFESEALP GALPVGRNSP QRCPYGLYAE QFSATAFTAP RADNRRSWLY RIRPAAMHAP FEPYDDGGRL VSDFSTLATP PDPLRWNPLP LPAAPTDFVD GLVTWAGHGD AGVQAGAAVH LYAADRSMEQ RSFCSADGEL LIVPQLGRHR FVTELGVLEV EPQEIVVIPR GLRFRVELPD GAGRGYVCEN HGAPFRLPDL GPIGANGLAH ARDFLAPVAA YEDIDGPHQL VTKFMGRLWS AAMDHSPLDV VAWHGNCAPY KYDLRRFNAI GSISHDHPDP SIFLVLHAAS DTPGTSAIDF VVFPPRILAM QDTFRPPWFH RNVASEFMGL IHGVYDAKAE GFLPGGASLH NCMTGHGPDA ETFEKASRAD LSQPDVIRDT MAFMFEARHV WRPTPRALAS PLRQADYARC WQGLRRHFDP ARR
|
| |