Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A3633 |
Symbol | |
ID | 4786099 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 3842805 |
End bp | 3843788 |
Gene Length | 984 bp |
Protein Length | 327 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640092215 |
Product | hypothetical protein |
Protein accession | YP_001022821 |
Protein GI | 124268817 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0364537 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.266279 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACTCG CCACCTACAA GGACGGCTCG CGCGACGGCC AGCTGGTCGT CGTCTCGCGC GACCTGTCCA CGGCCCATCA CGCGAACGGC ATCGCCGGTC GCCTGCAGCA GGTGCTGGAC GACTGGAACT TCCTGTCGCC GCAGCTGCAG GACCTGTACG AGACGCTGAA CCAGGGCAAG GCCCGCCACG CCTTCGCGTT CGAGCCGGCG CAGTGCATGG CGCCGCTGCC GCGCGCCTGC CAGTGGGCCG ACGGCTCGGC CTACCTCAAC CACGTGGAGC TGGTGCGCAA GGCGCGCGGC GCCGAGGTGC CCGAGAGCTT CTACACCGAC CCACTGATGT ACCAGGGCGG CAGCGACGAC CTGCTCGGCC CCTGCGACGA CATCGTCGTG CCGAGCGAGA AGATGGGCAT CGACTTCGAG AGCGAGGTCG CGGTGATCAC CGGCGACCTG CCGATGGGCG TGTCGCCGGA AGCGGCGATC GACGGCATCC GCCTGCTGAT GCTGGCCAAC GACGTGAGCC TGCGCCACCT GATCCCCGCC GAGCTGGCCA AGGGCTTCGG CTTCCTGCAG AGCAAGCCGG CCACCGCCTT CAGCCCGGTC GCCGTCACGC CCGACGAACT GGGCACGGCC TGGCAGGGCG GCCGCGTGCA CCTCACGCTG CAGACCCAGT GGAACGGCAG GAAGGTGGGC CTGTGCGAGG CCGGGCCCGA GATGACCTTC CACTTCGGCC AGCTGATCGC CCACCTGGCC ACCACGCGCC GGGTGCGCGC CGGCAGCATC GTCGGCAGCG GCACGGTCAG CAACAAGGAC TGGTCCAAGG GCTACAGCTG CATCGCCGAG AAGCGTGCGA TCGAGACGAT CGAGGGCGGC GCGCCGGTCA CCGAATTCAT GCGCTACGGC GACACGGTAC GCATCGAGAT GAAGGGCAGC GACGGCCAGA GCGTGTTCGG CGCGATCGAG CAGACGGTGG CAGCACCGGG CTGA
|
Protein sequence | MKLATYKDGS RDGQLVVVSR DLSTAHHANG IAGRLQQVLD DWNFLSPQLQ DLYETLNQGK ARHAFAFEPA QCMAPLPRAC QWADGSAYLN HVELVRKARG AEVPESFYTD PLMYQGGSDD LLGPCDDIVV PSEKMGIDFE SEVAVITGDL PMGVSPEAAI DGIRLLMLAN DVSLRHLIPA ELAKGFGFLQ SKPATAFSPV AVTPDELGTA WQGGRVHLTL QTQWNGRKVG LCEAGPEMTF HFGQLIAHLA TTRRVRAGSI VGSGTVSNKD WSKGYSCIAE KRAIETIEGG APVTEFMRYG DTVRIEMKGS DGQSVFGAIE QTVAAPG
|
| |