Gene Mpe_A3633 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A3633 
Symbol 
ID4786099 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3842805 
End bp3843788 
Gene Length984 bp 
Protein Length327 aa 
Translation table11 
GC content70% 
IMG OID640092215 
Producthypothetical protein 
Protein accessionYP_001022821 
Protein GI124268817 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0364537 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.266279 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTCG CCACCTACAA GGACGGCTCG CGCGACGGCC AGCTGGTCGT CGTCTCGCGC 
GACCTGTCCA CGGCCCATCA CGCGAACGGC ATCGCCGGTC GCCTGCAGCA GGTGCTGGAC
GACTGGAACT TCCTGTCGCC GCAGCTGCAG GACCTGTACG AGACGCTGAA CCAGGGCAAG
GCCCGCCACG CCTTCGCGTT CGAGCCGGCG CAGTGCATGG CGCCGCTGCC GCGCGCCTGC
CAGTGGGCCG ACGGCTCGGC CTACCTCAAC CACGTGGAGC TGGTGCGCAA GGCGCGCGGC
GCCGAGGTGC CCGAGAGCTT CTACACCGAC CCACTGATGT ACCAGGGCGG CAGCGACGAC
CTGCTCGGCC CCTGCGACGA CATCGTCGTG CCGAGCGAGA AGATGGGCAT CGACTTCGAG
AGCGAGGTCG CGGTGATCAC CGGCGACCTG CCGATGGGCG TGTCGCCGGA AGCGGCGATC
GACGGCATCC GCCTGCTGAT GCTGGCCAAC GACGTGAGCC TGCGCCACCT GATCCCCGCC
GAGCTGGCCA AGGGCTTCGG CTTCCTGCAG AGCAAGCCGG CCACCGCCTT CAGCCCGGTC
GCCGTCACGC CCGACGAACT GGGCACGGCC TGGCAGGGCG GCCGCGTGCA CCTCACGCTG
CAGACCCAGT GGAACGGCAG GAAGGTGGGC CTGTGCGAGG CCGGGCCCGA GATGACCTTC
CACTTCGGCC AGCTGATCGC CCACCTGGCC ACCACGCGCC GGGTGCGCGC CGGCAGCATC
GTCGGCAGCG GCACGGTCAG CAACAAGGAC TGGTCCAAGG GCTACAGCTG CATCGCCGAG
AAGCGTGCGA TCGAGACGAT CGAGGGCGGC GCGCCGGTCA CCGAATTCAT GCGCTACGGC
GACACGGTAC GCATCGAGAT GAAGGGCAGC GACGGCCAGA GCGTGTTCGG CGCGATCGAG
CAGACGGTGG CAGCACCGGG CTGA
 
Protein sequence
MKLATYKDGS RDGQLVVVSR DLSTAHHANG IAGRLQQVLD DWNFLSPQLQ DLYETLNQGK 
ARHAFAFEPA QCMAPLPRAC QWADGSAYLN HVELVRKARG AEVPESFYTD PLMYQGGSDD
LLGPCDDIVV PSEKMGIDFE SEVAVITGDL PMGVSPEAAI DGIRLLMLAN DVSLRHLIPA
ELAKGFGFLQ SKPATAFSPV AVTPDELGTA WQGGRVHLTL QTQWNGRKVG LCEAGPEMTF
HFGQLIAHLA TTRRVRAGSI VGSGTVSNKD WSKGYSCIAE KRAIETIEGG APVTEFMRYG
DTVRIEMKGS DGQSVFGAIE QTVAAPG