Gene Mpe_A0919 details


Gene Information

Locus tag: Mpe_A0919
Symbol: msmS2
ID: 4787301
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 975377
End bp: 976624
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 64%
IMG OID: 640089480
Product: methanesulfonate monooxygenase, hydroxylase alpha (large) subunit
Protein accession: YP_001020116
Protein GI: 124266112
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
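
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following Python sketch assumes 1-based, inclusive coordinates and that the final codon is a stop codon; neither convention is stated in the record, and the function and variable names are illustrative only.

```python
# Values copied from the record above; the names are illustrative.
START_BP = 975377
END_BP = 976624


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, ASSUMING 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, ASSUMING whole codons with the last one a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1248, matching the listed gene length
print(protein_length(length_bp))  # 415, matching the listed protein length
```

Under these assumptions both computed values agree with the listed 1248 bp and 415 aa.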


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCATCCC TGTCTTCGCA ACTCACGGAG AGCCGCATGT CGCGCAATGC AACCGAGTGG 
CAGCAACGCC CCAATTTCCC CGACACCCAC TTCGTCAGCA CCGACATCTA CACCGACGAG
CAGATCTTCC GGCAGGAGCA GGAGCTGATC TTCAACAAGG TGTGGATCAT TGCGTGCCAC
GAGTCGGAGC TGCAGAACGC CTACGACTAC CGCACCTTCA ACCACCCGGG CGGCGCGCCG
CTGATCGTGG TGCGCGGCGA GGACATGAAG GTCCGCAGCT TCTACAACAT CTGTCCGCAC
CGCGGCAACA CGCTGCTCTA CGAGCCGGTA GGCAATGCCA AGCGCATCAC CTGCATCTTC
CACGCGTGGT CGTTCGACGT GAAGGGCAAC TGCATCGACA TCTCGCGCGG CAAGCAGGGC
TACCAGGACC GCTACGGCTG CGAGCAGGCC GGCCTGCGCG AGGTGAAGAC CGAGATCGGC
TACGGCGGCT TCGTGTGGGT CAACGTGGAT GACCAGTGCT CATCGCTCGG TGAGTACATC
GGCGACTCGA TGAGCATGCT CGACGAGCAG CTGAACATGC CGCTCGAGGT CTTCCACTAT
CACAAGGCCG TGGTCAACAC GAACTACAAG CTGTGGCACG ACACGAACAG CGAGTTCTAT
CACGACTACA TGCACTACTT CAATCGCATC ACCGGGATGA TCCAGCCCGG CTATTTCGAT
CGGAAGTACA CCGGCTACCC CAACGGCCAC GCCTCGGTCG GCTCGATGGC GATCAAGTAC
GACGCCTACG AGGGCAGCAA GGCGCGCGGC GTGGGCTGGC CCGGCCTGGC GCCGGGAGGC
TGGGTGCTGA TCGACATCTT CCCGGGCATG ACCTTCAACC TGCGCACCTC GGTGCTGCGG
ATGGACACGG CGATCCCGCT GGGGCCGAAC AAGCTGCTGA TCGAGTTCCG CGGTCTGGGC
CTCAAGAGCG ACACGCCCGA AGAGCGGGCC GAGCGTATCC GCGACCACAA CACCATCTGG
GGGCCGTTCG GTCGCAACCT GCACGAGGAC CTGCTTGGGG TGCATGGCCA GGGGCTGGCG
ATGCGCGACC GCACCGACAG CAAGTGGGTG CTGCACGGGC GCGAGGAGAA CATGACCATC
CACGACGAAG GCGGCATGCG CCACTTCTAT GCGGAATGGA GCCGCCGCAT GGGCCGCATG
GCCCATGACC CGCACGGCAA GGCCGGCACG GCCCAGGCCG CCGCCTGA
 
Protein sequence
MSSLSSQLTE SRMSRNATEW QQRPNFPDTH FVSTDIYTDE QIFRQEQELI FNKVWIIACH 
ESELQNAYDY RTFNHPGGAP LIVVRGEDMK VRSFYNICPH RGNTLLYEPV GNAKRITCIF
HAWSFDVKGN CIDISRGKQG YQDRYGCEQA GLREVKTEIG YGGFVWVNVD DQCSSLGEYI
GDSMSMLDEQ LNMPLEVFHY HKAVVNTNYK LWHDTNSEFY HDYMHYFNRI TGMIQPGYFD
RKYTGYPNGH ASVGSMAIKY DAYEGSKARG VGWPGLAPGG WVLIDIFPGM TFNLRTSVLR
MDTAIPLGPN KLLIEFRGLG LKSDTPEERA ERIRDHNTIW GPFGRNLHED LLGVHGQGLA
MRDRTDSKWV LHGREENMTI HDEGGMRHFY AEWSRRMGRM AHDPHGKAGT AQAAA
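
The protein sequence corresponds to the gene sequence translated with translation table 11, in which an initiating GTG codon is read as methionine. The Python sketch below illustrates this for the first 30 bases of the gene. It is a minimal example using only the standard library: the helper names (`clean`, `translate`, `gc_percent`) are hypothetical, the codon table is written in the usual NCBI TCAG order, and the start-codon set is the one NCBI lists for table 11.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 assigns the same amino acids as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed by NCBI for table 11; at position 0 they encode Met.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiating GTG is read as methionine
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence, copied from the record above.
first_30 = "GTGTCATCCC TGTCTTCGCA ACTCACGGAG"
print(translate(first_30))  # MSSLSSQLTE
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the listed protein sequence and GC content to be checked in the same way.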