Gene Information |
Locus tag | Mpe_A0919 |
Symbol | msmS2 |
ID | 4787301 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 975377 |
End bp | 976624 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640089480 |
Product | methanesulfonate monooxygenase, hydroxylase alpha (large) subunit |
Protein accession | YP_001020116 |
Protein GI | 124266112 |
COG category | [P] Inorganic ion transport and metabolism; [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
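The coordinates and lengths listed above are mutually consistent, assuming that Start bp and End bp are 1-based inclusive positions and that the final codon is a stop codon not counted in the protein length. A minimal Python sketch of that arithmetic follows; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 975377
end_bp = 976624

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends with a stop codon,
# which does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values match the listed Gene Length (1248 bp) and Protein Length (415 aa).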
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCATCCC TGTCTTCGCA ACTCACGGAG AGCCGCATGT CGCGCAATGC AACCGAGTGG CAGCAACGCC CCAATTTCCC CGACACCCAC TTCGTCAGCA CCGACATCTA CACCGACGAG CAGATCTTCC GGCAGGAGCA GGAGCTGATC TTCAACAAGG TGTGGATCAT TGCGTGCCAC GAGTCGGAGC TGCAGAACGC CTACGACTAC CGCACCTTCA ACCACCCGGG CGGCGCGCCG CTGATCGTGG TGCGCGGCGA GGACATGAAG GTCCGCAGCT TCTACAACAT CTGTCCGCAC CGCGGCAACA CGCTGCTCTA CGAGCCGGTA GGCAATGCCA AGCGCATCAC CTGCATCTTC CACGCGTGGT CGTTCGACGT GAAGGGCAAC TGCATCGACA TCTCGCGCGG CAAGCAGGGC TACCAGGACC GCTACGGCTG CGAGCAGGCC GGCCTGCGCG AGGTGAAGAC CGAGATCGGC TACGGCGGCT TCGTGTGGGT CAACGTGGAT GACCAGTGCT CATCGCTCGG TGAGTACATC GGCGACTCGA TGAGCATGCT CGACGAGCAG CTGAACATGC CGCTCGAGGT CTTCCACTAT CACAAGGCCG TGGTCAACAC GAACTACAAG CTGTGGCACG ACACGAACAG CGAGTTCTAT CACGACTACA TGCACTACTT CAATCGCATC ACCGGGATGA TCCAGCCCGG CTATTTCGAT CGGAAGTACA CCGGCTACCC CAACGGCCAC GCCTCGGTCG GCTCGATGGC GATCAAGTAC GACGCCTACG AGGGCAGCAA GGCGCGCGGC GTGGGCTGGC CCGGCCTGGC GCCGGGAGGC TGGGTGCTGA TCGACATCTT CCCGGGCATG ACCTTCAACC TGCGCACCTC GGTGCTGCGG ATGGACACGG CGATCCCGCT GGGGCCGAAC AAGCTGCTGA TCGAGTTCCG CGGTCTGGGC CTCAAGAGCG ACACGCCCGA AGAGCGGGCC GAGCGTATCC GCGACCACAA CACCATCTGG GGGCCGTTCG GTCGCAACCT GCACGAGGAC CTGCTTGGGG TGCATGGCCA GGGGCTGGCG ATGCGCGACC GCACCGACAG CAAGTGGGTG CTGCACGGGC GCGAGGAGAA CATGACCATC CACGACGAAG GCGGCATGCG CCACTTCTAT GCGGAATGGA GCCGCCGCAT GGGCCGCATG GCCCATGACC CGCACGGCAA GGCCGGCACG GCCCAGGCCG CCGCCTGA
|
Protein sequence | MSSLSSQLTE SRMSRNATEW QQRPNFPDTH FVSTDIYTDE QIFRQEQELI FNKVWIIACH ESELQNAYDY RTFNHPGGAP LIVVRGEDMK VRSFYNICPH RGNTLLYEPV GNAKRITCIF HAWSFDVKGN CIDISRGKQG YQDRYGCEQA GLREVKTEIG YGGFVWVNVD DQCSSLGEYI GDSMSMLDEQ LNMPLEVFHY HKAVVNTNYK LWHDTNSEFY HDYMHYFNRI TGMIQPGYFD RKYTGYPNGH ASVGSMAIKY DAYEGSKARG VGWPGLAPGG WVLIDIFPGM TFNLRTSVLR MDTAIPLGPN KLLIEFRGLG LKSDTPEERA ERIRDHNTIW GPFGRNLHED LLGVHGQGLA MRDRTDSKWV LHGREENMTI HDEGGMRHFY AEWSRRMGRM AHDPHGKAGT AQAAA
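The gene sequence begins with GTG while the protein begins with M. Under translation table 11, GTG is an accepted alternative start codon and is read as methionine at initiation. The sketch below shows one way to strip the 10-base block formatting used above and run basic checks on a sequence: start codon, stop codon, and GC percentage. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only short fragments copied from the record are used here; pasting the full Gene sequence field into `clean_sequence` would allow the result to be compared with the 64% GC content listed.

```python
def clean_sequence(seq_text: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C."""
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Fragments copied from the Gene sequence field (first and last blocks only).
first_blocks = clean_sequence("GTGTCATCCC TGTCTTCGCA")
last_blocks = clean_sequence("GCCCAGGCCG CCGCCTGA")

start_codon = first_blocks[:3]   # expected to be an initiation codon under table 11
stop_codon = last_blocks[-3:]    # expected to be a stop codon

print(start_codon, stop_codon, gc_percent(first_blocks[:10]))
```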