Gene Information |
Locus tag | Mpe_A0922 |
Symbol | msmC |
ID | 4787304 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 978187 |
End bp | 978915 |
Gene Length | 729 bp |
Protein Length | 242 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640089483 |
Product | methanesulfonate monooxygenase component; iron sulfur protein |
Protein accession | YP_001020119 |
Protein GI | 124266115 |
COG category | [P] Inorganic ion transport and metabolism; [R] General function prediction only; [T] Signal transduction mechanisms |
COG ID | [COG0664] cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases; [COG2146] Ferredoxin subunits of nitrite reductase and ring-hydroxylating dioxygenases |
TIGRFAM ID | |
|
|
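The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon; the function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percent G+C in a nucleotide string; whitespace is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


# Start bp and End bp as listed in the record.
length = gene_length(978187, 978915)
print(length, protein_length(length))  # 729 242
```

Passing the gene sequence from the Sequence block to gc_percent should give a value comparable to the listed GC content, which the record reports rounded to a whole percent.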
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.738029 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCGCA TCCGCATCTG CAGCGCCAGC GAGATCCCCG GCCAGGGGAT GAAGTGCTAC GACACGGCCA CGGGCAGCAA GGTCCTGGTG GTGAACAGCG GCGACCAATT CCACGCCTTC CAGGGCCTGT GCCCACACCA GGACGTGTGC CTCGACGAGG GCTTCTTCGA CGGCAGCACG CTGACCTGCC ACCAGCACCT GTGGCAGTGG GACGTGAAGA CGGGCGAGGC CCTCGGGCTG GCCGAGGCGC CGCTGGAGCG CTACGAGATC GAGCACGTCG ACGGCGAGAT CTTCGTGCTG CAGTCCAGTG CGCTGCGCGC CTGCGAGCTG TTCAAGGGCA TCTCCGACGC GGTGATCGGG CGGCTCGACG CCCTGGCGCG GCGGGAGGAG CACGGCGTGG CGGCGGCGCT GTACGACATC GGCGATCCGG CCGACGACCT CTACATCCTC GAATCGGGCC GCGTGGAATT CGAGATCGGC CGCGAGGACC GCACGCGCAT GGCGGGCTTC ATGCTGCGCA AGGGCGAGAT GTTCGGGTGG GCCGCGCTGC TGCAGGACCA GCCGCGCCGC ATCGCCCGCG CCACCTGCAT GGAGCCGTCC ACGTTCCTGC GGCTCAAGGG CGAGGACGTC CTGAAGGTGC TGGCGGAGGA GCCGGCAGCA GGCTTCCTCG TGATGCGCCA GCTCTCGACG CTGATCACGC GGCATCTGCG CACGCAGGGC GGCAAGTGA
|
Protein sequence | MKRIRICSAS EIPGQGMKCY DTATGSKVLV VNSGDQFHAF QGLCPHQDVC LDEGFFDGST LTCHQHLWQW DVKTGEALGL AEAPLERYEI EHVDGEIFVL QSSALRACEL FKGISDAVIG RLDALARREE HGVAAALYDI GDPADDLYIL ESGRVEFEIG REDRTRMAGF MLRKGEMFGW AALLQDQPRR IARATCMEPS TFLRLKGEDV LKVLAEEPAA GFLVMRQLST LITRHLRTQG GK
|
| |
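The protein sequence can be checked against the gene sequence by translating codons. The sketch below is an assumption-laden illustration rather than the database's own method: it uses the amino-acid assignments shared by NCBI translation tables 1 and 11 (table 11 differs only in its permitted start codons), and the names CODON_TABLE and translate are hypothetical.

```python
from itertools import product

# Codons enumerated in NCBI order (T, C, A, G at each position),
# paired with the amino-acid string used by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a + strand CDS, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping in the record) is ignored.
    """
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in the record.
print(translate("ATGAAGCGCA TCCGCATCTG CAGCGCCAGC"))  # MKRIRICSAS
print(translate("GGCAAGTGA"))  # GK
```

The two outputs match the first ten residues and the final two residues of the listed protein sequence; the trailing TGA is the stop codon and is not counted in the 242 aa protein length.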