Gene Mpe_A1351 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1351 
Symbol 
ID4785457 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp1455633 
End bp1457258 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content65% 
IMG OID640089917 
Productsteroid monooxygenase 
Protein accessionYP_001020548 
Protein GI124266544 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2072] Predicted flavoprotein involved in K+ transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.881234 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGACA AGACCGTTGC CAAGACCACC AGGGTGGACG CCGTGGTCAT CGGGGCAGGC 
ATCGCCGGCC TGTACCAGGT CTATCGCCTG CGTGAGCAGG GGTTCGACGT CCAGGCTTTC
GAGGCCGGCT CCAACGTCGG AGGCACCTGG TACTGGAATC GCTACCCGGG CGCCCGGTTC
GATTCCACGG CCGAGGTCTA CCAGTTCTGG TTTTCGGAAG ATCTCTACAA GGGGTGGAAA
CCGAGCGAGC GCTTCCCGGC GCAGCCCGAG TCCGAGCGCT GGCTGAACTA CGTGGCGGAC
CGCTGTGACC TGCGCAAGCA TTACCGGTTC AGCACGCGCG TCGAGGCGGC GCACTACGAT
GAAGCTGCGC AGAGCTGGTC GATCACGACC GACCAGGGCG ATACCGTCCA GGCGCGCTTC
CTCATCACCT GCTGCGGCAT GCTGTCGGCG CCGCTGACGT CGGTGTTCCC CGGTCAGGAC
AGCTTCAAGG GCCAGTTGTT CCACACCGCG CGTTGGCCGA AGGAACCGGT CGATTTCACC
GGCAAACGCG TGGGCATCGT GGGCACCGGG GCGACGGGCA TCCAGGTCAT CCAGACGATC
GCGAGTCAGG TCGGCCACCT CAAGGTGTTC CTGCGCACGC CGCAGTACAC GATCCCGATG
AACAACCCGA AGTACACCGA GGCGGTCTGG GCCGGATTCT CGAGCCGCTT CCACGAGATG
AAGGAACGCG TGCAGCGAAC CTTTGCCGGC CACGTCTACG ACTTCGGCGG CTACGGCACT
TGGGCCGAAA GGACGCCCGA GGAGCGGATC GCCGTGCTGG AGGAGCTCTG GAACGACGGC
TCGCTGGCGT TGTGGCTGGC CTCGTTCTCC GAAATGTTCT TCGACGAAAA GGTCAACGCC
GAGGTCTCCG AGTTCGTGCG CGGGAAGATG CGCGAGCGGC TCAAGGACCC GGTGCTGTGC
GAGAAGTTGA TCCCCACGAA CTACGGCTTC GGGACCAACC GCGTGCCACT GGACACCAAC
TACCTGGAGG CCTACCACCG CCCGAACGTC GAGATCGTTG ACGTGAAGGC GTCGCCGATC
GAGTGCGTCA CGCCCGAAGG TGTGCGAACG GCCGACGGCA AGCTCCACGA ACTCGACATC
CTGATCCTGG CGACGGGTTT CGATGCGGGA ACGGGTGCAC TGACGCGCAT CGACATCCGC
GGTCGCGGCG GGCGCTCGCT CAAGGACGAC TGGGGCCGCG AGATCCGCAC CACGATGGGC
CTGCAGGTGC ACGGCTATCC CAACCTCTTC ACGACCGGGG CGCCGCTGGC GCCGTCGGCG
GCCTTCTGCA ACATGACCAC CTGCCTGCAG CAGCAGGTCG ACTGGATCAC CGAGTGCCTG
GTGGCGCTGC GCCGTAAGGG CCTGACCGTC ATCGAGCCCA GCCGGGCGCT GGAAGACGAA
TGGGTGGCTC ACCACGACGA GACCTCCAAC GCGACGCTGC TGGTCAAGAC CGATTCCTGG
TACATGGGAA CCAATGTCAA GGGCAAGCAG CGCCGCATGC TTTCGTACAT CGGTGGGGTC
GGAAAATACC GCCAACGCTG CGAAGAACTG GCCGCCGGCG GCTATCCGGG TTTCGAGATG
CGCTGA
 
Protein sequence
MNDKTVAKTT RVDAVVIGAG IAGLYQVYRL REQGFDVQAF EAGSNVGGTW YWNRYPGARF 
DSTAEVYQFW FSEDLYKGWK PSERFPAQPE SERWLNYVAD RCDLRKHYRF STRVEAAHYD
EAAQSWSITT DQGDTVQARF LITCCGMLSA PLTSVFPGQD SFKGQLFHTA RWPKEPVDFT
GKRVGIVGTG ATGIQVIQTI ASQVGHLKVF LRTPQYTIPM NNPKYTEAVW AGFSSRFHEM
KERVQRTFAG HVYDFGGYGT WAERTPEERI AVLEELWNDG SLALWLASFS EMFFDEKVNA
EVSEFVRGKM RERLKDPVLC EKLIPTNYGF GTNRVPLDTN YLEAYHRPNV EIVDVKASPI
ECVTPEGVRT ADGKLHELDI LILATGFDAG TGALTRIDIR GRGGRSLKDD WGREIRTTMG
LQVHGYPNLF TTGAPLAPSA AFCNMTTCLQ QQVDWITECL VALRRKGLTV IEPSRALEDE
WVAHHDETSN ATLLVKTDSW YMGTNVKGKQ RRMLSYIGGV GKYRQRCEEL AAGGYPGFEM
R