Gene Mpe_A0099 details


Gene Information

Locus tag: Mpe_A0099
Symbol:
ID: 4784501
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 92985
End bp: 95801
Gene Length: 2817 bp
Protein Length: 938 aa
Translation table: 11
GC content: 71%
IMG OID: 640088646
Product: signal transduction histidine kinase-like protein
Protein accession: YP_001019296
Protein GI: 124265292
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
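
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal check, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are illustrative and are not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 92985
END_BP = 95801


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


print(gene_length(START_BP, END_BP))   # 2817, matching "Gene Length"
print(protein_length(2817))            # 938, matching "Protein Length"
```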


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.878773
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGATCC CCTTCCACCC CCGAGAGGCG CAGCGCCTGA CGCTGTTGCG CGAGCTCGGG 
GTGCTCGGCG GCGCAGCAGG TGCCGGCCTC GACGCGCTGA CCGCTTGCGC GGCGCGGCTC
ACGGGTGCGC CGCTGGCAGC GATCTCGCTG GTCGACGACG ACGCGCAGTG GTTCCTGTCG
CGCAGCGGCT TCGAGCTCGC CCAGATTCCG CGCAGCCAGG GCTTCTGTTC GCACGCGATC
CTCGGCGAGA GCCTGTTCAC CGTGCCGGAC CTGGCCCGCG ACCCCCGCTT TCGCGACAGC
CCGCTGGTCA CCGACGGCCC GCGCGTGCGC TTCTATGCCG GCCAGCCGCT GGCCATCGAC
GGGCTGGCCG TCGGGACGCT GTGCGTGTTC GACCTCCAGC CCCGCACGCT CGGTGCCGGC
GAACGCGAAG CGCTGGCGCA ACTGGGCTGT GCCGCGGTCG AACTGCTGCA GTCGCGGCAG
GGCCTGCGCG AAGCGTCGCA GCTGCGCGAG CGCCTGTTCG ACTTCGCGCG CGCCGGCGGT
GACTGGATGT GGGAGAGCGA TGCACGGCAT CGCTACCGGT GGCTTTCCGA CGCCTTCCAG
CCCATCACCG GCATCGACCC GCGCCGGCAG CTCGGCGAAC CCATCACCGA CGCCCAGCTG
CTCGACGCCG ACGGGAGTCC GCGCGTGCCG GCGCAGTACT ACCGGGCGCT GCTCGACCGC
CAGGAGCCGT TCGCACGGGT GCTGACCGCC AAGCACACGC CGCGCGGGCT GCTCTACCTG
TCGCGCAGCG CGGTGCCGGT GTTCGACGCG GCTGGCCGCT TCGAGGGCTA TCGCGGCACG
GTGCGCGACG TCACGTCGAC GCTGGCGGCA GCCAGCGAGG CCCGTCGCGG CGAGGTCACG
CTGCGCGAAC TGGCCGCGCA GGTGCCGGGC ATGCTGTTCA CCTTCGAGGA GCATCCCGAC
GGCCGGGCCG GCTACCCCTT CGTCAGCGAT GGCGCCGAAC GCGTGCTCGG GATGGCGGCG
GCCACGCTGC AGGCCGATGC GCTGACCTTC TTCCGCCACG TTCACCCACC GGATCTCGCG
CCGCTCGGCA AGGCCCTGGC CGAGGGCGCG GCGCGGCTGA CACGGCTGGA GCACAGCTTC
CGCATGACCT TGCCCGATGG CTCGCTGCGC TGGTTCGAGA TGCGTGCGGC GCCGACGCGC
CTGGGTGACG GCGGTACGTT GTGGCACGGA TTCACCGCCG ACGTGACCGC GCGCAAGGCG
ATCGAGGAGG CGCTGCAGGC CCATGAAGAG CGCTGGCAGA TCGCCGCCGA CGCAGCGCGC
ATCGGCATCG CCGAGTTCGT GCTGGCCGAC GGCCTTGTCA ACCTCGATCG CCGCGCCTGC
ATCAATCACG GACTTTCGTA CCCGCATGGC CGGGTGACGC TCGACGACTG GATGGCACAG
ATCGATCCGG CCGACCGCGA CGCCGTGATG GCCGGTATCG GCGAGGCGCT GGCCGGCGAA
CGACCGTTCG AGGGCCGGTA TCGGATCCGC CAGCCCGATG GCAGCGTGCG CTGGCTCGAG
TTCGTGGTGC GTGCCACGCG CGACGAGGCC GGCGCGCCGA CCGGTGTCAT CGGCACCTGC
CGCGACGTGC ACGAGCAGCA GCTGGCCGAC GAGTTGTCCC GCGGCATGCA GGAGGCCGAG
CGGGCGAGCC GCGCCAAGAG CGAATTCCTG TCGCGGGTGA GCCACGAGCT GCGCACGCCA
CTGAACGGCA TCCTCGGCTT CACGCAGCTG ATGGCGCTGG ACGAGGACCA TCCGCTGGCG
GCTCCGCAGG CGCAGCGCCT CGCGAGCGTG CAGCGGGCCG GCAACCGGCT GCTGAATCTG
ATCAACGACG TGCTGGAGAT CAGCCGCATC GAAAGCGCCG AGGTCGCGGT GCGCACGCTG
GCGGTCGATC TCGACGCCGC GCTGCACGCG AGCCTGAGCC TGGTGCAGTC GCTGGCGCGC
GGGCGCTCGA TCACCATCGA GCCGACGCCA CCGAGCGGGC TGTGGGTCAG CGGCGACGAG
CGTGCGCTCG AACAGGTGCT GGTGAACCTG CTGTCCAACG CCATCAAGTA CAGCGGCGAA
CAGAGTCCGG TGGTCTTGAA CCTGCAGCGC AGCGACGGGA CGGTGAAGAT CGCGGTGCGC
GACCGGGGCG TCGGCCTGAC GGCCGAGCAG CAGGCGCGGC TGTTCCAACC CTTCGATCGG
CTGGGTGCCG AACGACGCCG CATCGAAGGC AGCGGTCTCG GCCTGGTGAT CGCACGCCAG
CTGGCCGAGG CGATGGGCGG TCGCATCGAT GTCGTCAGCG TGCCGGGCGC CGGCTCGACC
TTCACCCTGC AGTTGCCCGA GGCCGAGGAG CCGAGCGGCA GCGCCTACCT CGCCACACAG
CCGGCAGCCT TGCTGCACGA ACCGCCCGCG CCGCGGCGCG CGCAGCGGCA GGTGGTCTAC
ATCGAGGACG AATTGCTGAA CCAGGTGTTG CTGCAGGAGG TGTTTCGAGC CCGCCCCGAC
TGGCAGCTCC ACATCGCCGA CGACGGCGCC AGCGGCCTGC GACTGGCGCG CGAGCTCTCC
CCGCACCTGA TGCTGATCGA CATGAACTTG CCCGATACCA ACGGCTTGGC TTTGGTGCAG
GCGCTGCGCG CCGATCCGTC GACACGGGCG CTGCGCTGCA TCGCGCTGTC GGCCGACGCG
ATGAACGAGC AGATCGCGGC GGCGCGCCGT GCCGGCTTCG ACGACTACTG GACCAAGCCG
ATCGATGTGG CGCAGGTGCT GGCCGGGCTC GACGCCGTGC TGGACGGTAC CGCCTGA
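
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction comparable to the "GC content" field; the helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first block of the sequence is used as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence, as laid out in the record.
sample = join_blocks("ATGCCGATCC CCTTCCACCC")
print(len(sample))                 # 20
print(gc_fraction("ATGCCGATCC"))   # 0.6
```

Pasting the full block-formatted sequence into `join_blocks` should give a 2817-base string if the layout above was copied without loss.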
 
Protein sequence
MPIPFHPREA QRLTLLRELG VLGGAAGAGL DALTACAARL TGAPLAAISL VDDDAQWFLS 
RSGFELAQIP RSQGFCSHAI LGESLFTVPD LARDPRFRDS PLVTDGPRVR FYAGQPLAID
GLAVGTLCVF DLQPRTLGAG EREALAQLGC AAVELLQSRQ GLREASQLRE RLFDFARAGG
DWMWESDARH RYRWLSDAFQ PITGIDPRRQ LGEPITDAQL LDADGSPRVP AQYYRALLDR
QEPFARVLTA KHTPRGLLYL SRSAVPVFDA AGRFEGYRGT VRDVTSTLAA ASEARRGEVT
LRELAAQVPG MLFTFEEHPD GRAGYPFVSD GAERVLGMAA ATLQADALTF FRHVHPPDLA
PLGKALAEGA ARLTRLEHSF RMTLPDGSLR WFEMRAAPTR LGDGGTLWHG FTADVTARKA
IEEALQAHEE RWQIAADAAR IGIAEFVLAD GLVNLDRRAC INHGLSYPHG RVTLDDWMAQ
IDPADRDAVM AGIGEALAGE RPFEGRYRIR QPDGSVRWLE FVVRATRDEA GAPTGVIGTC
RDVHEQQLAD ELSRGMQEAE RASRAKSEFL SRVSHELRTP LNGILGFTQL MALDEDHPLA
APQAQRLASV QRAGNRLLNL INDVLEISRI ESAEVAVRTL AVDLDAALHA SLSLVQSLAR
GRSITIEPTP PSGLWVSGDE RALEQVLVNL LSNAIKYSGE QSPVVLNLQR SDGTVKIAVR
DRGVGLTAEQ QARLFQPFDR LGAERRRIEG SGLGLVIARQ LAEAMGGRID VVSVPGAGST
FTLQLPEAEE PSGSAYLATQ PAALLHEPPA PRRAQRQVVY IEDELLNQVL LQEVFRARPD
WQLHIADDGA SGLRLARELS PHLMLIDMNL PDTNGLALVQ ALRADPSTRA LRCIALSADA
MNEQIAAARR AGFDDYWTKP IDVAQVLAGL DAVLDGTA
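
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, assuming the joined DNA is already in frame; it relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard code (the tables differ in permitted start codons). The `translate` function is illustrative and is not the method used by the source database.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# which table 11 shares for elongation codons).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in-frame DNA, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence.
print(translate("ATGCCGATCCCCTTCCACCCCCGAGAGGCG"))  # MPIPFHPREA
print(translate("GGTACCGCCTGA"))                    # GTA
```

Both outputs agree with the start (`MPIPFHPREA`) and end (`...GTA`) of the protein sequence listed above.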