Gene Mpe_A3167 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A3167 
Symbol 
ID4786564 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3365410 
End bp3368643 
Gene Length3234 bp 
Protein Length1077 aa 
Translation table11 
GC content68% 
IMG OID640091739 
Productsignal transduction histidine kinase-like protein 
Protein accessionYP_001022355 
Protein GI124268351 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0540842 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGATA CGCTCACTCA CTCCAGTCCG ACAGCCTCCG CCCCCCCCTT CGTCGCCACA 
TCGCTGGTGA CCCTGGTGCT GGTCGCGCTG GCCTATGTGC TGACGGGCGC GGCCTCGCTG
CAACTCGCGA TCCCCCCCGG CTACGCTTCG CCGCTGTTCC CGCCGGCCGG CATCGCGCTG
GCCGCGGCCT TGGTCTACGG TTGGCGGGTG CTGCCGGCGG TGATGCTCGG GAGCCTGAGC
GTGCAGATGC TCGTGTACTT CAACGGCGAT CAGCAGCATG TCTTCCGCGA TTCGCTCACC
GGGCTCGCAA TCGCTTGCGG TGCGACGATG CAGACTGCGC TCGGCGCCTG GCTGGTGCGC
CGCGGCCTGA GGGAGCCACT GACGCTGGAC GGGCCGGCTT CGATCCTGCG CCTGTTCGGT
TTCGGTGCGT TGGCCGCCTG TACCGTCGGG GCAAGCCTGA GCGTCCTGGC GCTGTGGTTG
GCCGGTCTGC TGCCGACCGC CCTGCTATTC GACACCTGGT GGACCTGGTG GGGCGGCGAT
GCCTTGGGCG TGATGATCGC CACACCCGTG GTGCTGACGC TGATCGGCCG ACCGCGGACC
GCCTGGCGCC CGCGCCGCAA CACCGTGGCC TTGCCGTTGG TGGTTGCCAC CTTGCTGCTG
GCGCTGGCCA TCTGGCAAGT CGCCCGCTGG GACGGCCAGC GCGACGCGGC GCGCTTCGAG
CGCGATGCGA TCAGCGTGAG CCGTACAGTC GAGCTGCGGC TCAATGCCCA CCTCGACGCA
CTGGATGCGA TGCACAGCAT CTATGTCGCG TCGTCGAACG TCGACAGCCG TGAGTTCGAG
CGGGCCAGCG CCGGCTGGCT GCGCAAGCTG CCGAGCGTGC AGGCGATCGG TTGGCATGAG
CGCGTTGCGC GGCAGGACCT GCCGGCCTTC GAGGCAAGCC TGCGCTCTCT CGGTCAACCG
CACTACCGCG TGTTCGAACG CGACAGCGCC CTCAGCGCGA ACGACTCCGA ACTGATGGTG
ATGCGCTACA TCGAGCCGCG TGCTGGCAAC GACACCGCGC TGGGTGTCAA CGCGCTGTCG
ATTCGCGCTG CCCGCGAGGC CATCGCGCGC GCGCGCCTGA ACGATGGCGC CACGGTCACC
CCTGGCTTCA GGCTCACGCA GGAAACCGCA CAGCAGACCG GCGTTGTCGT CTATCGCGCC
GTGTACCGTG GGGACGAGCC ATCACGCAGC TTGCAGGGAA TGGTCTTCAT TACTCTACGC
ATGGAAGACG CTCTGGCCCG GCTTGCTGTC GGTGCCCCAC GCTACCTTCA GTTCTGTCTG
CTGGACAGTG ATGCCCCCCC CGAGAGCCGG CTGCTGGCCG GCTCGTCCGA CTGCGCGCGC
CCTGCCGTCA ATCCCCTATG GAGCCACACC GCGGAGATCC CGTTCGCCGG TCGCACCTGG
GAGCTTCGGG TGCAGGCACC GAATGGCGTT CCCAATGGGC CCGCCGCCGG GCCGCTATCG
GGAGAGAGCG CGACGGCTTG GCTGTTCTCA CTGACCGGGC TGCTCGGCAC CGGCATGATG
GGTGCGCTGC TTCTGCTCGT CACAGGGCGC ACTCGGCAGA CCGAAACGGC TGTCGCAGAA
CGTACCGAGC AGTTGGAGCA CGAGATCACC GAGCGCCTGG CCACCGAACA GGCGCTGCGC
GAGAGCGAGC AGCGCTTCCG CAGCATCTTC AACTCGGTGC CTATCGGCGT GGTCTACACC
GACCTTCAGG GCCGCATCAA GCAGCCTAAC GCAGCCTACT GCACCATGAC CGGCTACAGC
GAAGAAGAGC TGTTGAGCCT GTCGCTCGCC AGCCTGACCC ACCCCGAGGA CCGTGCCGCC
GATCTGGCAA GCCAGCAGGA TCTGGTGGAC GGGCGGCTGC CGCTGTACCG ACGGCGCAAA
CGCTACGTCA CCAGGAACGG CCCGGTACTG TGGGTCAGCG TGACGGTCAG CCTGCTGCGC
GACGAGCACG GCCAGCCGCA CCGGCTGGTG GGACTGGTCG AGGACATCAG CGAGCACCTG
CGCCTCGAAG AAGCGGAGCA TGCGCGCGAA TCCGCGGAGA CCGCCAACCG CGCGAAAAAC
GAGTTCCTTT CGCGCATGAG CCACGAACTG CGCACGCCGC TCAACGCGAT GCTCGGCTTC
GCGCAACTGC TCGACCTCGA CCGCGGCGAG CCTCTGCACG AGCGCCACCG GATGTGGGTA
ACGCAGATCC AACAGGCCGG CTGGCATCTG CTGGAGATGA TCAACGACGT GCTGGACCTG
TCGCGCATCG AGTCCGGCAC ACTGAGCCTG CAGGTCGAGT CGTTGTCGCT GGAGCCGCTG
ATCGCCGCAT CGATGGCACT GGTGGAGCCG CAGGCGCGCG GTCGCGGCCT CACGCTTGTC
CGCACGATGG CCGAGCACGC CCCGCTGCAG GTTTTGGGCG ACGCGACTCG CGTCAAGCAG
ATCCTCACCA ACCTGCTGAG CAACGCCGTC AAATACAACC GCGAGGCGGG TGAGGTGCGT
GTGTCGACCC GTCGCACTGA GGACCGCGAT GGCACGCCGA TGCTGGAACT CGACGTCTGC
GACACCGGCC TGGGCATCGA GCCTCAGCTG CTGACCCAGC TGTTCCAGCC CTTCAACCGG
CTGGGGCGCG AGCGTGGCGC GACCGAAGGC ACCGGCATCG GCCTAGTGAT CGCCAAGCTG
CTGGCCGAGC GCCAGGGCGG GTCGCTGAAG GTGCGAAGCG AGCCCGGTGT CGGCTCGACC
TTCACGCTGC GCCTGCCGCT GGACTCGCAA GCGCGGCCGA TGACCGCGAG CCAGGAGGCC
GACGAAGAGG CCAACGCAGC CTACAACCGT CGTCATGTGC TCTACATCGA GGACAACGAC
ACCAACGTCG AGGTGATGCG CGGCATCTTC GGCCTGCGTC CGCAGGTCCG TCTGGCCGTC
GCGACCACCG GGCTGGACGG CTTGGCGGCC GTGCGCACCT CGGCGCCCGA CCTGATCCTG
CTCGACATGC ACCTGCCCGA CATCGACGGC ATGGTGCTGC TCCAGCACCT GAAGTCCGAC
GAGCAGACCG CCGAGATCCC GGTCATCGCC GTCTCCGCCG ACGCGCTGCC GGCCCAGATC
TCGGCCGCCC TGGGGGCCGG CGCGATCCGC TACCTCACGA AGCCGGTCGC CATCGACGAG
GTGCTCGGCG TCCTGGACGA GCTGCTGTCG CAGCTGACGA CCCGCTTCGG CTAG
 
Protein sequence
MADTLTHSSP TASAPPFVAT SLVTLVLVAL AYVLTGAASL QLAIPPGYAS PLFPPAGIAL 
AAALVYGWRV LPAVMLGSLS VQMLVYFNGD QQHVFRDSLT GLAIACGATM QTALGAWLVR
RGLREPLTLD GPASILRLFG FGALAACTVG ASLSVLALWL AGLLPTALLF DTWWTWWGGD
ALGVMIATPV VLTLIGRPRT AWRPRRNTVA LPLVVATLLL ALAIWQVARW DGQRDAARFE
RDAISVSRTV ELRLNAHLDA LDAMHSIYVA SSNVDSREFE RASAGWLRKL PSVQAIGWHE
RVARQDLPAF EASLRSLGQP HYRVFERDSA LSANDSELMV MRYIEPRAGN DTALGVNALS
IRAAREAIAR ARLNDGATVT PGFRLTQETA QQTGVVVYRA VYRGDEPSRS LQGMVFITLR
MEDALARLAV GAPRYLQFCL LDSDAPPESR LLAGSSDCAR PAVNPLWSHT AEIPFAGRTW
ELRVQAPNGV PNGPAAGPLS GESATAWLFS LTGLLGTGMM GALLLLVTGR TRQTETAVAE
RTEQLEHEIT ERLATEQALR ESEQRFRSIF NSVPIGVVYT DLQGRIKQPN AAYCTMTGYS
EEELLSLSLA SLTHPEDRAA DLASQQDLVD GRLPLYRRRK RYVTRNGPVL WVSVTVSLLR
DEHGQPHRLV GLVEDISEHL RLEEAEHARE SAETANRAKN EFLSRMSHEL RTPLNAMLGF
AQLLDLDRGE PLHERHRMWV TQIQQAGWHL LEMINDVLDL SRIESGTLSL QVESLSLEPL
IAASMALVEP QARGRGLTLV RTMAEHAPLQ VLGDATRVKQ ILTNLLSNAV KYNREAGEVR
VSTRRTEDRD GTPMLELDVC DTGLGIEPQL LTQLFQPFNR LGRERGATEG TGIGLVIAKL
LAERQGGSLK VRSEPGVGST FTLRLPLDSQ ARPMTASQEA DEEANAAYNR RHVLYIEDND
TNVEVMRGIF GLRPQVRLAV ATTGLDGLAA VRTSAPDLIL LDMHLPDIDG MVLLQHLKSD
EQTAEIPVIA VSADALPAQI SAALGAGAIR YLTKPVAIDE VLGVLDELLS QLTTRFG