Gene Mpe_A2858 details


Gene Information

Locus tag: Mpe_A2858
Symbol:
ID: 4785552
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 3043379
End bp: 3046384
Gene Length: 3006 bp
Protein Length: 1001 aa
Translation table: 11
GC content: 68%
IMG OID: 640091429
Product: signal transduction histidine kinase-like protein
Protein accession: YP_001022047
Protein GI: 124268043
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
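
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a short calculation. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(3043379, 3046384)
print(length, protein_length(length))  # 3006 1001
```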


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.920291
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAACG AGCACGTCGA CTCGCAGTTC GAGCGTGCCG CGGCCGAGGC GCGCGCCGCG 
GTCGATCGCC GGGTGCTGCG GCCGCTGCGC GATCTGGCGG TGGCGCGCAA GCTCGCGTTG
ATCGTGTTCC TGCTGGTCGG CGTGGTCGTC GGCCTCGTCT ACCTCAGCAA GCTCAGCTCC
GACATCCTGT CCGGCGTGCG CGCCTACGTG GCCGGGGACG GGCTGTGGGC CAAAGGCAAC
CGCGACGCGG TGTTCTACCT GGTGCGCTAT GCACAGACCC ACGAGGAAGC TGATTACCGC
CAGTACGCAG CGGCGTTGTC CGTGACGCTG GGTGACCGGC AGGCGCGACT GGAACTGGAG
AAGCCCGATT TCGACTACGA CGTGGCGTAT GCGGGCTTCT TGGCAGGGCG CAACCACCCC
GACGACATCG ACAGCCTGAT CTGGTTGTTC CGGACCTTCC GCAATCTGAG CTACATGGAC
CGGGTGATCC AGCTCTGGAC CCGCGGCGAC GAGGAAATGG CGCTGCTGCG CAGCTACGGC
GACACCATCC GCACGCAAGT GCTGAACGGC AGCCTGAGCC CGGCGCACGC TGCCCTCCTG
ATCGGCGAGA TCGAGGTGCT CAATCGCCGC CTCGTGCCGA TCGAGGACGC CTTCTCCACC
GTGCTGGGGG AGGCCAGCCG CTGGATCGCC CAGCTGTTGT TCGCTCTGAT GCTGGTGACG
GCCTGCGGTT TGCTCACGCT CGGGCTGGTG GTCGCCGGCT ATCTGACGCG CCGCATCAAC
CGTCAGATCG ACGGGCTGCG CGACGGGGCG TTGCGCATGG CGGACGACGA CTTCGAGCAA
CCGGTGGAGA TCGTCTCCGA CGACGAACTG GGCCGGCTCG CCGCCACCTT CAACAGCATG
CAGGCCCGGC TGCGGGAGCA TCGCAGTGCC ATCGAGGCCA GCGCCGCCGA GTTGCAGCAG
GCCACGGCTG CGGCCCAGGC GCTGGCGCTG CAGGCCGAGA CGGCGAGCCA GGCGAAGAGC
CAGTTCATGG CGACGATGAG CCACGAGATC CGCACGCCGA TGAACGGCGT GCTCGGCATG
ACCGAACTCC TGCTGGGCAC CGCGCTCGAT TCGCGGCAGC GACGCTTTGC CCAGGCGGTG
TACCGCTCCG GCGAAAGCCT GCTCGAGATC ATCAACGACA TCCTCGATTT CTCGAAGATC
GAGGCCGGCA AGCTGGAACT GGCGCCGGCC GACTTCACGC CGCGAGCCCT GGTCGAGGAC
GTGTTGGAAC TGTTGGCGCC TCGGGCGCAG GAGCGCGGGC TGGAGTTGAG CTTCCGGGAG
GAGCCCGGCC TGCCGCCGGC ACTGCATGGC GATGCGCTGC GGCTGCGCCA GGTGCTCACG
AACCTGGTCG CCAACGCGAT CAAATTCACC GAGCATGGCG AGGTGGTCGT CGAGATGCGC
CGGGTCGAGC CGACCGCGGC AGAGACGGCC CTGGCCACCG GGGACCGGCT GTGGGTCGAG
CTGTGCGTGC GAGATACCGG GATCGGCATT CCCCCCGAGG CCTTGTCACG GCTCTTCATC
GCCTTCAGCC AGGCCAGCAG CGGCATGGCG CGACGCTACG GCGGCACCGG TCTGGGGCTG
GCGATCTCGC GCCAGCTGGT CGAACTGATG TCCGGATCGA TCACGGTCCG CAGCCAGCCG
GGGGTGGGCT CGCGGTTCTG CGTTCGGCTG CCCCTGTCGC CGGCCTCCAG CGACGTCGAT
GTCGACATGC TCGAACTGCA TGACATGCCG GCCCTGCGCG TGCTCGTCGT CGATGACAAC
GAGACCAACC GGACGGTGCT CGAAAACCTG CTCGGGGCCT GGGGCATGGA GGTCGTGGTG
GCGAACGACG GCGTGCATGC GCTGGAGCGG CTGCATGCGG AGCGCGATGC CGCACGCAGC
TTCGACATTG CGCTGATCGA CATGCAGATG CCGCGGCTCG ATGGCCTGCA ACTGGCCGAC
CGGATCTCGG CCGAGCCGGA CTTCGCGGAC GTGAAGCTGA TCATGCTGTC GTCCGTGAGC
TCGCCCGACG ACGCCAAGCG CGCGCAGGCC GTGGGCTTCA AGCGCTTCGT CAACAAGCCG
GTGCGGAAGG CCGAACTGCG CCAGGCGATC CTCGGCGTGT CGGGTGTTGC CGGCGCCGGC
GGCGGTTCGT CGCGCAAGAT CGGCGCCCAT ATCCTGGTGG TCGAGGACAA TCCCGTGAAC
CAGGAAGTGA TCGGGCAGAT GCTGCGCCAC TTCGGTTGCC GCGTGCAGCT CGCCTCGTCG
GCGCTCGAAG GGCTGCGTGC CTTGTGTGCC GAGCGCTTCG ACCTGATCAT GATGGACATC
CAGATGCCGG GCATGGACGG TGTCGAGGCG CTCGGCTGGT TCCGCCGAGG ACCTGGCGAG
CGCTTCGCCT TCCGCACTCC ACCCACCACG CCGGTGGTCG CGGTGACCGC CAACGCATTG
GGGGGTGACC GCGAGCGATT CCTCGGCCTC GGATTCGATG AATACCTTTC CAAACCCTTC
CGGCAAAGCC AGTTGCACAC CATGCTGTCA CAACGCCTGA ACATTCCCGA CACCGGGGCG
GGAGAGCTGA CGCCGGCTCC GGCCGAGCCA GCGTTGGCTG GAGCTCCGGC GATCCCCCCT
GCGGCGACGG CAGGAGCTCT GGATGCCCAG GCTCTGCAGC GGCTGCGCGA CCTCGATCCC
ACCGGGGCGA ACCGGCTGCT GGAGCGCGTC GTGCAGGCGT TCGAGACGTC CACGGGGCGT
TTGCTGCCGC AGCTCGACGA AGCGCATGCA GCGGGCGACC TCGACGGCGT GAAGCATGTC
GCCCATACGC TGAAATCCTC GTCGGCCAGC ATCGGGGCGC TCAAGTTGTC GGCCCTGTGT
GCCGACATCG AGGGCATGAT TCGCAACAAC GAGGTGCAGG CCCTGGGGCC GCGCGTCGCG
GCGCTGCGCG CCGAGATCGC GTCGGTTCGT GGCAGCCTGC ACGCTCTGCT GCTGCCTGCC
GCCTGA
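
The gene sequence above is laid out in space-separated blocks of ten bases, so it needs to be joined before any computation such as the GC content reported in the Gene Information section. The following is a minimal sketch, assuming the sequence contains only A, C, G and T. The helper names are hypothetical, and only the first line of the sequence is used here for brevity, so the value it yields applies to that line, not to the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    seq = clean_sequence(seq)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence shown above.
first_line = "ATGAGCAACG AGCACGTCGA CTCGCAGTTC GAGCGTGCCG CGGCCGAGGC GCGCGCCGCG"
print(f"{gc_content(first_line):.1%}")
```

Passing the full gene sequence text to `gc_content` in the same way gives the figure for the whole gene.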
 
Protein sequence
MSNEHVDSQF ERAAAEARAA VDRRVLRPLR DLAVARKLAL IVFLLVGVVV GLVYLSKLSS 
DILSGVRAYV AGDGLWAKGN RDAVFYLVRY AQTHEEADYR QYAAALSVTL GDRQARLELE
KPDFDYDVAY AGFLAGRNHP DDIDSLIWLF RTFRNLSYMD RVIQLWTRGD EEMALLRSYG
DTIRTQVLNG SLSPAHAALL IGEIEVLNRR LVPIEDAFST VLGEASRWIA QLLFALMLVT
ACGLLTLGLV VAGYLTRRIN RQIDGLRDGA LRMADDDFEQ PVEIVSDDEL GRLAATFNSM
QARLREHRSA IEASAAELQQ ATAAAQALAL QAETASQAKS QFMATMSHEI RTPMNGVLGM
TELLLGTALD SRQRRFAQAV YRSGESLLEI INDILDFSKI EAGKLELAPA DFTPRALVED
VLELLAPRAQ ERGLELSFRE EPGLPPALHG DALRLRQVLT NLVANAIKFT EHGEVVVEMR
RVEPTAAETA LATGDRLWVE LCVRDTGIGI PPEALSRLFI AFSQASSGMA RRYGGTGLGL
AISRQLVELM SGSITVRSQP GVGSRFCVRL PLSPASSDVD VDMLELHDMP ALRVLVVDDN
ETNRTVLENL LGAWGMEVVV ANDGVHALER LHAERDAARS FDIALIDMQM PRLDGLQLAD
RISAEPDFAD VKLIMLSSVS SPDDAKRAQA VGFKRFVNKP VRKAELRQAI LGVSGVAGAG
GGSSRKIGAH ILVVEDNPVN QEVIGQMLRH FGCRVQLASS ALEGLRALCA ERFDLIMMDI
QMPGMDGVEA LGWFRRGPGE RFAFRTPPTT PVVAVTANAL GGDRERFLGL GFDEYLSKPF
RQSQLHTMLS QRLNIPDTGA GELTPAPAEP ALAGAPAIPP AATAGALDAQ ALQRLRDLDP
TGANRLLERV VQAFETSTGR LLPQLDEAHA AGDLDGVKHV AHTLKSSSAS IGALKLSALC
ADIEGMIRNN EVQALGPRVA ALRAEIASVR GSLHALLLPA A