Gene Mpe_A3738 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A3738 
Symbol 
ID4786027 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3954652 
End bp3957405 
Gene Length2754 bp 
Protein Length917 aa 
Translation table11 
GC content75% 
IMG OID640092321 
Productsensor histidine kinase/response regulator 
Protein accessionYP_001022926 
Protein GI124268922 
COG category[E] Amino acid transport and metabolism
[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase
[COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.410794 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.466672 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTAGCG GCTTGCCGCG GGCGCTGCGA TGGCTGGCCG GGTCCGTGCT CGGCCTGTGT 
GCGCTCGCCT GCAGCGCCGC GTCGCCATCG CCGCTGCGCT ACGGGATGCT GGCGGATTTC
CCGCCGTTCC AAATCTGGCC CGAGGGCAGC GAGGCCGGGG GCGCCGATCT GGAAATGCTG
CGCCGGCTGG CGCCCGCGCT CGGGGTGCAG GTGGAGCCGG TGCGCTACAC CGACTTCGTG
GCGCTCGAGC GCGATCTGCG CGCCGGGCGG CTCGACCTCG CCTCGTCGAT GGCCCGCAAC
GCCGAACGCG AGCCGACGCT GGCGTTCTCG GTGCCGTATG CGCTGATCGA GCAGGCGGTC
GTCACCCGCG CGAGCGATCC GTCGGGCTCG CTCGCGGCCG ACCTGAACGG CCGGCGTCTG
GCCACCGTGG CCGGCTACGC CGCCGAGGCC AATGCGCGCG AGCTGTTCCC GCTGGCGCAG
CGGGTGCCGG TGGCCAGCAT CGCCGACGGC CTGCGGGCCG TGCAGCAGGG CCGTGCCGAC
GTGATGATCG AGGCGCAGCC GGTACTCGTC GGCACCATCG AGCGTGAGCG CCTCGCCGGC
TTGCGGCTGG TGCGCACGCT GGCGTTGCCG AGCGGCGCGC TGCATTTCGC GGCACCGAAG
GCCGGCGCCG CCCGGCTCGA CTCGCTCTCG GCGGCGTTGG CGGGCATGGG CGCCGAGACG
CGCGAGTTCA TCATCGGCCG ATGGCGTGCG GAGCCGCATT TCGCGCGACG CGCCGAGCCG
CTGCGCCTGG ACGACCCCGA GCGCGCGCTG CTCACCGCGC AGCGGCCGCT GAATGTCGCC
GTGGTGGACG GGCGGCTGCC GTTCGCGGTG CTCGACGCCG ATGGCCGGCC GCAGGGCCTG
TCGGTCGACG TGCTGCGCGC GGTGCTCGGA CGCCTCGGGC TCGTGTCCGG CACCTGGCGC
GCGAGCGGCG CCACCGAGGC GCTGGCGGCG CTGCAGCGCG GAGAGGTCGA CCTCGCGCTC
GGCCTGTCGG CGGTGGCCAC GCCGGCCGAC ACGGTGCGCT TCATCGGTCC CTACATCGAG
CACCCGATGG TGCTGCTGGG CCGCCCCGGC GGCAGCGCCT GGGGTCTGGA GCAGCTCGTC
GGCCGGCGCC TGGCGCTGCC ACCCGCGCAT TTCGCGCGAC CGTTGATCGC CGCGCGCTAC
CCCGGCATCG AGCTGGTGGC TTGCGATCCG CTGCCGAACT GCGTCGAGCG CGTCGCGAGC
GGCGGCGCCG ACGCGGCACT GGCCGACGTC ATCACCGCGG CGGTGCTGCT GGCCGAATCG
CCGCGCAGCG ACGTGCAGAT CACCGGCACG GCGGCCGGCT TGCGCCAGGA GCACGGCATC
GCGGTGTCGT CTCGTCACGC GGCGCTGCTG CCGCTGCTGC AGCGGGCGCT GGATGCGACC
GTGGCCGACG ACCTGCTCGA GCTCAAGCGC CGCTGGCTGA CGCGTCCGAC GCCGGCGCGC
GTCGTGCGCG AGCTGCTGCT GCGCTACCTG CCCTGGGTCG CGACGGCGCT CGCCCTGCTG
CTCGCGGCGT GGTGGTGGCA CCACCGCGGA CTGCGGCGCG AGATGCAGCG CACGCTCGGC
GCACAGCGCC AGGCCGAGCG GGCCCGCGGC GCGAGCGAGC GTTTCGTGAC CTTCCTGGCC
CACGAGGTGC GCAACTCGCT GCATTCGGTG ATCGCCGGGG CCGAGCTGCT GCGCAGCGGG
CGCGAGGTCT CGGCGTCGGT GGCCGGTTCG CTGGGTGACT CGGCGCGCTC GACGCTGAAC
CTGCTGAACA ACCTGCTCGA CCGCAACCGG CTCGACGCCG GCCGGCTGAG CCTGCACCTG
GAGCCTGTGC AGCTCGGGCC GCTGCTGCGC GGCGTGCGCG CGGAGATGCT GCCGGCCGCA
CGGGCGAGAC AACTCACGCT GACCTGCACC GCGCCGATGC CCGACCCGTT GCTGCGCGTC
GACGCGCTGC GCGTCGAGCA GATCGTGCGC AACCTGGTGG CCAACGCGAT CAAGTACAGC
GAGCGCGGAG AGATCCTGAT CGAAGCCCGC TGCACGCCGC AGGCGGGCGA AGAAGCGTCG
CGCTGCGTGA TCGAAGTGCG CGTGCGCGAC CAGGGCCCGG GCATCGCCGA GGCCGACCAG
GCGCGGCTGT TCGAGCGCTA CTACACCGCC GGCGGCCGGA CGGCGGCGCG CAGCGGCACC
GGGCTCGGGT TGTCGCTGTG CCGCGACCTC GCCGCGCTGA TGGGCGGTGC GCTGCACATC
GAGAGCAGCC CCGGGCACGG CACCACCGCG CTGCTGCGCT GGACGGCCGA CGTCGAGACC
GAGGCCACCG GCCCCGCGCC CCTGGCCGAG GACGCGCCGC GCCGGCTGCT GCTGGTCGAG
GATGCCGATG TCTACGCGAT GCTGCTCGAG CGTGCGCTCG CGGCCCAGGG CTACGCGGTG
CGGGTCGCCG GTTCTCTGGC GGAAGCGCAG GCCGCGCTGG CGGCGCCGGG CCCGCCCTTC
GAGGTGGTGC TGACCGACCT GAACCTCGGC GACGGCGACG CGCACGGCGT GATCGCCGCC
GTGCGCGCCC GCGCGGGCGC AGGGCTGCCG GCGATCGTCG TGATGTCGGC CGATGTCGAC
CCTGAGCGGG TGGCCCCGCT GCGCGAGGCC GGGGCCGAGG CGCTGCTGCA GAAGACCGGC
GACGTGGCGC TGCTGGTGCG CCAGTTGCTG ACGCACGTGA ACACGGGGCC CTGA
 
Protein sequence
MRSGLPRALR WLAGSVLGLC ALACSAASPS PLRYGMLADF PPFQIWPEGS EAGGADLEML 
RRLAPALGVQ VEPVRYTDFV ALERDLRAGR LDLASSMARN AEREPTLAFS VPYALIEQAV
VTRASDPSGS LAADLNGRRL ATVAGYAAEA NARELFPLAQ RVPVASIADG LRAVQQGRAD
VMIEAQPVLV GTIERERLAG LRLVRTLALP SGALHFAAPK AGAARLDSLS AALAGMGAET
REFIIGRWRA EPHFARRAEP LRLDDPERAL LTAQRPLNVA VVDGRLPFAV LDADGRPQGL
SVDVLRAVLG RLGLVSGTWR ASGATEALAA LQRGEVDLAL GLSAVATPAD TVRFIGPYIE
HPMVLLGRPG GSAWGLEQLV GRRLALPPAH FARPLIAARY PGIELVACDP LPNCVERVAS
GGADAALADV ITAAVLLAES PRSDVQITGT AAGLRQEHGI AVSSRHAALL PLLQRALDAT
VADDLLELKR RWLTRPTPAR VVRELLLRYL PWVATALALL LAAWWWHHRG LRREMQRTLG
AQRQAERARG ASERFVTFLA HEVRNSLHSV IAGAELLRSG REVSASVAGS LGDSARSTLN
LLNNLLDRNR LDAGRLSLHL EPVQLGPLLR GVRAEMLPAA RARQLTLTCT APMPDPLLRV
DALRVEQIVR NLVANAIKYS ERGEILIEAR CTPQAGEEAS RCVIEVRVRD QGPGIAEADQ
ARLFERYYTA GGRTAARSGT GLGLSLCRDL AALMGGALHI ESSPGHGTTA LLRWTADVET
EATGPAPLAE DAPRRLLLVE DADVYAMLLE RALAAQGYAV RVAGSLAEAQ AALAAPGPPF
EVVLTDLNLG DGDAHGVIAA VRARAGAGLP AIVVMSADVD PERVAPLREA GAEALLQKTG
DVALLVRQLL THVNTGP