Gene Mpe_A1168 details


Gene Information

Locus tag: Mpe_A1168
Symbol:
ID: 4785567
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 1250384
End bp: 1252441
Gene Length: 2058 bp
Protein Length: 685 aa
Translation table: 11
GC content: 72%
IMG OID: 640089731
Product: sensor histidine kinase
Protein accession: YP_001020364
Protein GI: 124266360
COG category: [T] Signal transduction mechanisms
COG ID: [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
TIGRFAM ID:
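
The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length; both assumptions are consistent with the values listed here, but the record does not state them. The function names are illustrative only.

```python
# Values copied from the record; coordinates assumed 1-based and inclusive.
START_BP = 1250384
END_BP = 1252441


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2058 685
```

Both results agree with the reported gene length (2058 bp) and protein length (685 aa).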


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.527437
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGTTCT ACGAGCGCTT CCAGCGGCTG CCGATCCGCA GCAAGCTGCT GGCGATGGTG 
CTGCTGCCGT TGGTGGTGGT GCTGCCGCTG CTGGGCCTGC TGCTGCTGGT CTGGGGCAAC
GTGGCGCTGG ACCGCCTGCT GATCACCAAG GTGCGCAGCG ACCTGGCGGT GGCCCAGGGC
TACTTCGAGC GCGTGCTCGG CGAGGTGGGC AGCAGCGCCG CCGCCGTGGC CGACTCGCAG
GCACTGCACC GTGCGCTGGA CGACGACACG GCGGCGACGT CGGCCGACGC GGGGACGTGG
GTGACGCTGC TGCAGGCGTT CAAGGCCCGC GAAGGCCTCG ATTTCATCAA CTTGCGCGCC
CCCGACGGGA CGTTGCGCAT CACCGACTTC GGCGCGGCCC CGGCCACCGA CGGGCGCTCG
CCGGCGTTCC GTGCGGCGGC GGCCGAAAGC GGCCGGGCAC GCGCCAGCAT CGAGGTGCTG
CAGCCCGACG AACTGGCGCG TCTCGCGCCG GCGCTGCAGG ACCGCGTCGC GGTGCCGCTG
GTGAGCACGC GCAACGCCGC GCCCACGGCT CGCACGCAGG AAGACCGCGC GATGGTCGTG
CTGGCCACCG CGCCGGTGCA CGACGAACGC GGCGTGCTGC GCGGCCACGT GCAGGCCGGC
GTGCTGCTGA ACCGCAACCT GCCCTTCATC GACCACATCA ACGAGATCGT CTACCCCGAA
GGCGCGCTGC CCTTCGGCAG CCGCGGCACC GCGACGCTGT TCCTCGACGA CGTGCGCATC
AGCACCAACG TGCGGCTGTT CGGCGACGAT CCCAAGAACC GCGCCATCGG CACCCGCGTG
TCGCAGGCGG TACGCGACAC CGTGCTCGGC GGCGGTCAGC CGTGGCTGGA CCGCGCCTTC
GTCGTCAACG ACTGGTACGT CTCGGGCTAC CTGCCGCTGG CCGACGGCGC CGGCCGGCGC
GTCGGCATGC TCTACGTCGG CTACCTGGAA CGGCCCTTCA CCTGGCTGAA GTACGCCGTG
CTGCTGAGCA TCGGCGCGAT CTTCTTCGCC GTGATGATCG GCGCCACCGT GGTGTCGCTG
CGCTGGGCGC GCAGCATCTT CAAGCCGCTG GAGCAGATGG CGCGCACGAT GCAGCAGGTC
GAGGCCGGTG GGCTCGACGC GCGCGTCGGC GCGGCCGGCC ACCACCCCGA CGAGATCGGC
CGGCTCGCCG CCCACCTGGA CCACCTGCTC GACGTGATCG ACGACAAGAC GCGCGCGCTG
CAGCGCTGGG GCGACGAGCT CGACCGCAAG GTGGTCGAGC GCACGCGCGA CCTGGAGCAG
GCGCAGGCGC AGCTGCTGCG CTCGGAGAAG CTGGCCACCG TGGGCCAGCT CACCGCCAGC
ATCGCGCACG AGGTCAACAA CCCGATCGCG GTGATCCAGG GCAACCTCGA CCTGCTCCGC
GAGCTGCTCG GGCCGCAGAC CGCCGCCAAG GTCGACGCCG AGCTGCGGCT GGTGGACGAG
CAGATCGAGC GCATGCGGCT GATCGTCACG CAACTTCTGC AGTTCGCGCG CCCGAACGAA
TACGCCGGCT ACGTGGACAG CGTGAGCGTG GCGCGCGCGC TCGACGACTC GCTGCTGCTG
GTCGGGCCTC AGCTCGCGCG CACCCGGATC GCGGTGCAGC GCGACGACCG GGCGACGGCC
AGTGCCGCCA TCAACCGCCA GGAGCTGCAG CAGGTGCTGC TCAACCTGCT GATCAACGCG
CTGCACGCGA TGCCCGACAG CGGCACGCTG TCGCTGCACA CACGCGACTG GCACGCCGCC
GACGGCCGCG TGCAGGGCGT GCAGATCGAC GTGGCCGACA GCGGCCCCGG GCTGGGACCG
GAGATCGAGT CGCGGCTGTT TCAGCCCTTC GTCACCACCA AGACCGACGG CACCGGCCTG
GGCCTGTGGA TCAGCCGCAG CCTGATCGAG CGCTACGGCG GCACGCTGAC CGCGGCCAAC
CGCGACGACG GCGCGCGCGG CGCGGTGTTC AGCGTGCGGC TCTACAGCGA ACTGCCGGAG
ACGACGCTGC CGGCCTGA
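
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined into one string before any computation. As a minimal sketch, the helpers below (hypothetical names, not part of any database tooling) strip the block formatting and compute the G+C fraction, which could be applied to the full sequence to check the reported GC content of 72%. Only the first line of the sequence is used here as a demonstration.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied as printed (six blocks of 10 bases).
first_line = "ATGACGTTCT ACGAGCGCTT CCAGCGGCTG CCGATCCGCA GCAAGCTGCT GGCGATGGTG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(gc_fraction(seq[:10]))  # 0.4 for the first block, ATGACGTTCT
```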
 
Protein sequence
MTFYERFQRL PIRSKLLAMV LLPLVVVLPL LGLLLLVWGN VALDRLLITK VRSDLAVAQG 
YFERVLGEVG SSAAAVADSQ ALHRALDDDT AATSADAGTW VTLLQAFKAR EGLDFINLRA
PDGTLRITDF GAAPATDGRS PAFRAAAAES GRARASIEVL QPDELARLAP ALQDRVAVPL
VSTRNAAPTA RTQEDRAMVV LATAPVHDER GVLRGHVQAG VLLNRNLPFI DHINEIVYPE
GALPFGSRGT ATLFLDDVRI STNVRLFGDD PKNRAIGTRV SQAVRDTVLG GGQPWLDRAF
VVNDWYVSGY LPLADGAGRR VGMLYVGYLE RPFTWLKYAV LLSIGAIFFA VMIGATVVSL
RWARSIFKPL EQMARTMQQV EAGGLDARVG AAGHHPDEIG RLAAHLDHLL DVIDDKTRAL
QRWGDELDRK VVERTRDLEQ AQAQLLRSEK LATVGQLTAS IAHEVNNPIA VIQGNLDLLR
ELLGPQTAAK VDAELRLVDE QIERMRLIVT QLLQFARPNE YAGYVDSVSV ARALDDSLLL
VGPQLARTRI AVQRDDRATA SAAINRQELQ QVLLNLLINA LHAMPDSGTL SLHTRDWHAA
DGRVQGVQID VADSGPGLGP EIESRLFQPF VTTKTDGTGL GLWISRSLIE RYGGTLTAAN
RDDGARGAVF SVRLYSELPE TTLPA
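
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below is a self-contained illustration rather than the database's own method: it builds the standard codon-to-amino-acid mapping, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino acid string.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a block-formatted coding sequence, dropping one trailing stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First three blocks and the final line of the gene sequence, as printed in the record.
print(translate("ATGACGTTCT ACGAGCGCTT CCAGCGGCTG"))  # MTFYERFQRL
print(translate("ACGACGCTGC CGGCCTGA"))  # TTLPA
```

These fragments match the first ten and last five residues of the listed protein sequence.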