Gene Mpe_A1739 details

Gene Information

Locus tag: Mpe_A1739
Symbol:
ID: 4785292
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 1865037
End bp: 1866368
Gene Length: 1332 bp
Protein Length: 443 aa
Translation table: 11
GC content: 70%
IMG OID: 640090310
Product: two-component sensor histidine kinase
Protein accession: YP_001020934
Protein GI: 124266930
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
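
The start, end, and length values in this record are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; both are assumptions inferred from the numbers in the record, and the function names are illustrative rather than part of any database API.

```python
def feature_length_bp(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length_bp):
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_length = feature_length_bp(1865037, 1866368)
print(gene_length)                     # 1332
print(protein_length_aa(gene_length))  # 443
```

Both results match the Gene Length (1332 bp) and Protein Length (443 aa) fields listed in the record.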


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.337337
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCCGCA AGCTTGACGC GGCGACGACC GGCGCTCGCG CCGCGATCGT CGACACCTCG 
ACCGGCCTGA ACAACATGCA GCAGCTGATC CAGCTGCGGT GGATCGCGGT GGTGGGCCAG
GTGGCGACGA TCGCGGTCGT GCACTTCGGC TTCGGGATCC GGCTGCCGCT GGAGCAGATG
GCGGCGACGC TGGCCTGCCT GGTCGCGTTC AACGTCGCGA GTCAGGCTCG CTGGCGATCG
CGCCAGCGGG TGTCCGACCG CGAGCTCTTC GTCGCGCTGC TGGTCGACGT CGGCGCGCTG
ACGGTCCAGC TCTCTCTGAG CGGCGGGACC GGCAACCCTT TCGTCTTCCT GTTCCTGCTG
CAGGTGATCC TGGGAGCCGT GCTGCTGGAG ACGGCGTCGA TCTGGACGCT GGTGGCGATC
ACCGGAGGCT GCGTCGTCGC CCTCACCAGC CTGCACCGCC CGCTGCCGAT CCCGCTCGAC
CCCGGGGGCG GTCTCGCCAG CCCCTACATC CGCGGCCTGC TGATCTGCTT CGTGCTCGAC
GCGGCGCTGC TGGTCACCTT CGTCACCCGC ATCGAGCGCA ACCTGCGCCA CCGTGACGCC
CGGCTCGCCG CACTGCGCCA GCGCGCAGCC GAGGAGGACC ACATCGTGCG CATGGGGCTG
CTCGCCTCCG GCGCGGCCCA CGAGCTCGGC ACGCCGCTGT CGACGCTGTC GGTGATCCTC
GGCGACTGGC GCCACTTGCC GCACTTCACC TCCGACCCCG AACTGCACGG CGAGGTCTGC
GAGATGCAGA CGCAGATCGA GCGCTGCAAG GCGATCGTCA GCGGTATCCT GCTGTCGGCC
GGCAAGGCGC GCGGCGAGGC CTCGGTGGCC ACCACGCTGG GTCGCTTCCT GGACGACCTG
CTGGCCGAAT GGCGCACCAC GCGACCCGCC GTCACGCTGA CCTATGCGAA CCGCTTCGGC
GACGACGACC CGCCGATCGT CTCGGATTCG GCCGTCAAGC AGACCCTGCA CAACCTGCTC
GACAACGCGC TCGAGGCCTC GCCACGCTGG GTCGGCCTGG ATGTCGACCG CGACGGCGAC
ACGCTGGTGC TGACCGTCAA GGACACCGGA CCCGGCTTCG CGCCGGAGAT GCTGGCGCAG
TTCGGCAAGC CCTACCAGTC GAGCAAGGGG CGACCCGGGG GCGGCCTGGG ACTGTTCCTG
GTGGTCAACG TCGTGCGCAC GCTGGGCGGT ACCGTCGCTG CGCGCAACCG CTCGAGCGGT
GGCGCGATCG TCACAATACG GCTGCCGCTG TCAGCGATCA CGCTCGAGGA ACCGGAAGCC
CATGGACGTT GA
 
Protein sequence
MRRKLDAATT GARAAIVDTS TGLNNMQQLI QLRWIAVVGQ VATIAVVHFG FGIRLPLEQM 
AATLACLVAF NVASQARWRS RQRVSDRELF VALLVDVGAL TVQLSLSGGT GNPFVFLFLL
QVILGAVLLE TASIWTLVAI TGGCVVALTS LHRPLPIPLD PGGGLASPYI RGLLICFVLD
AALLVTFVTR IERNLRHRDA RLAALRQRAA EEDHIVRMGL LASGAAHELG TPLSTLSVIL
GDWRHLPHFT SDPELHGEVC EMQTQIERCK AIVSGILLSA GKARGEASVA TTLGRFLDDL
LAEWRTTRPA VTLTYANRFG DDDPPIVSDS AVKQTLHNLL DNALEASPRW VGLDVDRDGD
TLVLTVKDTG PGFAPEMLAQ FGKPYQSSKG RPGGGLGLFL VVNVVRTLGG TVAARNRSSG
GAIVTIRLPL SAITLEEPEA HGR
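
The sequences above are displayed in blocks of 10 characters, 60 per line, so they need to be joined before any analysis. The following is a minimal sketch, assuming the blocks have been copied as plain text; `clean_sequence` and `gc_fraction` are hypothetical helper names, not functions from the source database.

```python
def clean_sequence(text):
    """Join a block-formatted sequence by removing all whitespace."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of bases in a DNA string that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# The first two blocks of the gene sequence, as displayed in the record.
first_blocks = clean_sequence("ATGCGCCGCA AGCTTGACGC")
print(len(first_blocks))          # 20
print(gc_fraction("ATGCGCCGCA"))  # 0.7
```

Applying `clean_sequence` to the full gene sequence and taking `len()` of the result gives a value that can be compared with the Gene Length field, and `gc_fraction` on the same string can be compared with the GC content field.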