Gene Mpe_A3044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A3044 
Symbol 
ID4784966 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3234663 
End bp3236090 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content72% 
IMG OID640091615 
Productmethanol utilization control sensor protein MoxY, putative 
Protein accessionYP_001022232 
Protein GI124268228 
COG category[T] Signal transduction mechanisms 
COG ID[COG3851] Signal transduction histidine kinase, glucose-6-phosphate specific 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.739153 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTTTCT GGAATCGGCG CCGCGCCGGC GGGTGGCCGG CCGATACCAT GCGGCGCATG 
AGTCTGCGCT TGAAGATCCA CCTGATCGTC GGTGTGCTGG TGGCGCTGTG CCTGGTGGCC
GTCATGGCGC TGCAGGTCAA GAGCGCGCGC GACGCGATCC GGGAGGAGAT CGAGGCCGCC
AACCGCGTGG CCGCGCAGTT GCTGCAGCGC ACGCTGTGGC TGCAGGCGGC GCGCGGCACG
CCGGCGATGA TCGGCTACCT GCAGGGGGTG GGCCGCGTGC GGGCGAACGA CATCACGCTG
CTCGACGGCA AGGGCGAACT GGTCTACCAG TCGCCGCCGT CGCCCTACAA GTCGGGCCGA
GATGCGCCCG ACTGGTTCGT CGATTTCATG GCGCCGCCGC TGGAGCCGCA GAAGATGGAT
TTCCCCGACG GCACGCTGGT GGTGCGGGCC GACCCCTCGC GCGCGGCGCT CGATGCCTGG
GACCAGTTCG CGGTGCTGGG GTTGGCCGCG CTGGGCGTGT TGGCGGTTCT CAACCTGGTG
GTGTTCTGGG TGGTCGGCCG CACGGTCGAG CCGTTCGGGC AGATCGTCGC GGCGCTCAAC
CGCATCGAGG CCGGGCAGCT CGACGTCACG CTGCCGCGCC TGCCGGGCAC CGAGGCGGCT
GCCATCGGCG CCGCCTTCAA CCGCATGGTG GTGGGCGTCA GCGAGCGCAT CGAGGCCGAG
CGCCGGGCCG CGCAGGCCGA GCACGAGCTG TCCGACCGCC GCGACCTGGC ACGCTGGATC
GACCGCCACC TGGAGCAGGA ACGCCGCCTG ATCGCTCGTG AGCTGCACGA CGAACTGGGC
CAGTCGGTGA CCGGCATGCG CAGCCTGGCG CTGTCGGTGG CGCAGCGTGT CGCCATCGCC
GACCCCGAGG CCGCGCGCGC CGCGCAGGTG ATCGCCGACG AAAGCTCGCG CCTCTACGAT
GCGATGCACG GCCTGATCCC GCGGCTGGCG CCGCTGGTGC TCGACGTCTT CGGGCTGGCC
GATGCGCTGC GCGACCTGGT CGAGCGCACC CGGGTCAGCC AGCCGCAGGC CAGCGTCGAA
CTGCACATCG ACCTGGGCGA CGTGCAGCTG GGCAGCGAAG CGACGCTGGC GCTGTACCGT
GCGGCCCAGG AGGGGTTGAC CAACGCGCTG CGCCACGGCC AGGCCAGGCA GCTGAGCGTC
AGCCTGCATG CCGAGTCCGA AGGCGCCGAG CTGCAGGTCG ACGACGACGG CCAGGGCCTT
GCGCCCGACT GGCGCGAGAA GGCGCGGCAG GACGGCGGCC ACTACGGCCT GCGCTGGCTG
GCCGAGCGCG TGGAGGCGCT GGGCGGTGTG CTGCGCATCG ACAACCGCAG CCCGCGCGGT
GTCGCCCTGC GGGTGTGGTT GCCGTTCACC GCGGCGGAGC CGGCGTGA
 
Protein sequence
MIFWNRRRAG GWPADTMRRM SLRLKIHLIV GVLVALCLVA VMALQVKSAR DAIREEIEAA 
NRVAAQLLQR TLWLQAARGT PAMIGYLQGV GRVRANDITL LDGKGELVYQ SPPSPYKSGR
DAPDWFVDFM APPLEPQKMD FPDGTLVVRA DPSRAALDAW DQFAVLGLAA LGVLAVLNLV
VFWVVGRTVE PFGQIVAALN RIEAGQLDVT LPRLPGTEAA AIGAAFNRMV VGVSERIEAE
RRAAQAEHEL SDRRDLARWI DRHLEQERRL IARELHDELG QSVTGMRSLA LSVAQRVAIA
DPEAARAAQV IADESSRLYD AMHGLIPRLA PLVLDVFGLA DALRDLVERT RVSQPQASVE
LHIDLGDVQL GSEATLALYR AAQEGLTNAL RHGQARQLSV SLHAESEGAE LQVDDDGQGL
APDWREKARQ DGGHYGLRWL AERVEALGGV LRIDNRSPRG VALRVWLPFT AAEPA