Gene Mpe_A0956 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0956 
Symbol 
ID4787339 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp1013538 
End bp1015574 
Gene Length2037 bp 
Protein Length678 aa 
Translation table11 
GC content70% 
IMG OID640089518 
Producttranscriptional regulatory protein 
Protein accessionYP_001020153 
Protein GI124266149 
COG category[K] Transcription
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3284] Transcriptional activator of acetoin/glycerol metabolism 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCAACA GCCATTTCCC CAAGCCCGTG ACGACGCCGC CGTTGTCGAT CGGGCCACAA 
GGCGAGTTCG AGCCGCAAGG TGTTCGGGAG ACCACACAGG CCTGGGAGCA GTTCGTCTCC
GGGCAGTCGC TCGTGCAGTC GGCCGTGCCC AAGCACGTGC TGTCGTCGTG GCAGCGCAGC
CGCCGCTTCG GCGTCGAACC CGGCTCGCGC AAGGCCCCGC TCGCGATCCG GGGCGACGAC
GAGCTCGAGC AGCTCCGGCT GCGCAACCGC GATCTGGTCT GGGCCGCGCA GGGCATCTTC
ATGTCGTCGG CACACCTGCT CGCGAAGTCG GGCTCGATCA TGCTGCTGAC CGATGCCACC
GGCATCGTGC TCGAATCATC GGGCGATGCG CGCACGCTGG ACGCGGCGCA GGATATCCAC
CTGACCCACG GCGGCAACTG GAACGAGAGC GTGGTCGGTA CCAACGGCAT CGGCACCGCG
CTGGCCACCG GCCGGCCGAT CCAGGTGCAT GCCGCGCAGC ATTTCTGCGA AGGCATCAAG
AGCTGGACCT GCGCGGCCGC CCCGGTCTAC CTGTCCGGCA CCGATCAGCT GCTCGGCGTG
ATCGATATCT CGGGCCCGCC CGCGACCTTC CAGCTCAACA ACCTCGCGCT CGCCGTGGCG
TGCGCCCGGC AGATCGAGTC GGTGCTGGCC GAGCGCACCA GCCGCGAGCA CAACGTGCTG
CTGGAGGCCT GCCTCAACCA CCCGGGGCGG GCCGGCGCCG CCGCCATGGT GGTGATCGAC
CGCAACGCGC GCATCGTGCA CAGCAGCGGC TGCCTCACGC CGGATCTCGC GGTGCGGCTG
GCGGACGGTC TCAAGGGGCG CCAGATCTCC GCCTGGAACA ACCGGCTGCC GGACGGGCTG
CTGGGCGAAT GGATGAACCC GGTGCGCCTC GACGGCAGCG CGATCGGCGC GCTGCTGATC
GTGCCCAAGC GCTCGATCGG CCGGATCCTG CAGCGCCCGG AAGGCGATCT GCCCCAGCTT
CCGGCACCGG CCGCGCCGAC CCTCGCCGCC GCCCCGGCAC CGGAGTGCCT GCCGGGCGTG
GTGGGCGCCA GCAGTGTCTT TCGCGGCGCG ATCGAACGCA CCAAGCTGCT CGGGCGTCGC
CGGGTCTCGG TGCTGATCCA GGGCGAAACC GGAGCGGGCA AGGAGTTGTT CGCCCGCGCA
CTGCACGAGG AGGAGCGCAA GGGCGGCAGC TTCGTGGCCT TCAACTGCGG CGCGACGACC
AAGGAGCTGA TCGGCAGCGA GCTCTTCGGG CACGTGCGCG GCGCCTTCAC CGGGGCCACC
AGCGAAGGAC GAGCCGGGCG CTTCGAACTC GCCCACGGCG GCACCTTGTG CCTCGACGAG
GTGGGCGAGC TGCCGCTGGA CCTGCAGCCG GTGCTGCTGC GCGCGCTCGA GGAGGGCGTG
GTCTACCGGC TCGGCGACAC CACGCCGCGG CCGGTCGACG TGCGGCTGGT GGCGATGACC
AACCGTGATC TGCTGCAGGA GGTCGAGGCA GGGCGCTTCC GTCGCGATCT CTTCCATCGC
ATCGGCGTCA CGCGGATCCA GGTGCCGGCC CTGCGGGAAC GCGATGGCGA CGTCGACCTG
CTCGTCGACC ACTTCGTGCG GACGCTGTCG GTGCGCCATG GCGTCGCGAG CCGGGAGATC
GGCCCGGACG TTCGGCAGCT GTTGCGGGCC TATGCCTGGC CCGGCAATGT GCGTGAACTG
CGCAACGTGA TCGAGTCGCT CCTGCTCACC TCCGATGACG AGGTGGTGCG GCGGGAGGAG
TTGCCGGCAG AACTGCTGGC GACGCTCGAC GGCGCCAAGG CTCCGGCGCT CGACGCCGAT
CTCACGAGCC TGGAGGCCAC TGAACGCCTG ACCATCCTGC AGGCGATCCA GCGCGTGCAC
GGCAACCTGG CGTTGGCTGC ACGGGTGCTC GGCATCTCGC GCAGCACGCT GTACCGGAAG
GTCGAGCGCT ACCAGCTCGA CGATGTCGTG AAGGCGAGCA ACGATGGCGA AAACTGA
 
Protein sequence
MVNSHFPKPV TTPPLSIGPQ GEFEPQGVRE TTQAWEQFVS GQSLVQSAVP KHVLSSWQRS 
RRFGVEPGSR KAPLAIRGDD ELEQLRLRNR DLVWAAQGIF MSSAHLLAKS GSIMLLTDAT
GIVLESSGDA RTLDAAQDIH LTHGGNWNES VVGTNGIGTA LATGRPIQVH AAQHFCEGIK
SWTCAAAPVY LSGTDQLLGV IDISGPPATF QLNNLALAVA CARQIESVLA ERTSREHNVL
LEACLNHPGR AGAAAMVVID RNARIVHSSG CLTPDLAVRL ADGLKGRQIS AWNNRLPDGL
LGEWMNPVRL DGSAIGALLI VPKRSIGRIL QRPEGDLPQL PAPAAPTLAA APAPECLPGV
VGASSVFRGA IERTKLLGRR RVSVLIQGET GAGKELFARA LHEEERKGGS FVAFNCGATT
KELIGSELFG HVRGAFTGAT SEGRAGRFEL AHGGTLCLDE VGELPLDLQP VLLRALEEGV
VYRLGDTTPR PVDVRLVAMT NRDLLQEVEA GRFRRDLFHR IGVTRIQVPA LRERDGDVDL
LVDHFVRTLS VRHGVASREI GPDVRQLLRA YAWPGNVREL RNVIESLLLT SDDEVVRREE
LPAELLATLD GAKAPALDAD LTSLEATERL TILQAIQRVH GNLALAARVL GISRSTLYRK
VERYQLDDVV KASNDGEN