Gene Mpe_A1002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1002 
Symbol 
ID4787178 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp1064436 
End bp1066148 
Gene Length1713 bp 
Protein Length570 aa 
Translation table11 
GC content70% 
IMG OID640089564 
Productphenol-degradation regulator 
Protein accessionYP_001020199 
Protein GI124266195 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG3829] Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCCCA CCACCCACGC GGCGCCATCG GCGACGGTCC GCGCCGACGG ACCGGACGAC 
CGCACGCTCA CCGAGCACCT GCGTTTCTCG CCCGACGCCG GCCACATCGT GCTGTTCGAG
CAGCGCATGC TGCTGATGCA CGCGTCGTCG TTCGCCGAGC TGCGGCGCGA GCTCATCGAA
CAGCTGGGCA CCGGCAAGGC GCGCGAGCTG TTCACGCGGC TCGGCTACCA GCAGGGCTTC
GAGGACGGCC AGCGCCTGCG CGCCCTGTGC GGCACGGACC GTGACCGCAT GCTCGAGCTC
GGCCCGCGCC TGCGCGAGAT CGAAGGCTTC GTGCGCAACC AGCCGATCGA CGCTATGCAG
TTCGACCTCG AAGGCGGCGA GTTCTGGGGC GACTACTACT GGACCTCCTC CTGGGAGGCC
GAAGCCCACC TCAAGCACTG CGGCGTGTTC GGCGAACCGG CCTGCTGGAT GATGGTGGGC
TATGCGAGCG GCTGCTGCAC CGCCATCCTC GGGCAACCGG TGCTGTGGCG CGAGATCGAG
TGCGTGGCGA TGGGCCACGA GCGCTGCCGC GTGGTCGGTC GCCCGCTCGC CGCCTGGGAC
GACCTGAGCG AGCACGAGAC CAACTTCCTG AAGATCGAGT CCTTCGTGCG CACGCCACCG
AAGGCCCGGC CGACCCGCAG TGCGCCGGCC AAGGTCTGCG TCCCGGGCGA GTTCGACGAC
TTCGTCGGCA CCTCGGCCGG CTTCAACGCG GTGGCCAACC TGGTCCGCCG TGTCGGTGTG
ACCGACTCCA CCGTGCTGTT CCGCGGCGAG AGCGGCGTCG GCAAGGAGCG TTTCGCACGG
GCGCTGCATT CGGTCAGCCG CCGGCACGAG CAGCCGATGG TGTCGATCAA CTGCGCGGCG
ATTCCGCCCG ATCTGGTGGA ATCCGAGCTG TTCGGTGTCG AGAAAGGTGC CTTCACCGGA
GCCGACCATT CGCGGCCCGG ACGCTTCGAG CGGGCCCACG GCGGCACGCT GTTCCTCGAC
GAGATCAGCA GCCTGCCGCT GCCGGCGCAG GGCAAGCTGC TGCGCGTGCT GCAGGAGCGC
GAGATCGAGC GCGTGGGCGA CGTGCGCGTG CGCAAGGTCG ACGTGCGACT GGTCGCGGCG
TCCAACCGCG ACCTGCGCGA CGAGGTCGAG GCCGGCCGAT TCCGCGAGGA CCTGTTCTAC
CGGCTCAACG TGTTCCCGAT CGTCATTCCG CCGCTGCGCG AGCGCCTGGA GGACATCCCG
CTGCTCGTCG CACTGTTCCT GGAACGTTGC AACCGGCGCT GCGGCAAGCG CGTCGCCGGC
CTGACCACGC GCGCCTACGA CGCGCTGTGG GACTACCACT GGCCCGGCAA CGTGCGCGAG
CTCGAGAACA TGGTCGAGCG CGCGGTGATC CTGGCCGACG ACGACGGCAA GATCGACGTG
CAGCACCTGT TCTCCGGCGG CGAGCAGCTC AAGCTGCGTT CGCTGAGCGT GGGCGCGCAC
GGCGAGCTGG TCCCGCGCGC CGGCGAGGCC AGCGACCAGT GCAAGCGACT GGCCGACGAC
CTGCTGGCCC GCCTGCCGTC CTTCGAGCAG ATCGAGTCGC TGCTGTTCGA GCGCGCGATG
GAGCGCAGCG ACGGCAACAT CTCCGCGGCC GCGCGCCTGC TGCGCCTGCG GCGCGGCCAG
GTCGAATACC GCCTCAAGAA GCGCGAGTCC TGA
 
Protein sequence
MPPTTHAAPS ATVRADGPDD RTLTEHLRFS PDAGHIVLFE QRMLLMHASS FAELRRELIE 
QLGTGKAREL FTRLGYQQGF EDGQRLRALC GTDRDRMLEL GPRLREIEGF VRNQPIDAMQ
FDLEGGEFWG DYYWTSSWEA EAHLKHCGVF GEPACWMMVG YASGCCTAIL GQPVLWREIE
CVAMGHERCR VVGRPLAAWD DLSEHETNFL KIESFVRTPP KARPTRSAPA KVCVPGEFDD
FVGTSAGFNA VANLVRRVGV TDSTVLFRGE SGVGKERFAR ALHSVSRRHE QPMVSINCAA
IPPDLVESEL FGVEKGAFTG ADHSRPGRFE RAHGGTLFLD EISSLPLPAQ GKLLRVLQER
EIERVGDVRV RKVDVRLVAA SNRDLRDEVE AGRFREDLFY RLNVFPIVIP PLRERLEDIP
LLVALFLERC NRRCGKRVAG LTTRAYDALW DYHWPGNVRE LENMVERAVI LADDDGKIDV
QHLFSGGEQL KLRSLSVGAH GELVPRAGEA SDQCKRLADD LLARLPSFEQ IESLLFERAM
ERSDGNISAA ARLLRLRRGQ VEYRLKKRES