Gene Mpe_A3052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A3052 
Symbol 
ID4784914 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3244345 
End bp3246681 
Gene Length2337 bp 
Protein Length778 aa 
Translation table11 
GC content71% 
IMG OID640091623 
Productdiguanylate cyclase/phosphodiesterase with PAS/PAC sensor(s) 
Protein accessionYP_001022240 
Protein GI124268236 
COG category[T] Signal transduction mechanisms 
COG ID[COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.479414 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAATCG GCCGCGACGC GCGCCTGCTG GCCGTGAACG AACCGCTGGC GGCATGCACC 
GGCTGGCACC GCGACGCCGC CGCCGACGGC GAGGCGTGGC TCGACCGGCT CTCCGCGCCG
TCGCGCCGCG AGCTGCTGAA GGCACTGGAC CATCTCGACA CGGCCGACCG CTTCCGCCTC
GAGCTGCAGT TCCGCCGGCC TGGCGGCGAG GTCGGCTGGC TCGAGTGCGA TGCACGATGG
AACCGCGACA CCGAGCTCTG CCTGTGCGCG CTGCACGACG TGAGCGCGCG CAAGCGCACC
GAGCTCGCGC TGCGCGAGCA GACCACGCAG CTGCGAGTGC TGGCCGACAA CGTGCCGGCG
CTGATCGCCT ACTACGAACA GCAGGGCTTC CGCTGCACCT TCGCGAACCG CCACTATGCG
AGCGCACTGG GCTGGACCCC CGAGAACATC CTCGGCCGCA CCGCGGCCGA GATCGTCGGC
GAGCGCGGCT GGCAACAGAT CCGCCCCTAC ATCGAGCGGG TGGTCGAGAC CCGCGCCCAG
GTGTCCTACG AGCGCGAACT GACCGATGCC GCCGGCCGGC CGCAATGGCT CGAGGTCGAC
GTGCTGCCGA ACTTCGACGA CGAGGGCGAG CTGTCGGGCG CCTTCGTGCG GGTCACCGAC
ATCACCAAGC ACCGGCTGGC CGAGCGCGCC GTGCGCGAAT CGGAAGAACG GCTCGCGAAG
TTCATGCAGG CGAGCGCCGA GGGCATCCTG TTCCACCGCA ACGGCACGGT GATCGACATC
AACGAGCCGA TCTGCGAGCT CACCGGCTAC ACCCGCGACG AGGTCATCGG CCGGCAGACC
GCGGAGTTCC TGGCGCCCGA CCACCTGCCG CGCGCGCAGC AGGCGCTGGC CACCGGCGCC
GAGGCGGCGT GGGAGAGCGT CGCGCTCGGC CGCGCCGGAG AACGCATCCC GGTCGAGCTG
ATCACGCGCT CGATCATGCG CAACGGCGAG CGGCTGCGCC TGGTGATCGT GCGCGACATC
CGCGAGCGTG TGGCCGCGCA GGCGCGCATC CACCACCTCG CGCACCACGA CGCGCTGACC
GGGCTGCCCA ACCGCGCCGC CTTCGGCCTG CAGCTCGAGC GACTCACGGC CTCGCACCGC
AGCGGGGACG CGCAGATCGC GCTGCTGTTC ATCGACCTCG ACCACTTCAA GCGCATCAAC
GACTCGCTCG GCCACCTGGT CGGCGACCTG CTGCTGCAGA CCGTGACGGC GCGCATCACC
GACAACCTTC GCGCCGACGA CCTGGTGGCA CGCTTCGGCG GCGACGAGTT CATGGTGCTG
ATCCCGGGCG TGACCGACCG CAGCGTCGTG GAGCAGGTGG CGGGCAAGCT GCTGGCGGTG
ATCGAGGCGC CGCTCGAGGC CGCGGGCCGG TCGATCTCGG TGTCGCCGTC GATCGGCGTC
TCGCTGTTCC CCGAGCATGG CCGCACGCCG GCCGAGCTGA TCCAGCATGC CGACGCGGCG
ATGTACATGG CCAAGGCGCG CGGTCGCGCC AACGTGCAGT TCTTCGACCC GGTGGTGGCC
AGCGCCGCCT ACGACGCGCT GGTGGTCGAG AGCCAGCTGG CGCAGGCGCT GGAGACCGGG
GCCTTCGAGC TCCACTACCA GCCGCAGCTG CGCTCCGGCG ACGGCCGGCT GGTCGGGGTC
GAGGCGTTGA TCCGCTGGCA CCATCCCGAG CGCGGCCTGC TGCTGCCCGA TGACTTCATC
CCGGTGGCCG AGGAGCGCCG GCTGATGCTG CCGATCGGCC AGTGGGTGCT GCGCGAGGCG
ATGCGCTGCG CCCGGCGCTG GCATGCGCAG GGGCTCGAGC TGCCGATCTC GGTCAACCTC
AGCAGCATGC AGTTCCAGCA GCGCGGCTTC GTCGACTCGC TGGCCGAACT GCTGCGGCAG
GAGCAGGTGA ACGGGGCATG GCTGGAGCTC GAGCTCACCG AGCGCATGCT GATGGACGAC
CTGGACGAGG TGAAGGCCAC GCTGGCGCAG CTCAAGGCGC TGGGCATGCG CATCTCGGTC
GACGACTTCG GCACCGGCTA CTCGTCGCTC GGTCACCTGA AGGAACTGCC GATCGACAAG
GTGAAGATCG ATCGATCCTT CGTGCAGGAC GTGCCGCAGA ACGCCGACGC CGCGGCCATC
GTGCAGGCGA TCGTGCAGCT CGGCCGCAGC CTGGGCATGA CCGTCATCGC CGAGGGCGTG
GAGACCGAGG CCCAGGAGCG CTTCCTGCGC GAGCTGGGTT GCGAGGAACT GCAGGGCCTG
CGGATCGCGC CGCCGCTCTC GGAGGCCGAC CTGCAGCTCT GGGCCTCGCG GCGCTAG
 
Protein sequence
MAIGRDARLL AVNEPLAACT GWHRDAAADG EAWLDRLSAP SRRELLKALD HLDTADRFRL 
ELQFRRPGGE VGWLECDARW NRDTELCLCA LHDVSARKRT ELALREQTTQ LRVLADNVPA
LIAYYEQQGF RCTFANRHYA SALGWTPENI LGRTAAEIVG ERGWQQIRPY IERVVETRAQ
VSYERELTDA AGRPQWLEVD VLPNFDDEGE LSGAFVRVTD ITKHRLAERA VRESEERLAK
FMQASAEGIL FHRNGTVIDI NEPICELTGY TRDEVIGRQT AEFLAPDHLP RAQQALATGA
EAAWESVALG RAGERIPVEL ITRSIMRNGE RLRLVIVRDI RERVAAQARI HHLAHHDALT
GLPNRAAFGL QLERLTASHR SGDAQIALLF IDLDHFKRIN DSLGHLVGDL LLQTVTARIT
DNLRADDLVA RFGGDEFMVL IPGVTDRSVV EQVAGKLLAV IEAPLEAAGR SISVSPSIGV
SLFPEHGRTP AELIQHADAA MYMAKARGRA NVQFFDPVVA SAAYDALVVE SQLAQALETG
AFELHYQPQL RSGDGRLVGV EALIRWHHPE RGLLLPDDFI PVAEERRLML PIGQWVLREA
MRCARRWHAQ GLELPISVNL SSMQFQQRGF VDSLAELLRQ EQVNGAWLEL ELTERMLMDD
LDEVKATLAQ LKALGMRISV DDFGTGYSSL GHLKELPIDK VKIDRSFVQD VPQNADAAAI
VQAIVQLGRS LGMTVIAEGV ETEAQERFLR ELGCEELQGL RIAPPLSEAD LQLWASRR