Gene Mext_3063 details

Gene Information

Locus tag: Mext_3063
Symbol:
ID: 5835427
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 3406649
End bp: 3409660
Gene Length: 3012 bp
Protein Length: 1003 aa
Translation table: 11
GC content: 71%
IMG OID: 641368863
Product: diguanylate cyclase
Protein accession: YP_001640523
Protein GI: 163852480
COG category: [T] Signal transduction mechanisms
COG ID: [COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain
TIGRFAM ID: [TIGR00229] PAS domain S-box; [TIGR00254] diguanylate cyclase (GGDEF) domain
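
The Start bp, End bp, Gene Length and Protein Length values in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the gene length includes a single stop codon; neither convention is stated on this page, although both are consistent with the figures given. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 3406649
END_BP = 3409660
GENE_LENGTH_BP = 3012
PROTEIN_LENGTH_AA = 1003


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Amino-acid codons in a CDS, assuming the length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold for this record: the span covers 3012 bp, and 3012 / 3 = 1004 codons, one of which is the stop codon.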


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.964207
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGCGT CCCGTCCGGT GCCACACAGC GATCGTGGAG CCGCCGCACC CGTCCAGAAC 
GCGCCGCGAG AGGACGGGCT GGAAAGCCGC CTCGACGCCC TGTGCCGCTC CGTCGCCGAG
GTCTTCGCCG TCCCCATGGC GGCGATCGCG CTCATCGACG CGGATCGCAT CCGTTTCCGC
GCCCGCTACG GGCTCGCGGA GGCCGTCATC GACCGCGACG ACGCCCTCTG CCATCACACA
ATCCACCAGC CGCGCGGTCA CGCTCTCGTG GTGCCGGACC TGATCCGTGA CGAGCGTTTC
GTCCATTCTC CCCTCGTCGT CGGCGCGCCG CATGCCCGCT TCTACGCGGG TCTGGCGATC
CGCTCGGGAG CGGGCCGGGT CGTCGGCACG CTGTGCCTGA TGGATCGGGT GCCGCGGGAC
GATGTCTCGT CCGACCGCGT GCGCGTCCTG CAAGAATTGG CGCTCGTCGC CGAGGCGCAT
CTCGAGCTCG ACGAAGCCCG GCGCGCGAGC GAGGCCGCGG AACGCCGGCG GGCGGAGGCG
CATCTCTTGG AATGGGAGGC GCGCCAGAGG GCGCTCGAGG CGGCCCACGC CATGGCCGAA
CAGATCGCCG CCTTCGGCCA TTGGCGGATC GATGCGGCCA CCCGCACCAT CGCGTGGTCG
GACGGGATCG CCCGCATCTT CGGGCGCAAC GCCGAACGCG CGACGCTGCC GCTCGAAACC
CATATCGGCT TCTATCATCC GGATGATCGC GAACGCGTCT GGGCCGCCAT GGACGAGGCG
CTCGCGGGCC GCAGCCGGAC CCTGGGGGGT GGCTACGAGC ACCGCTCTCG CATCCTTCGT
CCCGACGGCG AGATCCGGGT GGTGGCCGTT CACGGGATCG GTGAACACGA TGAGGCGGGC
CGGCTCGTCT CGATCTTCGG CGTCTGCCTC GATGTCACCG GCATGGCCCG CTCCGAGCAG
CGCCTGCGCG AGACCGGCGA GGCAATGCGG GCCGCTCTCG AGGCGATGGA TCAGGGCCTT
GTCATGATCG GACCCGACGA CCGGGTTCAG GTCCACAACC AGCGCGTCCG CGATCTCCTC
GAACTGCCAG AGGACGTCCT GCACGAGGGT GTGTCCTACC GGGCGGTGCG GCGCTTTCTC
GGCCGGCGCG GCGAGTTCAT GCATGCGCCG CCCGAAGCCC AGGAATGGCT GGAGCACGGT
GACTTCCCGC CCGGCGTCCA ACGCTACGAG GGGATGCGGC CCAACGGCAC GATCCTGGAG
GTGCGGCACG CTCCGATGGC CTCCGGCTGC CACATCTGCA CCTTCACCGA CCTGACGGCG
TCTCGGCAGA GCGAGGCGGC CCTGCGCTCG GCCGAGGCCG ATTACCAGTC GTTGTTTCAG
AATGCGGTGA TCGGCGTCTA TCGGGCCCGG CTCGACGGCG GCATCGTCCA AGCCAACCGG
GCGCTCGCCC GGCTGCACGG CTACGGCGAT GCGGACCTAT CCCTGCCTGA GGGCGGCTTC
AGCCACGACT GGTACATCGA GCCGGGCCGG CACGAGGCCT TTCTGGCCTG CCTGGAGCGC
GAGGGCCACG TCGAGGACTT CGTATCCGAG GTGCGCCGCC ACGCGGGCGG GGAACGCATC
TGGGTCTCCG AGACGGCCTG GGTGGTGCGC GACGCGGCGG GCCGGCCGAT CTGGTTCGAG
GGCACCGTCG CGGATGCGAC GGAGCGCAAG CGTGCCCAGG CGCTGATCGA GCACATGGCC
CGCCACGACG CGCTGACCGG GCTGCCCAAC CGGCGGCTGT TCCAGGAGAC TCTGGCCCGG
GAGATCGACG GGGCCCGGCG CGACGGCGGC TCGGTGGTGG TGCTGTGCTG CGACCTCGAC
CGCTTCAAGG CGGTCAACGA CACCTTCGGG CATCCCGCGG GCGACGCCCT GCTCCGCGTC
ATCGCGGGCC GCCTCCGCGC GACCCTGCGC GAGGGCGACG TGGTGGCCCG GCTCGGCGGG
GACGAGTTCG CGATCATCCT GCCAAGCCGA GGGAAGCAGC GCCGTATCGC CGCCTTCGCC
CGCCGGCTGA TCCAGGCCGC CGGGCGGCCG GTCGATCTCG GCGGCCGCGC CACCACCGTC
GGCGTCAGCA TCGGCGTGGC GGTTTGGCCC AAGCACGGTG ACAGCGCCGA CACCCTGTTC
AAGAACGCCG ACATCGCACT CTACCGGGCC AAGGATTCCG GGCGGAACAC CTTCCGTTTT
TACGAGAGCG GGATGGCTCT CGCGGTCGTG ACCCGCAACC TCCTGGAAAT CGAGATGCGC
GAGTCGATCC GCTCCGGCGG GTTCGCGCTG CATTACCAAC CGATCTTCGC CCTTGCGGAC
GGTGCACCGC AGGGCTTCGA GGCTCTGTTG CGCTGGAATC ATCCGTTGCG CGGGCCGATC
TCGCCGGGGG CCTTCATCCC GCTGGCGGAG GAGAGCGGCC TCATCACACA GCTCGGCGCA
TGGGCGCTGC ACGAGGCCTG CCGCGAGGCG GCCTCCTGGC CGGGCGATCT GCGGGTCGCC
GTCAACGTCT CGGCGGTGCA GTTCCGCAAG ACCGGGCTGG AGCAGAGCGT CATGCGCGCG
CTGGCCGCGT CGGGCCTACC GGCCGGGCGG CTCGAACTGG AGATCACCGA AAGCGTGCTG
ATGCAGGATT CGGACGCCGT GATCGGTTCT CTCCACCGCC TGCGCGCCAT GGGCGTGCGG
ATCGCACTCG ACGATTTCGG CACGGGCTAC TCGTCCCTGA GCTACCTGTG CCGGTTCCCC
TTCGACAAGA TCAAGATCGA TCGCGCCTTC ATCCGCGACA TCGACGAGCC CGTGGCGGCG
GCGGTGGTGC GCGCGGTGGT GGGCTTGGGC GAGCGCCTCG GCATGGCCAT CACCGCCGAG
GGCGTGGAGA CGGAGGAGCA GTTGGTGCAG GTGCGGCGCA AGGGCTGCAC CGAGGTGCAG
GGCTTCCTGC TCGGCCGCCC GTTGCCGGCC GCGGAGGCCA TGACCCTCGT CGCGGGGCGG
GTGGCGGCCT GA
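
The gene sequence above is printed in blocks of 10 bases, 60 per line, so whitespace has to be stripped before any computation. The following is a minimal sketch of that clean-up together with a GC-fraction calculation of the kind that would underlie the "GC content" field; how the database itself computes or rounds that field is not stated here, and the helper names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a (possibly blocked) DNA sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence listing, copied verbatim.
    first_line = (
        "ATGTCCGCGT CCCGTCCGGT GCCACACAGC "
        "GATCGTGGAG CCGCCGCACC CGTCCAGAAC"
    )
    print(len(clean_sequence(first_line)))
    print(f"{gc_fraction(first_line):.1%}")
```

To reproduce the whole-gene figure, pass the full listing (all lines of the gene sequence) to `gc_fraction` as one string.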
 
Protein sequence
MSASRPVPHS DRGAAAPVQN APREDGLESR LDALCRSVAE VFAVPMAAIA LIDADRIRFR 
ARYGLAEAVI DRDDALCHHT IHQPRGHALV VPDLIRDERF VHSPLVVGAP HARFYAGLAI
RSGAGRVVGT LCLMDRVPRD DVSSDRVRVL QELALVAEAH LELDEARRAS EAAERRRAEA
HLLEWEARQR ALEAAHAMAE QIAAFGHWRI DAATRTIAWS DGIARIFGRN AERATLPLET
HIGFYHPDDR ERVWAAMDEA LAGRSRTLGG GYEHRSRILR PDGEIRVVAV HGIGEHDEAG
RLVSIFGVCL DVTGMARSEQ RLRETGEAMR AALEAMDQGL VMIGPDDRVQ VHNQRVRDLL
ELPEDVLHEG VSYRAVRRFL GRRGEFMHAP PEAQEWLEHG DFPPGVQRYE GMRPNGTILE
VRHAPMASGC HICTFTDLTA SRQSEAALRS AEADYQSLFQ NAVIGVYRAR LDGGIVQANR
ALARLHGYGD ADLSLPEGGF SHDWYIEPGR HEAFLACLER EGHVEDFVSE VRRHAGGERI
WVSETAWVVR DAAGRPIWFE GTVADATERK RAQALIEHMA RHDALTGLPN RRLFQETLAR
EIDGARRDGG SVVVLCCDLD RFKAVNDTFG HPAGDALLRV IAGRLRATLR EGDVVARLGG
DEFAIILPSR GKQRRIAAFA RRLIQAAGRP VDLGGRATTV GVSIGVAVWP KHGDSADTLF
KNADIALYRA KDSGRNTFRF YESGMALAVV TRNLLEIEMR ESIRSGGFAL HYQPIFALAD
GAPQGFEALL RWNHPLRGPI SPGAFIPLAE ESGLITQLGA WALHEACREA ASWPGDLRVA
VNVSAVQFRK TGLEQSVMRA LAASGLPAGR LELEITESVL MQDSDAVIGS LHRLRAMGVR
IALDDFGTGY SSLSYLCRFP FDKIKIDRAF IRDIDEPVAA AVVRAVVGLG ERLGMAITAE
GVETEEQLVQ VRRKGCTEVQ GFLLGRPLPA AEAMTLVAGR VAA
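
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the standard NCBI base ordering (TCAG); translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. Because this gene begins with ATG, alternative start codons are not handled here, which is a simplification of this example. The function and constant names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in NCBI order (first base slowest, third base fastest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a (possibly blocked) DNA sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First line of the gene sequence listing, copied verbatim.
    first_line = (
        "ATGTCCGCGT CCCGTCCGGT GCCACACAGC "
        "GATCGTGGAG CCGCCGCACC CGTCCAGAAC"
    )
    print(translate(first_line))
```

The first 60 bases translate to the first 20 residues of the protein listing (MSASRPVPHS DRGAAAPVQN), and the final 12 bases (GTGGCGGCCT GA) translate to VAA followed by the TGA stop codon, matching the end of the protein sequence.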