Gene Mext_2032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_2032 
Symbol 
ID5834987 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp2266809 
End bp2269739 
Gene Length2931 bp 
Protein Length976 aa 
Translation table11 
GC content69% 
IMG OID641367830 
Productdiguanylate cyclase 
Protein accessionYP_001639499 
Protein GI163851456 
COG category[T] Signal transduction mechanisms 
COG ID[COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCGCACGC TTTCACTGGC CCTTGCCCTG GCCCTCGCGA TCCTGACGGC CGTGGCGGGC 
TCCCTCGCCG GCCTGATCGG GACCGCCGCC CCCGCCCATG CGGTCGAGGC CGTGCGCGTC
ACCCTCGACG CGCCCGCCAT CGACTTGACG CCGACGATCG AGCGCTACCG CTCGGACGGC
GACCTGATCC AGATCTCCAC CGCCCCGGGC AAGGACGGGA TCGTGCGGCG CATCGCGGTC
AAAGCGCGCG AGGCGGGCGC GCGGCCGGAC TGGATGGTGT TCGCGCTCAC CAACGACACC
GACGAGCAGA TCGACCGCCT GCTGGTCGCC CCGCATTTCC GCCTCGTCGG CTCCGGGGTG
ATCTGGCCCG ATCTCGGCGG CTCGCGCATC GCCGCGATCA CCGCGAGCGA GGGCATCCGG
CCCGAGCGCG ACGAGAGCCT CGACGCCGAC CAGTTCCTGA TCACCATCGA TCCCGGCACC
ACGGTGACCT ACGTCGCCGA GCTGAAAGGC CCGAACATCC CGCAGGTCTA CCTCTGGGAT
CAGGACGCCT ACCGCAAGAA GACCTCGGGG CTGACGCTCT ACAAGGGCAT CATCATCGGC
ATTGCCGGGC TCTTGGCGCT GTTCCTCACC ATCATCTTCG TGGTGAAGGG CGCGATCATC
TTCCCCGCCG CCGCCGCGCT CTCCTGGGCG GTGCTGGCCT ATGCCTGCAT CGATTTCGGC
TTCCTGCAAC GGGTGTTTCC CGTCACCGAA CTCGCCGAGC GGATCTACCG CGCCTCGGCC
GAGGCCGTGC TCGGCGCGAC CCTGCTGGTG TTCCTGTTCG CCTACCTGAA CCTGTCGCGC
TGGCACGTGC GCTACAGTCA CGTCGCCTTT TTCTGGCTCA CCTTCCTCGC CGGCCTCGTG
GCGCTTGCGG TGTTCGACCC GCCCGTGGCG GCGGGCGTGG CGCGCATCTC CATCGCCGCG
GTGGCTGGCG TCGGACTCCT GCTGATCCTC TATCTCGCCG TCCATAACGG CTACGACCGG
GCCATCCTTC TGGTGCCGAC CTGGCTGCTG CTGCTGGTCT GGGTGGTGGC GGCGGGCTTT
GCCATCACTG GGCAGATCGG CTCCGACCTC GTGCAGCCGG CGCTCATCGG CGGCCTCGTG
CTCATCGTCA TGCTGATCGG CTTCACGGTG ATGCAGCACG CCTTCGCCGG CGGCGGCCTC
AGCCACAGCC TCGTCTCGGA CACCGAGCGC CGGGCGCTGG CGCTGACGGG TGCGGGCGAC
ATCGTGTTCG ACTGGGACGT GCCGGGTGAC CGCGTTTTCG CCGGCCCCGA GATCGAGGCC
CAGCTCGGGC TCAAGCGCGG CGCCCTCGAA GGACCGGCAG CGAACTGGCT CGGCCTGCTC
CATCCCTTCG ACGTGGAGCG CTACTCGGCC GCCCTCGACA CCGTGATCGA GGAGCGGCGC
GGGCGCATCA CCCACGATTT CCGCCTCCGT TCGGCTGACG GTCCCTACGC GTGGTACCGG
CTGAAGGCGC GGCCGGTGAT CGGCACCGAT GGCGAGGTGA TCCGCATCGT CGGCACGATC
AGCGACGTGA CCGAGGCGAA GACCGCCGAG GAGCGGCTGC TGCACGACGC GGTTCACGAC
AGCCTCACCG GCCTGCCGAA CCGCGAGCTG TTCCACGACC GGCTGGAGGC CGCGCTGGCG
ATGGCGAGCC AGGATCCGCG CCTCAAGCCC GCGGTGATCG CCCTCGACAT CGACCGGTTC
AAGGCGATCA ACGACGCCAT CGGCCTCTCG GCGGGTGACT CGATCCTGCT GACGCTCTCG
CGCCGGCTCG GGCGGCTGCT GCGGCCGCAG GACACGCTCG CGCGGGTCGC GGGCGACGAA
TTCGCGGTGA TCCTGCTCTC GGAGCGCGAG CCCGACCGCA TCCTCTCGTT CGCCGAGATG
ATCCGGCGCG CCATCGCCAC CCCGGTCACC TATGCCGACC GCGAGGTGTT CCTCACCGTC
TCGATCGGCA TCGCGTTGCA CGAGGCCACG CAAGGCGTGG GTAACCAGGG CAGCGGGCAG
ACGCGGCGCG AAGAGGTGTT CAAGAACGCC GAGATGGCGA TGATCCAAGC CAAGCGCGGC
GGCGGCGACC GGATCGAGGT GTTCCGCGCC AACATGCGCC TCGAACGCTC CGACCGGCTG
ATGCTGGAGG CGGACCTGCG CAAGGCGCTG GAGCGCAACG AGATCAAGGT GCTGTTCCAG
CCGATCGTCC GGCTCGAAGA CCGCACGGTC GCCGGCTTCG AGACGCTGCT GCGCTGGGAC
CATCCGAAGC TCGGACGCAT CCCACCCTCG ACCTTCCTGC CGGTGGCGGA AGAGACCGGC
GTGATCGTCC CGCTCGGCAA TTTCGCCATC GAGCGCACGG CGCTGGAACT CGCCGCCTGG
CAGCGCTCGC TCGACGTCGA ACCGCCGATC TTCGCCTCGG TCAACGTCTC CTCGCGCCAG
CTCCTGCGCC ACGACCTCCT GCACGACGTG AAGACGGTGA TCGCCCGCAC CGGCGTGCTG
CCCGGCTCGC TCAAGCTGGA GATGGCCGAG GGGCTGGTGA TGGAGAACCC GGAATACGCC
GCCCAGATGC TCACCCGCAT CCACGATCTC GGCGCCGGCC TCGTCCTCGA CGATTTCGGC
ACCGGCTACT CGGCAATCTC CTACCTCCAG CGCTTCCCCT TCGACACGAT CAAGATCGAC
CAGAGCTTCG TGCGCCAGAT GGGCCAGGGC CGCACCGCCA TGCTGCGCTC GGTCCTGCGG
ATGGGGCAGG AACTCGGTCT GGCCACCATC GCCGAGGGTG CCGAGTCGGA GGAGGATGCG
CAGGTGTTGC AGGAGTTCGG CTGCGATTAC GCGCAAGGGG CGGCCTTCGG CGAGCCGATG
ACCGTGCTCC AGGCCCGCCA GCTCGTCGGC GCCGCGCCCG AAGCGGCCTG A
 
Protein sequence
MRTLSLALAL ALAILTAVAG SLAGLIGTAA PAHAVEAVRV TLDAPAIDLT PTIERYRSDG 
DLIQISTAPG KDGIVRRIAV KAREAGARPD WMVFALTNDT DEQIDRLLVA PHFRLVGSGV
IWPDLGGSRI AAITASEGIR PERDESLDAD QFLITIDPGT TVTYVAELKG PNIPQVYLWD
QDAYRKKTSG LTLYKGIIIG IAGLLALFLT IIFVVKGAII FPAAAALSWA VLAYACIDFG
FLQRVFPVTE LAERIYRASA EAVLGATLLV FLFAYLNLSR WHVRYSHVAF FWLTFLAGLV
ALAVFDPPVA AGVARISIAA VAGVGLLLIL YLAVHNGYDR AILLVPTWLL LLVWVVAAGF
AITGQIGSDL VQPALIGGLV LIVMLIGFTV MQHAFAGGGL SHSLVSDTER RALALTGAGD
IVFDWDVPGD RVFAGPEIEA QLGLKRGALE GPAANWLGLL HPFDVERYSA ALDTVIEERR
GRITHDFRLR SADGPYAWYR LKARPVIGTD GEVIRIVGTI SDVTEAKTAE ERLLHDAVHD
SLTGLPNREL FHDRLEAALA MASQDPRLKP AVIALDIDRF KAINDAIGLS AGDSILLTLS
RRLGRLLRPQ DTLARVAGDE FAVILLSERE PDRILSFAEM IRRAIATPVT YADREVFLTV
SIGIALHEAT QGVGNQGSGQ TRREEVFKNA EMAMIQAKRG GGDRIEVFRA NMRLERSDRL
MLEADLRKAL ERNEIKVLFQ PIVRLEDRTV AGFETLLRWD HPKLGRIPPS TFLPVAEETG
VIVPLGNFAI ERTALELAAW QRSLDVEPPI FASVNVSSRQ LLRHDLLHDV KTVIARTGVL
PGSLKLEMAE GLVMENPEYA AQMLTRIHDL GAGLVLDDFG TGYSAISYLQ RFPFDTIKID
QSFVRQMGQG RTAMLRSVLR MGQELGLATI AEGAESEEDA QVLQEFGCDY AQGAAFGEPM
TVLQARQLVG AAPEAA