Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_3063 |
Symbol | |
ID | 5835427 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 3406649 |
End bp | 3409660 |
Gene Length | 3012 bp |
Protein Length | 1003 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641368863 |
Product | diguanylate cyclase |
Protein accession | YP_001640523 |
Protein GI | 163852480 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain |
TIGRFAM ID | [TIGR00229] PAS domain S-box [TIGR00254] diguanylate cyclase (GGDEF) domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.964207 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGCGT CCCGTCCGGT GCCACACAGC GATCGTGGAG CCGCCGCACC CGTCCAGAAC GCGCCGCGAG AGGACGGGCT GGAAAGCCGC CTCGACGCCC TGTGCCGCTC CGTCGCCGAG GTCTTCGCCG TCCCCATGGC GGCGATCGCG CTCATCGACG CGGATCGCAT CCGTTTCCGC GCCCGCTACG GGCTCGCGGA GGCCGTCATC GACCGCGACG ACGCCCTCTG CCATCACACA ATCCACCAGC CGCGCGGTCA CGCTCTCGTG GTGCCGGACC TGATCCGTGA CGAGCGTTTC GTCCATTCTC CCCTCGTCGT CGGCGCGCCG CATGCCCGCT TCTACGCGGG TCTGGCGATC CGCTCGGGAG CGGGCCGGGT CGTCGGCACG CTGTGCCTGA TGGATCGGGT GCCGCGGGAC GATGTCTCGT CCGACCGCGT GCGCGTCCTG CAAGAATTGG CGCTCGTCGC CGAGGCGCAT CTCGAGCTCG ACGAAGCCCG GCGCGCGAGC GAGGCCGCGG AACGCCGGCG GGCGGAGGCG CATCTCTTGG AATGGGAGGC GCGCCAGAGG GCGCTCGAGG CGGCCCACGC CATGGCCGAA CAGATCGCCG CCTTCGGCCA TTGGCGGATC GATGCGGCCA CCCGCACCAT CGCGTGGTCG GACGGGATCG CCCGCATCTT CGGGCGCAAC GCCGAACGCG CGACGCTGCC GCTCGAAACC CATATCGGCT TCTATCATCC GGATGATCGC GAACGCGTCT GGGCCGCCAT GGACGAGGCG CTCGCGGGCC GCAGCCGGAC CCTGGGGGGT GGCTACGAGC ACCGCTCTCG CATCCTTCGT CCCGACGGCG AGATCCGGGT GGTGGCCGTT CACGGGATCG GTGAACACGA TGAGGCGGGC CGGCTCGTCT CGATCTTCGG CGTCTGCCTC GATGTCACCG GCATGGCCCG CTCCGAGCAG CGCCTGCGCG AGACCGGCGA GGCAATGCGG GCCGCTCTCG AGGCGATGGA TCAGGGCCTT GTCATGATCG GACCCGACGA CCGGGTTCAG GTCCACAACC AGCGCGTCCG CGATCTCCTC GAACTGCCAG AGGACGTCCT GCACGAGGGT GTGTCCTACC GGGCGGTGCG GCGCTTTCTC GGCCGGCGCG GCGAGTTCAT GCATGCGCCG CCCGAAGCCC AGGAATGGCT GGAGCACGGT GACTTCCCGC CCGGCGTCCA ACGCTACGAG GGGATGCGGC CCAACGGCAC GATCCTGGAG GTGCGGCACG CTCCGATGGC CTCCGGCTGC CACATCTGCA CCTTCACCGA CCTGACGGCG TCTCGGCAGA GCGAGGCGGC CCTGCGCTCG GCCGAGGCCG ATTACCAGTC GTTGTTTCAG AATGCGGTGA TCGGCGTCTA TCGGGCCCGG CTCGACGGCG GCATCGTCCA AGCCAACCGG GCGCTCGCCC GGCTGCACGG CTACGGCGAT GCGGACCTAT CCCTGCCTGA GGGCGGCTTC AGCCACGACT GGTACATCGA GCCGGGCCGG CACGAGGCCT TTCTGGCCTG CCTGGAGCGC GAGGGCCACG TCGAGGACTT CGTATCCGAG GTGCGCCGCC ACGCGGGCGG GGAACGCATC TGGGTCTCCG AGACGGCCTG GGTGGTGCGC GACGCGGCGG GCCGGCCGAT CTGGTTCGAG GGCACCGTCG CGGATGCGAC GGAGCGCAAG CGTGCCCAGG CGCTGATCGA GCACATGGCC CGCCACGACG CGCTGACCGG GCTGCCCAAC CGGCGGCTGT TCCAGGAGAC TCTGGCCCGG GAGATCGACG GGGCCCGGCG CGACGGCGGC TCGGTGGTGG TGCTGTGCTG CGACCTCGAC CGCTTCAAGG CGGTCAACGA CACCTTCGGG CATCCCGCGG GCGACGCCCT GCTCCGCGTC ATCGCGGGCC GCCTCCGCGC GACCCTGCGC GAGGGCGACG TGGTGGCCCG GCTCGGCGGG GACGAGTTCG CGATCATCCT GCCAAGCCGA GGGAAGCAGC GCCGTATCGC CGCCTTCGCC CGCCGGCTGA TCCAGGCCGC CGGGCGGCCG GTCGATCTCG GCGGCCGCGC CACCACCGTC GGCGTCAGCA TCGGCGTGGC GGTTTGGCCC AAGCACGGTG ACAGCGCCGA CACCCTGTTC AAGAACGCCG ACATCGCACT CTACCGGGCC AAGGATTCCG GGCGGAACAC CTTCCGTTTT TACGAGAGCG GGATGGCTCT CGCGGTCGTG ACCCGCAACC TCCTGGAAAT CGAGATGCGC GAGTCGATCC GCTCCGGCGG GTTCGCGCTG CATTACCAAC CGATCTTCGC CCTTGCGGAC GGTGCACCGC AGGGCTTCGA GGCTCTGTTG CGCTGGAATC ATCCGTTGCG CGGGCCGATC TCGCCGGGGG CCTTCATCCC GCTGGCGGAG GAGAGCGGCC TCATCACACA GCTCGGCGCA TGGGCGCTGC ACGAGGCCTG CCGCGAGGCG GCCTCCTGGC CGGGCGATCT GCGGGTCGCC GTCAACGTCT CGGCGGTGCA GTTCCGCAAG ACCGGGCTGG AGCAGAGCGT CATGCGCGCG CTGGCCGCGT CGGGCCTACC GGCCGGGCGG CTCGAACTGG AGATCACCGA AAGCGTGCTG ATGCAGGATT CGGACGCCGT GATCGGTTCT CTCCACCGCC TGCGCGCCAT GGGCGTGCGG ATCGCACTCG ACGATTTCGG CACGGGCTAC TCGTCCCTGA GCTACCTGTG CCGGTTCCCC TTCGACAAGA TCAAGATCGA TCGCGCCTTC ATCCGCGACA TCGACGAGCC CGTGGCGGCG GCGGTGGTGC GCGCGGTGGT GGGCTTGGGC GAGCGCCTCG GCATGGCCAT CACCGCCGAG GGCGTGGAGA CGGAGGAGCA GTTGGTGCAG GTGCGGCGCA AGGGCTGCAC CGAGGTGCAG GGCTTCCTGC TCGGCCGCCC GTTGCCGGCC GCGGAGGCCA TGACCCTCGT CGCGGGGCGG GTGGCGGCCT GA
|
Protein sequence | MSASRPVPHS DRGAAAPVQN APREDGLESR LDALCRSVAE VFAVPMAAIA LIDADRIRFR ARYGLAEAVI DRDDALCHHT IHQPRGHALV VPDLIRDERF VHSPLVVGAP HARFYAGLAI RSGAGRVVGT LCLMDRVPRD DVSSDRVRVL QELALVAEAH LELDEARRAS EAAERRRAEA HLLEWEARQR ALEAAHAMAE QIAAFGHWRI DAATRTIAWS DGIARIFGRN AERATLPLET HIGFYHPDDR ERVWAAMDEA LAGRSRTLGG GYEHRSRILR PDGEIRVVAV HGIGEHDEAG RLVSIFGVCL DVTGMARSEQ RLRETGEAMR AALEAMDQGL VMIGPDDRVQ VHNQRVRDLL ELPEDVLHEG VSYRAVRRFL GRRGEFMHAP PEAQEWLEHG DFPPGVQRYE GMRPNGTILE VRHAPMASGC HICTFTDLTA SRQSEAALRS AEADYQSLFQ NAVIGVYRAR LDGGIVQANR ALARLHGYGD ADLSLPEGGF SHDWYIEPGR HEAFLACLER EGHVEDFVSE VRRHAGGERI WVSETAWVVR DAAGRPIWFE GTVADATERK RAQALIEHMA RHDALTGLPN RRLFQETLAR EIDGARRDGG SVVVLCCDLD RFKAVNDTFG HPAGDALLRV IAGRLRATLR EGDVVARLGG DEFAIILPSR GKQRRIAAFA RRLIQAAGRP VDLGGRATTV GVSIGVAVWP KHGDSADTLF KNADIALYRA KDSGRNTFRF YESGMALAVV TRNLLEIEMR ESIRSGGFAL HYQPIFALAD GAPQGFEALL RWNHPLRGPI SPGAFIPLAE ESGLITQLGA WALHEACREA ASWPGDLRVA VNVSAVQFRK TGLEQSVMRA LAASGLPAGR LELEITESVL MQDSDAVIGS LHRLRAMGVR IALDDFGTGY SSLSYLCRFP FDKIKIDRAF IRDIDEPVAA AVVRAVVGLG ERLGMAITAE GVETEEQLVQ VRRKGCTEVQ GFLLGRPLPA AEAMTLVAGR VAA
|
| |