Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_2032 |
Symbol | |
ID | 5834987 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 2266809 |
End bp | 2269739 |
Gene Length | 2931 bp |
Protein Length | 976 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641367830 |
Product | diguanylate cyclase |
Protein accession | YP_001639499 |
Protein GI | 163851456 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain |
TIGRFAM ID | [TIGR00229] PAS domain S-box [TIGR00254] diguanylate cyclase (GGDEF) domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCGCACGC TTTCACTGGC CCTTGCCCTG GCCCTCGCGA TCCTGACGGC CGTGGCGGGC TCCCTCGCCG GCCTGATCGG GACCGCCGCC CCCGCCCATG CGGTCGAGGC CGTGCGCGTC ACCCTCGACG CGCCCGCCAT CGACTTGACG CCGACGATCG AGCGCTACCG CTCGGACGGC GACCTGATCC AGATCTCCAC CGCCCCGGGC AAGGACGGGA TCGTGCGGCG CATCGCGGTC AAAGCGCGCG AGGCGGGCGC GCGGCCGGAC TGGATGGTGT TCGCGCTCAC CAACGACACC GACGAGCAGA TCGACCGCCT GCTGGTCGCC CCGCATTTCC GCCTCGTCGG CTCCGGGGTG ATCTGGCCCG ATCTCGGCGG CTCGCGCATC GCCGCGATCA CCGCGAGCGA GGGCATCCGG CCCGAGCGCG ACGAGAGCCT CGACGCCGAC CAGTTCCTGA TCACCATCGA TCCCGGCACC ACGGTGACCT ACGTCGCCGA GCTGAAAGGC CCGAACATCC CGCAGGTCTA CCTCTGGGAT CAGGACGCCT ACCGCAAGAA GACCTCGGGG CTGACGCTCT ACAAGGGCAT CATCATCGGC ATTGCCGGGC TCTTGGCGCT GTTCCTCACC ATCATCTTCG TGGTGAAGGG CGCGATCATC TTCCCCGCCG CCGCCGCGCT CTCCTGGGCG GTGCTGGCCT ATGCCTGCAT CGATTTCGGC TTCCTGCAAC GGGTGTTTCC CGTCACCGAA CTCGCCGAGC GGATCTACCG CGCCTCGGCC GAGGCCGTGC TCGGCGCGAC CCTGCTGGTG TTCCTGTTCG CCTACCTGAA CCTGTCGCGC TGGCACGTGC GCTACAGTCA CGTCGCCTTT TTCTGGCTCA CCTTCCTCGC CGGCCTCGTG GCGCTTGCGG TGTTCGACCC GCCCGTGGCG GCGGGCGTGG CGCGCATCTC CATCGCCGCG GTGGCTGGCG TCGGACTCCT GCTGATCCTC TATCTCGCCG TCCATAACGG CTACGACCGG GCCATCCTTC TGGTGCCGAC CTGGCTGCTG CTGCTGGTCT GGGTGGTGGC GGCGGGCTTT GCCATCACTG GGCAGATCGG CTCCGACCTC GTGCAGCCGG CGCTCATCGG CGGCCTCGTG CTCATCGTCA TGCTGATCGG CTTCACGGTG ATGCAGCACG CCTTCGCCGG CGGCGGCCTC AGCCACAGCC TCGTCTCGGA CACCGAGCGC CGGGCGCTGG CGCTGACGGG TGCGGGCGAC ATCGTGTTCG ACTGGGACGT GCCGGGTGAC CGCGTTTTCG CCGGCCCCGA GATCGAGGCC CAGCTCGGGC TCAAGCGCGG CGCCCTCGAA GGACCGGCAG CGAACTGGCT CGGCCTGCTC CATCCCTTCG ACGTGGAGCG CTACTCGGCC GCCCTCGACA CCGTGATCGA GGAGCGGCGC GGGCGCATCA CCCACGATTT CCGCCTCCGT TCGGCTGACG GTCCCTACGC GTGGTACCGG CTGAAGGCGC GGCCGGTGAT CGGCACCGAT GGCGAGGTGA TCCGCATCGT CGGCACGATC AGCGACGTGA CCGAGGCGAA GACCGCCGAG GAGCGGCTGC TGCACGACGC GGTTCACGAC AGCCTCACCG GCCTGCCGAA CCGCGAGCTG TTCCACGACC GGCTGGAGGC CGCGCTGGCG ATGGCGAGCC AGGATCCGCG CCTCAAGCCC GCGGTGATCG CCCTCGACAT CGACCGGTTC AAGGCGATCA ACGACGCCAT CGGCCTCTCG GCGGGTGACT CGATCCTGCT GACGCTCTCG CGCCGGCTCG GGCGGCTGCT GCGGCCGCAG GACACGCTCG CGCGGGTCGC GGGCGACGAA TTCGCGGTGA TCCTGCTCTC GGAGCGCGAG CCCGACCGCA TCCTCTCGTT CGCCGAGATG ATCCGGCGCG CCATCGCCAC CCCGGTCACC TATGCCGACC GCGAGGTGTT CCTCACCGTC TCGATCGGCA TCGCGTTGCA CGAGGCCACG CAAGGCGTGG GTAACCAGGG CAGCGGGCAG ACGCGGCGCG AAGAGGTGTT CAAGAACGCC GAGATGGCGA TGATCCAAGC CAAGCGCGGC GGCGGCGACC GGATCGAGGT GTTCCGCGCC AACATGCGCC TCGAACGCTC CGACCGGCTG ATGCTGGAGG CGGACCTGCG CAAGGCGCTG GAGCGCAACG AGATCAAGGT GCTGTTCCAG CCGATCGTCC GGCTCGAAGA CCGCACGGTC GCCGGCTTCG AGACGCTGCT GCGCTGGGAC CATCCGAAGC TCGGACGCAT CCCACCCTCG ACCTTCCTGC CGGTGGCGGA AGAGACCGGC GTGATCGTCC CGCTCGGCAA TTTCGCCATC GAGCGCACGG CGCTGGAACT CGCCGCCTGG CAGCGCTCGC TCGACGTCGA ACCGCCGATC TTCGCCTCGG TCAACGTCTC CTCGCGCCAG CTCCTGCGCC ACGACCTCCT GCACGACGTG AAGACGGTGA TCGCCCGCAC CGGCGTGCTG CCCGGCTCGC TCAAGCTGGA GATGGCCGAG GGGCTGGTGA TGGAGAACCC GGAATACGCC GCCCAGATGC TCACCCGCAT CCACGATCTC GGCGCCGGCC TCGTCCTCGA CGATTTCGGC ACCGGCTACT CGGCAATCTC CTACCTCCAG CGCTTCCCCT TCGACACGAT CAAGATCGAC CAGAGCTTCG TGCGCCAGAT GGGCCAGGGC CGCACCGCCA TGCTGCGCTC GGTCCTGCGG ATGGGGCAGG AACTCGGTCT GGCCACCATC GCCGAGGGTG CCGAGTCGGA GGAGGATGCG CAGGTGTTGC AGGAGTTCGG CTGCGATTAC GCGCAAGGGG CGGCCTTCGG CGAGCCGATG ACCGTGCTCC AGGCCCGCCA GCTCGTCGGC GCCGCGCCCG AAGCGGCCTG A
|
Protein sequence | MRTLSLALAL ALAILTAVAG SLAGLIGTAA PAHAVEAVRV TLDAPAIDLT PTIERYRSDG DLIQISTAPG KDGIVRRIAV KAREAGARPD WMVFALTNDT DEQIDRLLVA PHFRLVGSGV IWPDLGGSRI AAITASEGIR PERDESLDAD QFLITIDPGT TVTYVAELKG PNIPQVYLWD QDAYRKKTSG LTLYKGIIIG IAGLLALFLT IIFVVKGAII FPAAAALSWA VLAYACIDFG FLQRVFPVTE LAERIYRASA EAVLGATLLV FLFAYLNLSR WHVRYSHVAF FWLTFLAGLV ALAVFDPPVA AGVARISIAA VAGVGLLLIL YLAVHNGYDR AILLVPTWLL LLVWVVAAGF AITGQIGSDL VQPALIGGLV LIVMLIGFTV MQHAFAGGGL SHSLVSDTER RALALTGAGD IVFDWDVPGD RVFAGPEIEA QLGLKRGALE GPAANWLGLL HPFDVERYSA ALDTVIEERR GRITHDFRLR SADGPYAWYR LKARPVIGTD GEVIRIVGTI SDVTEAKTAE ERLLHDAVHD SLTGLPNREL FHDRLEAALA MASQDPRLKP AVIALDIDRF KAINDAIGLS AGDSILLTLS RRLGRLLRPQ DTLARVAGDE FAVILLSERE PDRILSFAEM IRRAIATPVT YADREVFLTV SIGIALHEAT QGVGNQGSGQ TRREEVFKNA EMAMIQAKRG GGDRIEVFRA NMRLERSDRL MLEADLRKAL ERNEIKVLFQ PIVRLEDRTV AGFETLLRWD HPKLGRIPPS TFLPVAEETG VIVPLGNFAI ERTALELAAW QRSLDVEPPI FASVNVSSRQ LLRHDLLHDV KTVIARTGVL PGSLKLEMAE GLVMENPEYA AQMLTRIHDL GAGLVLDDFG TGYSAISYLQ RFPFDTIKID QSFVRQMGQG RTAMLRSVLR MGQELGLATI AEGAESEEDA QVLQEFGCDY AQGAAFGEPM TVLQARQLVG AAPEAA
|
| |