Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_2774 |
Symbol | |
ID | 5831472 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 3111706 |
End bp | 3114570 |
Gene Length | 2865 bp |
Protein Length | 954 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641368575 |
Product | diguanylate cyclase |
Protein accession | YP_001640236 |
Protein GI | 163852193 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain |
TIGRFAM ID | [TIGR00229] PAS domain S-box [TIGR00254] diguanylate cyclase (GGDEF) domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.158121 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCTTC CACGCCGACC CTTTGGCTTT TCGCTCGGGC GCACCGCCGT CGGGGCCGTG CTGCGCGGTG GTCGCACGAT TCGCGGCCGC ATCTTCCTCG CCTTCCTGAT GCTGAGCGCG ATCACGGCGG CACTCGGCGG CTACGCCGCC TTCGGCATCA TGACGACGGG TGCGCTGGTC GACAAAACCT ACGATCGCTC TCTGATGGCG ATCAATTACG CCCGGGCTGC TGCGACTGAC CTCGCGATGC TCCAGGCTGC GGTGGCCCGA GCCCGCCTCG CGGGCGATAG CGCCGAGCGG CGGACGCTCG AAGCGCGGAT CGAGGCGCTG ACGCAATCGT TCGAAGAAGA TCTGGAGATC GCCGCGGACC GGGCCCAGTC GGAGCGTGCG ACGCGCGAGG CCGCCGCCGC CAAAGACGCC GTCGCGCACT GGCTCGCCGC CCGGAGGGAA TTCGGCCCCG ACCAACAGAC GGCGGATGTC TGGCAGGCGC TCGATGCCCA CGCAGCGGTG GCCGAGCAGC ATATCGACCT GCTGATCAAC TTCACCGCCG GGGACGGCTT CTCCTATCGG CAGACGGCCC GCGCGGACGT GGCCCGCGAC GTCGGGCTCA GCATCTTCGC GACCTCGCTC GCGATTGTTC TCTCGGGCAT CGTCGCCTTG CTCCTGTTGC GGCAGATCGT TCGGCCGGTG GCCGACGCCT CGACGGTCGC GAGCCGCATC GCCGCCGGCG AGTTGACCGT GCGCGTGCCC GAGGGCGGGG ACGACGAGTT CGGCGCCCTG CTGCGCGCCA TGGCGGTCAT GCGCGACAAC ATCGCGGCGA TGATGCAGGA GGAGGTCGCG CAGCGCCGCT CCGCCCAAAG CCTGCTCGCT GACGCGGTGC AGGGGTCGAT CGAAGGGATC GTCGTGGTCG ATGCCGCGGG CCGCATCGTG CTCGCGAACG CGCGGGCGGC CGCTTTGCTC GGGATCGATC AGGCCGAGCC CGGCCATCGC CCGCTCTCCG ACGCGCGCGG TTCGGCCGTC GCCGATGCCC TGCTCGCCAT GCCCGGCCAC GCCACCCTGA CCGCCGAGAC GCACACCGCC GACGGGCGCT GGCTGCGCAT CAGCCGCAGT CCCACACGCG AGGGCGGCTT CGTTGCCGTC TGCAGCGACG TCTCCCTGCT GAAGGAGCAG GAGGATCAGC TCAAGCGCAG CAATGCACAG CTCGACGCGG CTCTCAGCAA CATGCTCCAG GGGCTCTGCC TCTACGACGC AGAGGGCGGC CTCCTCGTCT ACAATCGCCG CTTCTGCGAC ATGTTCGGCG TCGATGCCGC CGCGTTGCGT CCCGGCATGA GCATCCGCGA CGTAGCCCGT CTCGTCGAAG CCTCGTCCGA TAATGGCCGC GCGATCGATC TCCTCGTCGA ACAGGAGGCC CTGCTCCAAC GCGGGTTGAG CGCCTCACTG TGTTGCCCGA TCCGGACGGA CTGCATCGTC GCGCTGAAGC AGCAGCCGAC GGCGGAAGGT GGTTGGATCG CCACCTACGA GGACGTCACC CAGCGCTACG AGGCGGAAGC GCGGATTATC ACCATGGCGC GCAAGGACGC GCTGACGGGG CTCGCCAACC GCATGGTGTT CGGCGAGCGC CTGGAAGAGG CCGCCGCGCG TCTCGACGAT GGGGCCGGTG CCGGGTTCGC CACGCTTTGC CTCGATCTCG ACCGGTTCAA GGAAGTCAAC GACACCCTCG GCCATCCCAT CGGCGATGGG TTGCTGCGCA GCGTTGCGGA ACGGCTGCAA GGCTGTCTGC GCGACACCGA TCTCGTGGCG CGTCTCGGCG GCGACGAGTT CGCCATCGTC CAGGCCGGCG TCCAGGCCGG CACGCATGCG CGGCGGGATG CGAGCGCCCT GGCCAAGCGC CTGATCGCGG CGTTCCAGCA GCCCTTCCTG CTCGACGGAC ACACCGTGAC GGTCGGGCTC AGCATCGGCA TCTCGCTCGC ACCGGAACAC GGAACGAGCC CGGAGAAGCT GCTGAAGAGC GCCGATCTCG CGCTCTACCG CGCCAAGGCG ACCGGACGGG GCTGCTGGGC GTTCTTCGAC GAGGAAATGG ACGTCGAACT GCGCAAGCGC CGGGCTCTCG AAAGCGATCT CAAGAAGGCC GTCGGCAACG GCGAGTTCGA GCTCGTGTTC CAGCCGATCG TCAAGCTCGA CCGGCAGCGC ATCGCCAGTT GCGAGGCGCT GCTGCGCTGG CGCCATCCCG AGCGCGGCTA TGTCTCTCCC GCGGATTTCA TTCCCCTGGC GGAGGAGACG GGCACCATTG GCGAGATCGG CGAATGGGTG CTGCGCAAGG CTTGTAGCGA GGCCGCGACC TGGCCCTCGA ACATCCGCGT CGCCGTCAAT GTCTCCGCCG CGCAATTCAA GAACGCGGCG GTCGTCCGGG CGGTGATGGA TGCGCTCGCC GCGAGCGGGT TGCCGGCGCA TCGGCTGGAA CTGGAAATCA CCGAGTCGGT CCTGCTCAAC GACAGCGTGA CGACGCTGGC GACGCTCCAC ACCCTGAAGC GCCTCGGTGT GCGGGTGGCG ATGGACGATT TCGGCACCGG CTTCTCGTCG TTGAGCTACC TGCAGAGCTT CCCGTTCGAC AAGATCAAGA TCGACCAGTC CTTCGTGCGT AACCTCGCCG CGCCGGGCAA CTCGCGGCTG ATCGTGCGCT CCGTGGTCGG CCTCGGCCGC AGCCTCGGCA TCACGACCAC GGCGGAGGGC ATCGAGACCG AGGCGCAACT CGAGCAGCTT CGGCTCGAAG GATGCGACGA GGGTCAGGGC TACCTGTTCA GCCGCCCCGT CCCCTCGGCC ACGATCCGTG AATTGGTCAC GGCACTCGGC CGCAACGCGG CCTGA
|
Protein sequence | MSLPRRPFGF SLGRTAVGAV LRGGRTIRGR IFLAFLMLSA ITAALGGYAA FGIMTTGALV DKTYDRSLMA INYARAAATD LAMLQAAVAR ARLAGDSAER RTLEARIEAL TQSFEEDLEI AADRAQSERA TREAAAAKDA VAHWLAARRE FGPDQQTADV WQALDAHAAV AEQHIDLLIN FTAGDGFSYR QTARADVARD VGLSIFATSL AIVLSGIVAL LLLRQIVRPV ADASTVASRI AAGELTVRVP EGGDDEFGAL LRAMAVMRDN IAAMMQEEVA QRRSAQSLLA DAVQGSIEGI VVVDAAGRIV LANARAAALL GIDQAEPGHR PLSDARGSAV ADALLAMPGH ATLTAETHTA DGRWLRISRS PTREGGFVAV CSDVSLLKEQ EDQLKRSNAQ LDAALSNMLQ GLCLYDAEGG LLVYNRRFCD MFGVDAAALR PGMSIRDVAR LVEASSDNGR AIDLLVEQEA LLQRGLSASL CCPIRTDCIV ALKQQPTAEG GWIATYEDVT QRYEAEARII TMARKDALTG LANRMVFGER LEEAAARLDD GAGAGFATLC LDLDRFKEVN DTLGHPIGDG LLRSVAERLQ GCLRDTDLVA RLGGDEFAIV QAGVQAGTHA RRDASALAKR LIAAFQQPFL LDGHTVTVGL SIGISLAPEH GTSPEKLLKS ADLALYRAKA TGRGCWAFFD EEMDVELRKR RALESDLKKA VGNGEFELVF QPIVKLDRQR IASCEALLRW RHPERGYVSP ADFIPLAEET GTIGEIGEWV LRKACSEAAT WPSNIRVAVN VSAAQFKNAA VVRAVMDALA ASGLPAHRLE LEITESVLLN DSVTTLATLH TLKRLGVRVA MDDFGTGFSS LSYLQSFPFD KIKIDQSFVR NLAAPGNSRL IVRSVVGLGR SLGITTTAEG IETEAQLEQL RLEGCDEGQG YLFSRPVPSA TIRELVTALG RNAA
|
| |