Gene Mext_2774 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_2774 
Symbol 
ID5831472 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp3111706 
End bp3114570 
Gene Length2865 bp 
Protein Length954 aa 
Translation table11 
GC content69% 
IMG OID641368575 
Productdiguanylate cyclase 
Protein accessionYP_001640236 
Protein GI163852193 
COG category[T] Signal transduction mechanisms 
COG ID[COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.158121 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCTTC CACGCCGACC CTTTGGCTTT TCGCTCGGGC GCACCGCCGT CGGGGCCGTG 
CTGCGCGGTG GTCGCACGAT TCGCGGCCGC ATCTTCCTCG CCTTCCTGAT GCTGAGCGCG
ATCACGGCGG CACTCGGCGG CTACGCCGCC TTCGGCATCA TGACGACGGG TGCGCTGGTC
GACAAAACCT ACGATCGCTC TCTGATGGCG ATCAATTACG CCCGGGCTGC TGCGACTGAC
CTCGCGATGC TCCAGGCTGC GGTGGCCCGA GCCCGCCTCG CGGGCGATAG CGCCGAGCGG
CGGACGCTCG AAGCGCGGAT CGAGGCGCTG ACGCAATCGT TCGAAGAAGA TCTGGAGATC
GCCGCGGACC GGGCCCAGTC GGAGCGTGCG ACGCGCGAGG CCGCCGCCGC CAAAGACGCC
GTCGCGCACT GGCTCGCCGC CCGGAGGGAA TTCGGCCCCG ACCAACAGAC GGCGGATGTC
TGGCAGGCGC TCGATGCCCA CGCAGCGGTG GCCGAGCAGC ATATCGACCT GCTGATCAAC
TTCACCGCCG GGGACGGCTT CTCCTATCGG CAGACGGCCC GCGCGGACGT GGCCCGCGAC
GTCGGGCTCA GCATCTTCGC GACCTCGCTC GCGATTGTTC TCTCGGGCAT CGTCGCCTTG
CTCCTGTTGC GGCAGATCGT TCGGCCGGTG GCCGACGCCT CGACGGTCGC GAGCCGCATC
GCCGCCGGCG AGTTGACCGT GCGCGTGCCC GAGGGCGGGG ACGACGAGTT CGGCGCCCTG
CTGCGCGCCA TGGCGGTCAT GCGCGACAAC ATCGCGGCGA TGATGCAGGA GGAGGTCGCG
CAGCGCCGCT CCGCCCAAAG CCTGCTCGCT GACGCGGTGC AGGGGTCGAT CGAAGGGATC
GTCGTGGTCG ATGCCGCGGG CCGCATCGTG CTCGCGAACG CGCGGGCGGC CGCTTTGCTC
GGGATCGATC AGGCCGAGCC CGGCCATCGC CCGCTCTCCG ACGCGCGCGG TTCGGCCGTC
GCCGATGCCC TGCTCGCCAT GCCCGGCCAC GCCACCCTGA CCGCCGAGAC GCACACCGCC
GACGGGCGCT GGCTGCGCAT CAGCCGCAGT CCCACACGCG AGGGCGGCTT CGTTGCCGTC
TGCAGCGACG TCTCCCTGCT GAAGGAGCAG GAGGATCAGC TCAAGCGCAG CAATGCACAG
CTCGACGCGG CTCTCAGCAA CATGCTCCAG GGGCTCTGCC TCTACGACGC AGAGGGCGGC
CTCCTCGTCT ACAATCGCCG CTTCTGCGAC ATGTTCGGCG TCGATGCCGC CGCGTTGCGT
CCCGGCATGA GCATCCGCGA CGTAGCCCGT CTCGTCGAAG CCTCGTCCGA TAATGGCCGC
GCGATCGATC TCCTCGTCGA ACAGGAGGCC CTGCTCCAAC GCGGGTTGAG CGCCTCACTG
TGTTGCCCGA TCCGGACGGA CTGCATCGTC GCGCTGAAGC AGCAGCCGAC GGCGGAAGGT
GGTTGGATCG CCACCTACGA GGACGTCACC CAGCGCTACG AGGCGGAAGC GCGGATTATC
ACCATGGCGC GCAAGGACGC GCTGACGGGG CTCGCCAACC GCATGGTGTT CGGCGAGCGC
CTGGAAGAGG CCGCCGCGCG TCTCGACGAT GGGGCCGGTG CCGGGTTCGC CACGCTTTGC
CTCGATCTCG ACCGGTTCAA GGAAGTCAAC GACACCCTCG GCCATCCCAT CGGCGATGGG
TTGCTGCGCA GCGTTGCGGA ACGGCTGCAA GGCTGTCTGC GCGACACCGA TCTCGTGGCG
CGTCTCGGCG GCGACGAGTT CGCCATCGTC CAGGCCGGCG TCCAGGCCGG CACGCATGCG
CGGCGGGATG CGAGCGCCCT GGCCAAGCGC CTGATCGCGG CGTTCCAGCA GCCCTTCCTG
CTCGACGGAC ACACCGTGAC GGTCGGGCTC AGCATCGGCA TCTCGCTCGC ACCGGAACAC
GGAACGAGCC CGGAGAAGCT GCTGAAGAGC GCCGATCTCG CGCTCTACCG CGCCAAGGCG
ACCGGACGGG GCTGCTGGGC GTTCTTCGAC GAGGAAATGG ACGTCGAACT GCGCAAGCGC
CGGGCTCTCG AAAGCGATCT CAAGAAGGCC GTCGGCAACG GCGAGTTCGA GCTCGTGTTC
CAGCCGATCG TCAAGCTCGA CCGGCAGCGC ATCGCCAGTT GCGAGGCGCT GCTGCGCTGG
CGCCATCCCG AGCGCGGCTA TGTCTCTCCC GCGGATTTCA TTCCCCTGGC GGAGGAGACG
GGCACCATTG GCGAGATCGG CGAATGGGTG CTGCGCAAGG CTTGTAGCGA GGCCGCGACC
TGGCCCTCGA ACATCCGCGT CGCCGTCAAT GTCTCCGCCG CGCAATTCAA GAACGCGGCG
GTCGTCCGGG CGGTGATGGA TGCGCTCGCC GCGAGCGGGT TGCCGGCGCA TCGGCTGGAA
CTGGAAATCA CCGAGTCGGT CCTGCTCAAC GACAGCGTGA CGACGCTGGC GACGCTCCAC
ACCCTGAAGC GCCTCGGTGT GCGGGTGGCG ATGGACGATT TCGGCACCGG CTTCTCGTCG
TTGAGCTACC TGCAGAGCTT CCCGTTCGAC AAGATCAAGA TCGACCAGTC CTTCGTGCGT
AACCTCGCCG CGCCGGGCAA CTCGCGGCTG ATCGTGCGCT CCGTGGTCGG CCTCGGCCGC
AGCCTCGGCA TCACGACCAC GGCGGAGGGC ATCGAGACCG AGGCGCAACT CGAGCAGCTT
CGGCTCGAAG GATGCGACGA GGGTCAGGGC TACCTGTTCA GCCGCCCCGT CCCCTCGGCC
ACGATCCGTG AATTGGTCAC GGCACTCGGC CGCAACGCGG CCTGA
 
Protein sequence
MSLPRRPFGF SLGRTAVGAV LRGGRTIRGR IFLAFLMLSA ITAALGGYAA FGIMTTGALV 
DKTYDRSLMA INYARAAATD LAMLQAAVAR ARLAGDSAER RTLEARIEAL TQSFEEDLEI
AADRAQSERA TREAAAAKDA VAHWLAARRE FGPDQQTADV WQALDAHAAV AEQHIDLLIN
FTAGDGFSYR QTARADVARD VGLSIFATSL AIVLSGIVAL LLLRQIVRPV ADASTVASRI
AAGELTVRVP EGGDDEFGAL LRAMAVMRDN IAAMMQEEVA QRRSAQSLLA DAVQGSIEGI
VVVDAAGRIV LANARAAALL GIDQAEPGHR PLSDARGSAV ADALLAMPGH ATLTAETHTA
DGRWLRISRS PTREGGFVAV CSDVSLLKEQ EDQLKRSNAQ LDAALSNMLQ GLCLYDAEGG
LLVYNRRFCD MFGVDAAALR PGMSIRDVAR LVEASSDNGR AIDLLVEQEA LLQRGLSASL
CCPIRTDCIV ALKQQPTAEG GWIATYEDVT QRYEAEARII TMARKDALTG LANRMVFGER
LEEAAARLDD GAGAGFATLC LDLDRFKEVN DTLGHPIGDG LLRSVAERLQ GCLRDTDLVA
RLGGDEFAIV QAGVQAGTHA RRDASALAKR LIAAFQQPFL LDGHTVTVGL SIGISLAPEH
GTSPEKLLKS ADLALYRAKA TGRGCWAFFD EEMDVELRKR RALESDLKKA VGNGEFELVF
QPIVKLDRQR IASCEALLRW RHPERGYVSP ADFIPLAEET GTIGEIGEWV LRKACSEAAT
WPSNIRVAVN VSAAQFKNAA VVRAVMDALA ASGLPAHRLE LEITESVLLN DSVTTLATLH
TLKRLGVRVA MDDFGTGFSS LSYLQSFPFD KIKIDQSFVR NLAAPGNSRL IVRSVVGLGR
SLGITTTAEG IETEAQLEQL RLEGCDEGQG YLFSRPVPSA TIRELVTALG RNAA