Gene Mext_3803 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_3803 
Symbol 
ID5834730 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4222153 
End bp4223622 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content66% 
IMG OID641369595 
Productdiguanylate cyclase 
Protein accessionYP_001641248 
Protein GI163853205 
COG category[T] Signal transduction mechanisms 
COG ID[COG2199] FOG: GGDEF domain
[COG2203] FOG: GAF domain 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0126112 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGGTGG AGCGGCACAT GTCGGACAAT CAGCGCGAGG CACTTCGCCA AAATGCGTTG 
GCCGAGATCG CCCTCCTCGA CACGCCACCC GAGCGCGAGT TCGATACACT CGCGAAGCTT
GCACAACGCA TGCTTGGCAC CGGCATGTCT TCGATCACCC TGATTGCTCC GGAGCGCCAA
TGGTTCAAGG CACGCTGCGG TCCCCTGGCA CCGGAGACGA CGCGGGCTCA GGCGTTCTGC
CCGGTCGTGG TCGAGACCGA GGCGCCGCTC ACCGTAGCGG ACGCTCGCCT CGACCCTCGC
TTCGCCGAAA GCCCATTTGT CACAGGCTCA CCGTATATCC GCTATTACGC GGGCGTCCCG
TTGCGGGTCC GGCGGCCAGA CAGCGGCCAC GTTACGATCG GCACGCTCTG CGTCCTTGAC
GAGCGGCCGC GCGAGCCGAC CTCAACCGAC TTGGAGGTTC TAGAAGAGTT GGCCTGCGTC
GCCGAAGCCT TGATTGAGGC CCGGGCCGTC GCTCTCCGTG CTGCCAAGGC TGCCGAAGAG
CACCGTCGAG CCGTAGAGCG GCTGGAGCGC GAACGCCGCC AGTTCAAGCA GGCGGAGCGC
ATGGCCGACA TGGGATCGTA CCGATACGAC ATCGAGAAGC AGTTCACCCT CTGGTCGGAC
GGTGTCTTCG CCATCCACGA ACGGCCCGTC AGCGCCGGTG TGCCGAACGG CGAGATCATG
AACTATTTCC CCGAGCCCGA TCGCTCCCTA TTCGTCGCCG CGGTCAGGCG CACGCTGGAC
ACGGGCGAGC CGTTCGAGAT GGACGGCGAC TTCATAACCG CCAAAGGCAA CGCGCGGCGC
GTGCGGTACT CCTGCGAGAT CGAGTTGGCC AAGGGCAAAC CCGTTGCCCT CATCGGTCTG
ATCCAAGACA TCACCGAACG GCACGGCTTG GAGCAGCGCC TGCGCCACCT CGCTTGCACC
GACGACCTGA CCCAGCTGGC CAACCGGGCC GAGTTTCACC GTGTTCTCGA TGGACGGCTG
CACGAGGCGC GCGCTGCGGA CGACGACGTG GCCGTGCTTC TGATCGACCT CGACGGCTTC
AAAGGCGTCA ACGACGTCCT CGGCCATGCA GCAGGCGACG CGGTGTTGCG CGGTGTCGCC
GATCGGTTGC GCGCTTTCTG CGATGACGGT TGTCTCCCAG CTCGGCTAGG GGGCGACGAG
TTCGCGGTCG TGATGCCGGC CGGTCTCGAT CGCGTAGGAC TTGATCGGAA GGTGCGGCGC
CTCCTGCACG AACTTGAGAT CGTCATGCAC GGACATGGCC ACATCGCCCG TGTGACGGGA
ACAATCGGTA TCGCGTGGTC GAGCGCGGCC GCGCAGAACC GCGACGTCCT CCTTCGTCAG
GCCGATGCGG CGCTCTACGC CGCCAAGCGC ACCCGGAAGG GAACGGCCCA GACCTATCAA
GCCGGCGCGG ACCACCGGAT GGCTGGCTGA
 
Protein sequence
MGVERHMSDN QREALRQNAL AEIALLDTPP EREFDTLAKL AQRMLGTGMS SITLIAPERQ 
WFKARCGPLA PETTRAQAFC PVVVETEAPL TVADARLDPR FAESPFVTGS PYIRYYAGVP
LRVRRPDSGH VTIGTLCVLD ERPREPTSTD LEVLEELACV AEALIEARAV ALRAAKAAEE
HRRAVERLER ERRQFKQAER MADMGSYRYD IEKQFTLWSD GVFAIHERPV SAGVPNGEIM
NYFPEPDRSL FVAAVRRTLD TGEPFEMDGD FITAKGNARR VRYSCEIELA KGKPVALIGL
IQDITERHGL EQRLRHLACT DDLTQLANRA EFHRVLDGRL HEARAADDDV AVLLIDLDGF
KGVNDVLGHA AGDAVLRGVA DRLRAFCDDG CLPARLGGDE FAVVMPAGLD RVGLDRKVRR
LLHELEIVMH GHGHIARVTG TIGIAWSSAA AQNRDVLLRQ ADAALYAAKR TRKGTAQTYQ
AGADHRMAG