Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_3803 |
Symbol | |
ID | 5834730 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 4222153 |
End bp | 4223622 |
Gene Length | 1470 bp |
Protein Length | 489 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641369595 |
Product | diguanylate cyclase |
Protein accession | YP_001641248 |
Protein GI | 163853205 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2199] FOG: GGDEF domain [COG2203] FOG: GAF domain |
TIGRFAM ID | [TIGR00229] PAS domain S-box [TIGR00254] diguanylate cyclase (GGDEF) domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0126112 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGGTGG AGCGGCACAT GTCGGACAAT CAGCGCGAGG CACTTCGCCA AAATGCGTTG GCCGAGATCG CCCTCCTCGA CACGCCACCC GAGCGCGAGT TCGATACACT CGCGAAGCTT GCACAACGCA TGCTTGGCAC CGGCATGTCT TCGATCACCC TGATTGCTCC GGAGCGCCAA TGGTTCAAGG CACGCTGCGG TCCCCTGGCA CCGGAGACGA CGCGGGCTCA GGCGTTCTGC CCGGTCGTGG TCGAGACCGA GGCGCCGCTC ACCGTAGCGG ACGCTCGCCT CGACCCTCGC TTCGCCGAAA GCCCATTTGT CACAGGCTCA CCGTATATCC GCTATTACGC GGGCGTCCCG TTGCGGGTCC GGCGGCCAGA CAGCGGCCAC GTTACGATCG GCACGCTCTG CGTCCTTGAC GAGCGGCCGC GCGAGCCGAC CTCAACCGAC TTGGAGGTTC TAGAAGAGTT GGCCTGCGTC GCCGAAGCCT TGATTGAGGC CCGGGCCGTC GCTCTCCGTG CTGCCAAGGC TGCCGAAGAG CACCGTCGAG CCGTAGAGCG GCTGGAGCGC GAACGCCGCC AGTTCAAGCA GGCGGAGCGC ATGGCCGACA TGGGATCGTA CCGATACGAC ATCGAGAAGC AGTTCACCCT CTGGTCGGAC GGTGTCTTCG CCATCCACGA ACGGCCCGTC AGCGCCGGTG TGCCGAACGG CGAGATCATG AACTATTTCC CCGAGCCCGA TCGCTCCCTA TTCGTCGCCG CGGTCAGGCG CACGCTGGAC ACGGGCGAGC CGTTCGAGAT GGACGGCGAC TTCATAACCG CCAAAGGCAA CGCGCGGCGC GTGCGGTACT CCTGCGAGAT CGAGTTGGCC AAGGGCAAAC CCGTTGCCCT CATCGGTCTG ATCCAAGACA TCACCGAACG GCACGGCTTG GAGCAGCGCC TGCGCCACCT CGCTTGCACC GACGACCTGA CCCAGCTGGC CAACCGGGCC GAGTTTCACC GTGTTCTCGA TGGACGGCTG CACGAGGCGC GCGCTGCGGA CGACGACGTG GCCGTGCTTC TGATCGACCT CGACGGCTTC AAAGGCGTCA ACGACGTCCT CGGCCATGCA GCAGGCGACG CGGTGTTGCG CGGTGTCGCC GATCGGTTGC GCGCTTTCTG CGATGACGGT TGTCTCCCAG CTCGGCTAGG GGGCGACGAG TTCGCGGTCG TGATGCCGGC CGGTCTCGAT CGCGTAGGAC TTGATCGGAA GGTGCGGCGC CTCCTGCACG AACTTGAGAT CGTCATGCAC GGACATGGCC ACATCGCCCG TGTGACGGGA ACAATCGGTA TCGCGTGGTC GAGCGCGGCC GCGCAGAACC GCGACGTCCT CCTTCGTCAG GCCGATGCGG CGCTCTACGC CGCCAAGCGC ACCCGGAAGG GAACGGCCCA GACCTATCAA GCCGGCGCGG ACCACCGGAT GGCTGGCTGA
|
Protein sequence | MGVERHMSDN QREALRQNAL AEIALLDTPP EREFDTLAKL AQRMLGTGMS SITLIAPERQ WFKARCGPLA PETTRAQAFC PVVVETEAPL TVADARLDPR FAESPFVTGS PYIRYYAGVP LRVRRPDSGH VTIGTLCVLD ERPREPTSTD LEVLEELACV AEALIEARAV ALRAAKAAEE HRRAVERLER ERRQFKQAER MADMGSYRYD IEKQFTLWSD GVFAIHERPV SAGVPNGEIM NYFPEPDRSL FVAAVRRTLD TGEPFEMDGD FITAKGNARR VRYSCEIELA KGKPVALIGL IQDITERHGL EQRLRHLACT DDLTQLANRA EFHRVLDGRL HEARAADDDV AVLLIDLDGF KGVNDVLGHA AGDAVLRGVA DRLRAFCDDG CLPARLGGDE FAVVMPAGLD RVGLDRKVRR LLHELEIVMH GHGHIARVTG TIGIAWSSAA AQNRDVLLRQ ADAALYAAKR TRKGTAQTYQ AGADHRMAG
|
| |