Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_3919 |
Symbol | |
ID | 5834123 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 4354766 |
End bp | 4355986 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641369710 |
Product | diguanylate cyclase |
Protein accession | YP_001641361 |
Protein GI | 163853318 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2199] FOG: GGDEF domain |
TIGRFAM ID | [TIGR00254] diguanylate cyclase (GGDEF) domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.556135 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATGTGG TCACCATGCT GGTCATGAGC CTTGCCGCGG CCGCTGCGGG CTTCCTGGCC GTCGAGTGGC GGATCCTGCG CAACCCCGCG CTGCTGCTGT GGAGCGCGGG CTTCGCCACG ATCGTCCTCG GCTGCGCGCT CTCGCCGGTC CGCGCCGCCT CCTTCCTCGT CGGGGTCTGG TTCGCCAACG GCCTGCTCGT CGTCGCCCAC CTCCTGTTCC TCTTCGGCAC GGCCCGCTTC ACCGGGCGCC GGATCGCACC GCTCTGGTGG GGCCTGCTGC TGGCCTGGGC CGGGCTCCTG CTCCTGCCGG CGGGGATCGA CCCGACGCCG GCCTTCGCGA TGACCAATTC CGGCCTCGTC GCCCTTGCCG CCCTGCGGGC CGCCCATCTC CTGCTGTCGC GGGCCGGCAC GCCCTTCCCG GAGCGCACCG CCGCCTCCGA CGCGCTCGGC GCCGTCTTCC TCGCCCACGG CACCTTCTAC GCCGTCAAGG CGCTGCTCGT GCCCGTGCCT GGCGCCTTCG TCAGCCTCGT CGGCTTCAAG GGCGTGCTGA TCCAGGTCTC GCTGTTCGAG GGCATCCTCG TCGAGATGCT GCTCGCGCTG CTGATGGCCG CCGCGGTGCG CCGCCGCCGG GAGGAGGCGA TGACGGCGCT CGCCGAGCGC GATCCCCTCA CCGGCGCCTT CAACCGCCGC GCCTTCGAGA GCCGCGCCGC GGAGGCGTTG CGGGACATGG CCGCCGGGCG CCAGTCCGGC GCCCTGCTGC TGCTCGACGT GGACCGGTTC AAGACGGTCA ACGACAGCTT CGGCCATGCC GTCGGCGACC GCATCCTCGT GGCGCTCACC GGCGTGCTGG AGGCCTGCCT GCCGCACCAC GCGATCCTGG CACGGCTCGG CGGGGACGAG TTCGTGATCC TGGTGCCCGG CCTCGACGAG GCGGCGGCAT CCGCGCTCGG CGCGGCGATC CGCGACCGCA TGGCCCGCGA GAGCAGCCGG AGCCTGCCCA CCGGCGCCAC GGTCAGCCTC GGCGCCGCCC TGTTCAGCGG TGGCCCGGCC GGGCTCGACA CCCTGATGGC GCTCGCCGAC ACGGCGCTCT ACGAGGCCAA GGCCCATGGC CGCGACCGGC TGCAGGCCCG CCGCCTGGAG CCGCCGGCGG AGCGGGTCCC GGATCGATCC GGGCCGGGGG CGGCGACGCA CATGTCCGCG CAAGGTGGCT GCTGCGCCTA A
|
Protein sequence | MDVVTMLVMS LAAAAAGFLA VEWRILRNPA LLLWSAGFAT IVLGCALSPV RAASFLVGVW FANGLLVVAH LLFLFGTARF TGRRIAPLWW GLLLAWAGLL LLPAGIDPTP AFAMTNSGLV ALAALRAAHL LLSRAGTPFP ERTAASDALG AVFLAHGTFY AVKALLVPVP GAFVSLVGFK GVLIQVSLFE GILVEMLLAL LMAAAVRRRR EEAMTALAER DPLTGAFNRR AFESRAAEAL RDMAAGRQSG ALLLLDVDRF KTVNDSFGHA VGDRILVALT GVLEACLPHH AILARLGGDE FVILVPGLDE AAASALGAAI RDRMARESSR SLPTGATVSL GAALFSGGPA GLDTLMALAD TALYEAKAHG RDRLQARRLE PPAERVPDRS GPGAATHMSA QGGCCA
|
| |