Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_2117 |
Symbol | |
ID | 5833189 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 2375020 |
End bp | 2376915 |
Gene Length | 1896 bp |
Protein Length | 631 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641367914 |
Product | histidine kinase |
Protein accession | YP_001639583 |
Protein GI | 163851540 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.0208634 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGTCAC CGCTCGCGCC GCTGCATCAA GGGCGGCGTG AGCGCATGCG TTACGCTGAT TCGGTCGGCG GATGCATGCG GGCGACCTCG AAGACCGCGG AAGGCCTTCA CGCGATGAAC GACGACGACT GGCTCGAACT CATCCCCGAC GAGGCTGCGG AAGCGCCGAC CCGAGCCGGA ACCTGGACCA TCGCGGTGGT CGACGACGAT CCGGCCGTCC ACGAGGGCAC CCGCTACGCG CTCGCCGGCT ACAGCCTCGA CGGGCGGGGC CTCGACATCC TCTCCGCCTA TTCGGCGCAG GAGGCGCGGG CGCTGCTCGC CGAGCGGCGC GACATCGCCA TCGTGCTGCT CGACGTCGTG ATGGAGACCG ACGATGCCGG ACTGAGGCTC GTCGATTACA TCCGCCGGGA GCTGAAGGAG GAGACCGTCC GCATCATCCT GCGCACCGGC CAGCCGGGGC AGGCGCCGGA GCGGCGCGTC ATCGTCGATT ACGACATCAA CGACTACAAG GCCAAGACCG AACTCACCGC CGACAAGCTC TTCACCAGCC TCACCGCGGC GCTCCGGGCC TACCAGCAGC TGCGGCGGCT CGACGAGACC CGGCGTGGCC TGGAAATCAT CATCGACGCC GCGCCGATGC TGCTCGACCA CAAGTCGATG CAGCGGCTCG CCGAAGGGGT GCTGACCCAG GTCGCCTCGC TCCTCAATGT CGATTGCGCC GGCATCCTGG TTCTGCGCGA GAGCGAGGAT GCCGCCGACC GGCGGGAGGG CGGATTCTGC GTGTTGGCGG GCTCGGGGCT CTATGGCGGC TATGTCGGGC GCGACCCCGG CTGGCCGCTC GATCCCGGCA TCCAGCCCCT GGTCGAGCAA GCCTTCGCCG CGCGCTGCCA CAACTTCGGC GACAACCTCT CCACGCTCTA TGTCCAGACT GCGAGCGGCA GCGAGATCGT CGCGCTGATC GACACCGACC GCCCGCTCTC CGACACCGAC CGCGCGCTCA TCACCCTTCT GGCGGGGCGC CTCTCGGTCG CCTTCGACAA CGTGATTCTC TACGAGCGGC TGCAGCGCGC CAACGTCACC TTGGAGCAGC GGGTGGTCGA GCGCACCGCC GAGCTGATCC GGGCCAACCG CCGCCTCGAC ATGCAGCGCT CGGACCTGCG CCGGGCCAAC AGCCTCAAGA CCGAGATTCT GGGCACCATC GCCCACGACC TGAAGAACCC GCTCTCGGTG ATCCTGGGTC GCTCCGAGAT GCTGGCCGAC CTGATCGGCC TCGATCAGGG GGATGCCTCC GGCGCGGAGA AGGCGCAGGC GGCGATGCTG ACCCAGGTCG AGCACATCCG CGCCTCGGCC ACGCGTCTGA TCGACATGAT CGACAGCCTC ATGGCCGATG CCATGAACGA CGCCCTCGAC ATCACGCTGC GGCGGGAGCC CGTCGATCTC GCCGGCCTCG CCCGCGAGGT CTGCGAGGCC AACCGGCCGC TCGCCGAATC GAAGGGCCAG AGCCTGGTGA TGGATCTCGC CGGCCCGCTC ACTCTTTGCG GCGACGCCGA GCGGCTGCGC GAGGCGCTCG ACAACCTCGT CTCGAACGCG ATCAAGTATT CCTATCCCGG CGGCGCGATC GCGGTGAGCG TGCGGGAGGA GGGAGGCGAC CTCGTCTGCG CGGTGGCCGA CCAGGGCCCC GGCCTGTCGC CGGAGGATGC CGGCCGGCTT TTCGGCCGCT ACCAGCGCCT CTCGGCCAAG CCCACCGGCG GCGAGGGCTC GACGGGGCTC GGCCTCTCCA TCGTCAAGCG CATCGCCGAA CTCCACGGCG GCCGCGCCGA GGCGTTTTCC GACGGACCGG GGCAGGGGGC CGAGTTCGCG ATGCGGTTTC CAAGGGAGGC GGTGGGGGTG CCTTAG
|
Protein sequence | MASPLAPLHQ GRRERMRYAD SVGGCMRATS KTAEGLHAMN DDDWLELIPD EAAEAPTRAG TWTIAVVDDD PAVHEGTRYA LAGYSLDGRG LDILSAYSAQ EARALLAERR DIAIVLLDVV METDDAGLRL VDYIRRELKE ETVRIILRTG QPGQAPERRV IVDYDINDYK AKTELTADKL FTSLTAALRA YQQLRRLDET RRGLEIIIDA APMLLDHKSM QRLAEGVLTQ VASLLNVDCA GILVLRESED AADRREGGFC VLAGSGLYGG YVGRDPGWPL DPGIQPLVEQ AFAARCHNFG DNLSTLYVQT ASGSEIVALI DTDRPLSDTD RALITLLAGR LSVAFDNVIL YERLQRANVT LEQRVVERTA ELIRANRRLD MQRSDLRRAN SLKTEILGTI AHDLKNPLSV ILGRSEMLAD LIGLDQGDAS GAEKAQAAML TQVEHIRASA TRLIDMIDSL MADAMNDALD ITLRREPVDL AGLAREVCEA NRPLAESKGQ SLVMDLAGPL TLCGDAERLR EALDNLVSNA IKYSYPGGAI AVSVREEGGD LVCAVADQGP GLSPEDAGRL FGRYQRLSAK PTGGEGSTGL GLSIVKRIAE LHGGRAEAFS DGPGQGAEFA MRFPREAVGV P
|
| |