Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_1671 |
Symbol | |
ID | 5832219 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 1879549 |
End bp | 1885362 |
Gene Length | 5814 bp |
Protein Length | 1937 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641367470 |
Product | histidine kinase |
Protein accession | YP_001639141 |
Protein GI | 163851098 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCCGA CTACGTTGCA GTTGGAGCCG AAGCTCCTCC TGAAGAGCCT TCGTGCCTTT CGGAAGGGTG ACTTCTCGAC TCGGCTGCCG CTGGATCTGA CGGGCATTGA GGGGGAGATC GCGCAAGCGT TCAACGATAT CGTCGAGCTC AACCAAGGGC TCGCTCGTGA ACTCGACAGG GTCGCCCGCG CGGTCGGCAA GGACGGGCGC ATCGGCGAGC GCGGAAAGCT TCCGGCCGCG ACCGGCGGGT GGAACGATTG CGTCGAGTCG GTCAACACGA TGATCGGCGA CCTTGTTCAG CCCACCACCG AGGTGGCGCG CGTCATCGGG GCCGTCGCCA AGGGCGACCT CGGCCAGACC ATGCAGATCG AGATCGAGGG CCGGCCCCTG CGCGGCGAGT TCCTTCGGAT CGGCAAGGTC GTCAACACGA TGGTGGACCA GCTGAACTCC TTCGCCTCCG AGGTGACCCG CGTCGCCCGC GAGGTCGGCT CGGAGGGCAA GCTCGGAGGG CAGGCGCAGG TCAAAGGCGT CGGCGGGACC TGGAAGGACC TGACCGACAA CGTGAACCTG ATGGCGGCCA ACCTGACGGG CCAAGTGCGC AACATCGCCG AAGTCACCAC CGCGGTGGCC AACGGCGACT TGTCCAAGAA GATCACCGTC GACGTGAAGG GCGAGATCCT CGACCTGAAA TCGACCATCA ACACGATGGT GGACCAGCTC AATTCCTTCG CCTCGGAGGT GACCCGCGTC GCCAAGGAGG TGGGGTCGGA GGGCAAGCTC GGCGGACAGG CGCAGGTCAA GGGCGTCGGC GGCGTCTGGA AGGACCTGAC CGACAACGTC AACATGATGG CGGAAAACCT GACGGGCCAG GTGCGCAACA TCGCCGAGGT GACGACCGCG GTCGCGCGGG GCGACCTGTC CAAGAAGATT ACCGTCGATG TGAAGGGCGA GATTCTCGCC TTGAAGCTGA CCATCAACAC GATGGTGGAC CAGCTGAACT CCTTCGCCTC GGAGGTGACC CGCGTCGCCC GCGAGGTCGG CACGGAGGGC AAGCTCGGCG GACAGGCCCA GGTCGAGGGC GTGGGCGGGA CCTGGAAGGA CCTGACCGAC AACGTGAACA TGATGGCGGC CAACCTGACG GGCCAGGTGC GCAACATCGC CGAAGTCACC ACCGCGGTGG CCAACGGCGA TCTCTCCAAG AAGATCACCG TCGACGTGCG CGGCGAGATC CTGGAACTCA AGAACACCAT CAACACGATG GTGGACCAGC TGAATTCCTT CGCCTCCGAG GTGACCCGCG TCGCCCGTGA GGTCGGCTCG GAAGGCAAGC TCGGCGGGCA GGCGCAGGTG CGCGGTGTCG CCGGCACCTG GGCCGACCTC ACCGACAACG TGAACCTGAT GGCGGCCAAC CTGACGGGTC AGGTGCGCAA CATCGCCGAC GTGACCACTG CGGTGGCCAA CGGTGATCTG TCCAAGAAGA TCACCGTCGA CGTGCGCGGG GAGATCCTGG AACTCAAGAA CACCATCAAC ACGATGGTGG ACCAACTGAA CTCCTTCGCC TCCGAGGTGA CCCGCGTCGC GAAGGAAGTG GGTTCGGAAG GCAAGCTGGG CGGTCAGGCC CGCGTCGAGG GCGTGGCCGG CACCTGGGCC GACCTCACCG ACAACGTGAA CCTGATGGCG GCCAACCTGA CGGGCCAAGT GCGCAACATC GCCGATGTGA CGACCGCGGT CGCCAATGGC GATCTGTCCA AGAAGATCAC CGTCGACGTG AAGGGCGAGA TCCTGGAGCT CAAGTCGACT ATCAACACGA TGGTGGATCA GCTGAACTCC TTCGCCTCCG AGGTGACCCG CGTCGCCCGC GAGGTCGGCA CGGAGGGCAA GCTCGGCGGA CAGGCGCAGG TCAAAGGCGT CGGCGGCGTC TGGAAGGGCC TGACCGACAA CGTGAACATG ATGGCGGCCA ACCTGACGGG GCAGGTGCGC AACATCGCCG AGGTGACGAC CGCGGTCGCC AACGGCGACC TGTCCAAGAA GATCACCGTG GCCGTCGAGG GCGAGATCCT GGAGCTCAAA TCGACCATCA ACACGATGGT GGATCAGCTG AACTCCTTCG CCTCCGAGGT GGTCCGCGTC GCCCGCGAGG TCGGCATCGA GGGCAAGCTC GGCGGCCAGG CACAGGTGCG CGGCGTCGGC GGGACTTGGA AGGACCTGAC CGACAACGTG AACATGATGG CGGCCAACCT CACCGGCCAG GTGCGCAACA TCGCCGACGT GACGACCGCG GTCGCCAATG GCGATCTGTC CAAGAAGATC ACGGTCGACG TGAAGGGCGA GATCCTGGAG CTCAAATCGA CCATCAACAC GATGGTGGAT CAGCTGAACT CCTTCGCCTC CGAGGTGACC CGCGTCGCCC GCGAGGTCGG CTCCGAGGGC AAACTCGGCG GCCAGGCGCA GGTGCGCGGC GTCGGCGGGA CCTGGAAGGA CCTGACCGAC AACGTGAACA TGATGGCGGC CAACCTCACC GGTCAGGTGC GCGGCATCGC AGACGTCGTC ACGGCGGTGG CGCAGGGCGA CCTCAAGCGG AAACTCTCGG TCGACGCCAA GGGCGAGATC GCGGCGCTGG CCGACACCGT CAACGAGATG ATCGAGACGC TTGCCACCTT TGCCGACCAG GTGACCAACG TGGCGCGCGA AGTGGGCGTC GAAGGCAAGC TGGGCGGCCA GGCGCGGGTG CCGGGCGCGG CGGGTCTCTG GCGTGATCTG ACGGACAACG TGAATCAGCT GGCGGCGAAC CTCACGACCC AGGTGCGCGC CATCGCCGAG GTGGCGACCG CCGTGACGAA GGGGGATCTC GCGCGCTCGA TCTCGGTCGA GGCCTCCGGC GAGGTCGCCT CGCTGAAGGA CAACATCAAC GAGATGATCC GCAATCTGCG GGACACGACG CTCAAGAACG CCGAACAGGA TTGGCTGAAG ACCAACCTCG CCAAGTTCAC CCGCATGCTG CAGGGCGAGC GCGACCTCGC CACGGTCTCG AACCTGATCC TGTCCGAGAT TGCATCGCTC GTGAGCGCCC AGCGCGGCGT GTTCTACATG GTGGAGGACG AGGGCGGAGA GCCGGTTCTC GATCTGATGG CGAGCTACGC CTTCACGGAG CGCAAGAACC TCTCGAACCG CTATCGCTTG CGTCAAGGCC TCGTCGGCCA ATGCGCCTTC GAGAAGAAGC GCATCCTCCT CACCCACGTG CCCGGCGACT ACATCACCAT CGGCTCGGCT CTCGGCGAGG CGCCGCCCCT CAACATCATC GTGCTGCCCG TCCTCTTCGA GCAGGAAGTC CGAGCGGTCA TCGAGTTGGC GTCGTTCAAC CGCTTCAGCG AGACCCACCA GTCCTTCCTC GATCAGCTGA CCGAGTCGAT CGGGATCGTG CTCAACACGA TCGCTGCCAA CATGAGGACG GAAGGCCTCC TGAAGCAGTC GCAGCTCCTG ACCGGCGAGT TGCAGAGCCG GCAGGAAGAA TTGAAGAAGA CGAACGATCG CCTGGAGCTG CAGGCCGCCT CGCTGCAGCA ATCCGAAGAC CTCCTGAAGA ACCAGCGGGA GCGGCTGCAG CAGACCAATG AGGAGCTGGA GGAAAAGGCG CGCCTGCTCG AAATCCAGAA GCGTGAGGTC GAGGGCAAGA ACCGCGAGGT CTCGATCGCC AAGACGGCGC TGGAGGAGAA GGCCGAGCAG CTGAGCCTGA CCTCGCGCTA CAAGTCGCAA TTCCTCGCCA ATATGAGCCA CGAGCTACGC ACGCCGCTCA ACAGCCTGCT GATCCTCTCG AAGCTTCTCT CCGAGAACCG CGACGGGAAC CTCACCGACA AACAGCGGGA ATTCGCCAAG ACGATCAACG CGGCCGGGAC AGACCTCCTG TCGCTCATCA ACGACATCCT CGATCTCTCC AAGATCGAGT CGGGCACGGT GTCGCTTGAG ATCGGCGACG TCACGCTGCA GGACATCAGC GAGAACCTGA ACAGGACCTT CCGGCAGCTC GCCGAGGAGC GCCACCTCGC CTTTGTGATC GAGGTCGATC CGGGCCTACC GCGCTCGCTG CGGACGGACT CGAAGCGCCT GCAGCAGGTC CTGCGCAACC TTCTCTCCAA CGCCTTCAAG TTCACCGAGA AGGGGAGCGT CTCGCTCAAG ATCGGTTCCG CGGAGGGCTC GCCCCTGCGC GCCGGGACCC AGTGGATCGC CATGTCCGTC ACCGATACCG GGATCGGCAT CGCCGAGGAC AAGCAGCGGA TCATCTTCGA GGCGTTCCAG CAGGCCGACG GGACCACGAG CCGCAAGTAC GGCGGCACGG GCCTCGGCCT CGCGATCAGC CGCGAGATCG CGCGCCTGTT GGGCGGCGAG ATCGTGGTCG AGAGCCGCGT CGGTACGGGC AGCACCTTCA CCCTGTTCCT GCCCTTCGAA CCACCCGCGC AGGCGCTCAC CGGACGCGCG GTGGAGGCAT CGGACGGGTA CGCGCCTTCG GCGCCGATGG GCCGCCAGCC GAACACCGCG ATGGCGCTCT CGTCCTCCGC GGACGACCGT CATGCAATTC ACCCGAGCGA CCACATCGTG CTCATCGTGG AGGACGACGC CATGTTCGCC TCGGTGCTGC TGGAGCTCGC GCGCGAGCGT GGCTTCAAGG GTTTGATCGC GCAGGACGGG GCCGGCGCGC TTAGCCTCGC GCACCGCTTC AAGCCGCACG CCATCACCCT CGACATCGGC CTGCCCGACA TGGACGGCTG GGCCCTGCTC GACCTCCTGA AGAACGATCC CCGCACCCGG CACATTCCGA TCCACGTGAT CTCGGTCAAC GACGAGAAGA AGCGCGGCCT TCGCGCGGGC GCGTTCGGGT TTCTGGAGAA GCCGGTCGAG CGCGAGGGCC TGATGCGCGC CCTGGAGCGC TCGAAGGAGT TCATCACGCG GCCGGTGCGC AACCTCCTGC TCGTCGAGGA CGACGAGAAC CAGCGCGCCA GCATCACAGC GCTCCTAAGC GAAGAGGACG TCAAGATCTT CGGGGTCGGC ACGGCCTCCG CCGCGCTGGA AGCCCTGACA GGCGGCCGGT TCGACTGCGC GATCATCGAT CTCGGCCTGC CCGATATCGG CGGTTCCGAG CTGATCGAGC AGATCCGTGC CTCCCAGGAC GGGGAGGAGT TGCCGATCAT CGTGTATACG GGCCGGGAGC TGACGGCCGC CGAGGAGCAG CAGCTCAGGC ACACCGCCTC GACGATCATC CTCAAGGACA CGCGCTCGTC CGAGCGACTT CTCGACGAGA CGGCGCTGTT CCTGCATCGC GCGATCAATC GCGTTCCAGT CGAGGAGCAG ATCCTCGTCG AGCGGAAGGA TGCGGCGTCA CTTCAGGGCT GTCGGGTCAT TCTGGTCGAT GACGACCTGC GCAACATCTT CTCGCTCACG AGCGCCCTCG AGCAGCATGG GCTGGAAGTT CTGTTCGCCG AGAACGGGCA GGACAGCATC GCCCTCCTCA AGTCCAACCC GAACATCGAT GCGATGCTGG TCGACATCAT GATGCCGGGG ATGGACGGCT ACGAGACGAT GCGCGCGATC CGGGCGGAGG CCAGCTTCCG TAGCCTGCCG CTCATCGCGG TGACCGCGAA GGCCATGAAG GGCGACCGAG AGAAGTGCCT CGAGGCGGGC GCGTCCGATT ACGTCTCGAA GCCGATCGAC ATGGATCAGC TCCTCGCGGT GCTGCGGGTG CAGCTGGCCC GGCGTGGCGA GGTCGCACAC GGCCCCGTGC CCACGCTCGG CGACCCGACG ACCCTCGATC GGCAACCACA ATGA
|
Protein sequence | MTPTTLQLEP KLLLKSLRAF RKGDFSTRLP LDLTGIEGEI AQAFNDIVEL NQGLARELDR VARAVGKDGR IGERGKLPAA TGGWNDCVES VNTMIGDLVQ PTTEVARVIG AVAKGDLGQT MQIEIEGRPL RGEFLRIGKV VNTMVDQLNS FASEVTRVAR EVGSEGKLGG QAQVKGVGGT WKDLTDNVNL MAANLTGQVR NIAEVTTAVA NGDLSKKITV DVKGEILDLK STINTMVDQL NSFASEVTRV AKEVGSEGKL GGQAQVKGVG GVWKDLTDNV NMMAENLTGQ VRNIAEVTTA VARGDLSKKI TVDVKGEILA LKLTINTMVD QLNSFASEVT RVAREVGTEG KLGGQAQVEG VGGTWKDLTD NVNMMAANLT GQVRNIAEVT TAVANGDLSK KITVDVRGEI LELKNTINTM VDQLNSFASE VTRVAREVGS EGKLGGQAQV RGVAGTWADL TDNVNLMAAN LTGQVRNIAD VTTAVANGDL SKKITVDVRG EILELKNTIN TMVDQLNSFA SEVTRVAKEV GSEGKLGGQA RVEGVAGTWA DLTDNVNLMA ANLTGQVRNI ADVTTAVANG DLSKKITVDV KGEILELKST INTMVDQLNS FASEVTRVAR EVGTEGKLGG QAQVKGVGGV WKGLTDNVNM MAANLTGQVR NIAEVTTAVA NGDLSKKITV AVEGEILELK STINTMVDQL NSFASEVVRV AREVGIEGKL GGQAQVRGVG GTWKDLTDNV NMMAANLTGQ VRNIADVTTA VANGDLSKKI TVDVKGEILE LKSTINTMVD QLNSFASEVT RVAREVGSEG KLGGQAQVRG VGGTWKDLTD NVNMMAANLT GQVRGIADVV TAVAQGDLKR KLSVDAKGEI AALADTVNEM IETLATFADQ VTNVAREVGV EGKLGGQARV PGAAGLWRDL TDNVNQLAAN LTTQVRAIAE VATAVTKGDL ARSISVEASG EVASLKDNIN EMIRNLRDTT LKNAEQDWLK TNLAKFTRML QGERDLATVS NLILSEIASL VSAQRGVFYM VEDEGGEPVL DLMASYAFTE RKNLSNRYRL RQGLVGQCAF EKKRILLTHV PGDYITIGSA LGEAPPLNII VLPVLFEQEV RAVIELASFN RFSETHQSFL DQLTESIGIV LNTIAANMRT EGLLKQSQLL TGELQSRQEE LKKTNDRLEL QAASLQQSED LLKNQRERLQ QTNEELEEKA RLLEIQKREV EGKNREVSIA KTALEEKAEQ LSLTSRYKSQ FLANMSHELR TPLNSLLILS KLLSENRDGN LTDKQREFAK TINAAGTDLL SLINDILDLS KIESGTVSLE IGDVTLQDIS ENLNRTFRQL AEERHLAFVI EVDPGLPRSL RTDSKRLQQV LRNLLSNAFK FTEKGSVSLK IGSAEGSPLR AGTQWIAMSV TDTGIGIAED KQRIIFEAFQ QADGTTSRKY GGTGLGLAIS REIARLLGGE IVVESRVGTG STFTLFLPFE PPAQALTGRA VEASDGYAPS APMGRQPNTA MALSSSADDR HAIHPSDHIV LIVEDDAMFA SVLLELARER GFKGLIAQDG AGALSLAHRF KPHAITLDIG LPDMDGWALL DLLKNDPRTR HIPIHVISVN DEKKRGLRAG AFGFLEKPVE REGLMRALER SKEFITRPVR NLLLVEDDEN QRASITALLS EEDVKIFGVG TASAALEALT GGRFDCAIID LGLPDIGGSE LIEQIRASQD GEELPIIVYT GRELTAAEEQ QLRHTASTII LKDTRSSERL LDETALFLHR AINRVPVEEQ ILVERKDAAS LQGCRVILVD DDLRNIFSLT SALEQHGLEV LFAENGQDSI ALLKSNPNID AMLVDIMMPG MDGYETMRAI RAEASFRSLP LIAVTAKAMK GDREKCLEAG ASDYVSKPID MDQLLAVLRV QLARRGEVAH GPVPTLGDPT TLDRQPQ
|
| |