Gene Information |
Locus tag | Mext_4081 |
Symbol | |
ID | 5832962 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 4539344 |
End bp | 4542031 |
Gene Length | 2688 bp |
Protein Length | 895 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641369872 |
Product | histidine kinase dimerisation/phosphoacceptor |
Protein accession | YP_001641522 |
Protein GI | 163853479 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) |
TIGRFAM ID | [TIGR02373] photoactive yellow protein |
|
|
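The length fields in the table above can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive (an assumption, though it agrees with the listed values) and that the final codon of the CDS is a stop codon. The function names are hypothetical and only illustrate the arithmetic.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(4539344, 4542031)
print(length, protein_length(length))  # 2688 895
```

Both results match the Gene Length (2688 bp) and Protein Length (895 aa) fields reported for Mext_4081.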
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.628066 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.498394 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCGAAGG CAGACCCGGC ATTGCTTAAC CGGGGCGTGG CAGGCATACC GGCTGCCGTA ATGGTCGATC CGTCGCAGAT GCATGCCGTG GGGCCGAACG GTGGTCCGGT CGACCTTCCG GTCGATCTCG ATGCCTTGAC CGCCGAGCAG CGGGATGAAT TCGGCGTGGG CATTCTTGCC CTCGACGCCG CCGGAATCGT GCTCGCCTGC AATCGGGCCG CGGGCGCGCT GTGCGGCCTG CCGCCCAATA CGATGATCGG GCGGAGTTTC TTCCGCGAAC TCGTGCCGAG TGCCAACGTG CCGAGCTTCT ACGGCCGCTT TCTCAGCGGC CAGCGCCGGA GCGTGGCCGA TCAGGCCTTC GAGTTCGTGT TCGGCCGCAT CCCGGCCCCC CTGCGGGCAC GAATCGGCCT GCGGTCCGGG GCGAACGGGC ACATCTGGCT GACGATCACC CCCCTGGAGC AGATCGCCGC CGGCCCCTCG CGCGAGGCCG TCCTCGCGGC GATCGCTCAG CGCAGCCGGG CCGAGCCGGT CGATCCGAGC CTGTGCGAGC GTGAGCCGAT CCACATTCCC GGCTCGATCC AGCCGAACGC GGTGATGCTC GCCGCCGACG CCGCGAGCCT CGAGATCCTG GCCTTCAGCG CCAACGCCGC CGACGTGCTG GCGCCCGATC TCTTCCCGCC CAACGGCCTC AACCTGGAGG CGGTGCTGCC GGGCGCGATC GTCTCCGCGA TTCGCGATGG GCTCGCCGCC CACACCCTGA CCGACGGTCG GCTCCTGCGC CGCTCGCTCA CGCTTCCGCC GCGGGGAGAG CGCTTCCACC TCGTGGCCCA TGCCCATCTC GGCCGGGTGA TCGTCGAGCT GGAACTGGCG CCGGAGCGCC CCGAGGACTT CCTCGCCGCG AGCCCGCTCG ACGCCGAACT CGCCATGATG CGGCTGCGCG CCGCCGAGTC CCTGACCGAG GCGGCGCAGA TCGCCGCCCT CGAAATCCGC GCCATGACCG GTTTCGAATC GGTGCTGGTC TACCGGTTCG ACACGGATTG GAACGGCGAG GCCATCGCCG AGGACATGGT GCCGGACTGG CAGCGCCCGC TGATCGGCCT GCGCTTTCCC GCCTCCGACA TCCCGGCCCA GGCCCGCGCC CTCTACACCA AGGCCAAGAG CCGCTTCGTG ATCGACCGCG ACTGCGTGCC CGTGCCGCTG GTGGCCGACC GCGCCGCGGG CAACGCGCCG GTCGACCTCA CTTTCGCGCA GAACCGAACG CTCTCGCCGA TCCACCTCGA ATACCAGCGC AATCTCGGCG TCGACGGCTC GATGTCGATC TCCATCATGG TCGAGAACCG GCTCTGGGGC CTGATGATTG GCCATCACCG CCGGCCGCAC TACGTCGCGC CGGAGACCCG CGCCGCGGCG ACCGTCCTCA CCGACGCCTT CGCCATGCGG GTGCAGGAGA TCGAGGGCAA GGCGCTCTGG GGCGAGCGGC AGCGCCATCT CGACGTGCAG GGCCGGCTGG TGCGCGGGCT CACCCGTTCC GACGATTTCG TCACCTCCCT GACCCAGGGC GATCCGACCC TGCTCGACCT GTTCGGCGCC ACGGGTGCGG GCATCGTCTC GGACGAGGCC GTCTGCCTCG TCGGCGTCAC GCCGGAGGCG GCGAAGGTGC GGGCGCTCGC CGATTGGCTG CGAGAGAGCC TGCCACCCGA CGAGACCACC TTCGTCACCG ACACGCTGGT GCTGCATCAC GCGCCGGCCG CGGACTTCAC GGAGATCGCC AGCGGCCTGC TCGCGGCCTT CGTCGGCACC 
TCGCGCCAGC ATCTGCTGTT CTGGGTCAAG CCGGAGGTGC CGAGCACGGT GACCTGGGGC GGCGATCCGC GCAAACCCGT CCTGCCCGGC AGCGGCCCGG TGGCGGTGCT GCCGCGGCGC TCGTTCGAGC GCTGGATCGA GGAGCGCCGT GGCCATTCCA CCCCCTGGGC GACGTGGAAG GTGGCGCTCG CCGCGCAGCT CGCCGACGCC GTGGACGGCG TGGTGCTGCG CCAGCGCCGC AAGATCGACG AGCTGACGGG GCTGCTGGCC GACAAGGAGC GCCTGCTGGA GCAGAAGGAT CTGCTCACCC GCGAGATCGA CCACCGGGTC AAGAACTCGC TGCAGATCGT GACGGCCTTC CTGCACATGC AGCGCCGGCA GATCGCCGAC CCGGAGGCGC GCCAAGCCTT CTCCGAGACC TCGGCCCGCG TCATGAGCGT CGCGCGGGTG CATGACAGCC TGTACCAGGG CGAGAGCATG GAGCAGGTCG ATCTCGGCCA GACCATCCAG ACCCTGTGCA GCGACCTCGC CGGTATGGCC GGCGACGAGC ACAGCGTCGA GCTGACCGCC GAGCCCGGCC TGATGGTGCC CTATCGGCAC GCGGTGGCGC TCTCGCTGAT CACCACCGAA CTCGTCACCA ACGCGTTCAA ATATGCCGGC AAGCCCGAGA AGGGGGCGCG GATCAGCGTC TCCGTGGCCG GCGGCGAAGG GGCGGCCGTC CGCCTCAGGA TCTGCGACGA CGGCGAGGGC ATGCCGACGG GCTGGAAGAA CGCCAAGGCG CGGGGCACCG GGCTCGGCAT GAAGCTGATC CGCGCCATGC TCGACCAGAT CGGCGCCCGC CTCGACGTCG AGAACGCCGA CGGCGCCTGC TTCACCGTTC ACGCCTGA
|
Protein sequence | MAKADPALLN RGVAGIPAAV MVDPSQMHAV GPNGGPVDLP VDLDALTAEQ RDEFGVGILA LDAAGIVLAC NRAAGALCGL PPNTMIGRSF FRELVPSANV PSFYGRFLSG QRRSVADQAF EFVFGRIPAP LRARIGLRSG ANGHIWLTIT PLEQIAAGPS REAVLAAIAQ RSRAEPVDPS LCEREPIHIP GSIQPNAVML AADAASLEIL AFSANAADVL APDLFPPNGL NLEAVLPGAI VSAIRDGLAA HTLTDGRLLR RSLTLPPRGE RFHLVAHAHL GRVIVELELA PERPEDFLAA SPLDAELAMM RLRAAESLTE AAQIAALEIR AMTGFESVLV YRFDTDWNGE AIAEDMVPDW QRPLIGLRFP ASDIPAQARA LYTKAKSRFV IDRDCVPVPL VADRAAGNAP VDLTFAQNRT LSPIHLEYQR NLGVDGSMSI SIMVENRLWG LMIGHHRRPH YVAPETRAAA TVLTDAFAMR VQEIEGKALW GERQRHLDVQ GRLVRGLTRS DDFVTSLTQG DPTLLDLFGA TGAGIVSDEA VCLVGVTPEA AKVRALADWL RESLPPDETT FVTDTLVLHH APAADFTEIA SGLLAAFVGT SRQHLLFWVK PEVPSTVTWG GDPRKPVLPG SGPVAVLPRR SFERWIEERR GHSTPWATWK VALAAQLADA VDGVVLRQRR KIDELTGLLA DKERLLEQKD LLTREIDHRV KNSLQIVTAF LHMQRRQIAD PEARQAFSET SARVMSVARV HDSLYQGESM EQVDLGQTIQ TLCSDLAGMA GDEHSVELTA EPGLMVPYRH AVALSLITTE LVTNAFKYAG KPEKGARISV SVAGGEGAAV RLRICDDGEG MPTGWKNAKA RGTGLGMKLI RAMLDQIGAR LDVENADGAC FTVHA
|
| |
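The gene sequence begins with GTG while the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG can serve as an initiation codon and is then read as methionine. The sketch below illustrates this with plain Python. It uses the standard codon assignments, which table 11 shares for internal codons, and treats only ATG, GTG and TTG as start codons, which is a simplifying assumption. All function and variable names are hypothetical.

```python
from itertools import product

# Codon assignments in the conventional TCAG ordering (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only the most common bacterial start codons are handled here.
COMMON_STARTS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS, reading a recognised start codon as M and stopping at '*'."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in COMMON_STARTS:
            aa = "M"  # initiation codon is decoded as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First three ten-base blocks of the gene sequence in this record.
print(translate("GTGGCGAAGG CAGACCCGGC ATTGCTTAAC"))  # MAKADPALLN
```

The printed residues match the first ten residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` gives the value behind the GC content field.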