Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A0208 |
Symbol | |
ID | 4783991 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 224727 |
End bp | 226853 |
Gene Length | 2127 bp |
Protein Length | 708 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640088757 |
Product | PAS/PAC sensor signal transduction histidine kinase |
Protein accession | YP_001019405 |
Protein GI | 124265401 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.832424 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGAAA CTGCTTACCT CTTTCGCTTG CGAGCTCTCG ATACTGAGGC TATGGCCGAT GTCGATGCCC CCGTCGTGCC CACGCCGCTC GCTTCACGCT GGCGGCGACT GTTGCCGTGG GCGCTGCTCG TGCTATTGCT GGGCGTGGCC CAGACACTGC TGCTGGCACT GACGGTGCAG CACGAGAACA ACCGCGAGCA GGAGCGAGTG GAGATGGCCG CCGCCTCGGC CGCGGCCAAC GTGCGGCAGC TGCTCCTGCG CGACGTGCAG AGTCTGCAGG CCCTGCTGTG GAACGATCCG CAACCCGCCC AATGGCGCTC CGACGCGGCC GACCTGCTGC GTGCGCGGCG CGAGATGCTG CGCATCGAAT GGCGCAGTGG CGCCAACGTG ATCGAGACGG CCGCCGATTC GCCCTACGGT GGGCCGCTGT TCGTCAGCCT GGCGCGCAGT GACCTCGATC TGGACACGCA GGCCGCGTGT GCCGACGCGC AGCGGCTGGC GGCGCCGGCC TGGTCACGCA GCTACTTCGT GCCTCAGCCC GGCGGACTCG GGGTCGAGGT GATCGACGTC TGCCTGCCGA TCGGTCCGAA TGGCGAGCGC GGCGCGATGG TGGCCACGCT GGCGCTCGGC GGGGTGATCG ACGAAGCCGT CGCGCCTGAA CTGGCGCGCA GCCACGAGCT GTCGTTCGTC GAAGGTGACG GTACGCGCCT GGCGCGCGGA GGGCTGACGC GCGGAGCCGG CATCTATGTC ACCGAGCGCC TGGTCAACCT GTCGAGCCTC ACGCTGGCGT TGCGGCTCGA CAGCGCGGCC GGCGCGCCGC GACTGATCCC CAACGTGGCG ACCTCGCTGG TGCTCGGCCT CTCGCTTGCG CTGGCGGCGG TGGTGGCGTT GCTGGCGCGC GACGTGCGTC GCCGCGCGGC GGCGGAGCAG CGTCTCGGTG AGGAACTGAC GCTGCGACGC GCGATGGAGG ACTCGCTGGT GACCGGCCTG CGCGCGCGTG ACCGGCAGGG CCGCGTCACC TACGTCAATC CGGCCTTCTG CAAGATGGTG GGCTTCAGTG CCGAGCAGCT GGTCGGGCAG ACCACACCGC CCTACTGGCC GCCCGAGATG CTCAGCGCCT ACGAGGCGCG CCAGGTGCAG CGCATGGCGC AGGGCGTGCC GGCGCACGAG TCGCGCGACG GATTCGAGAC CACTTTCATG CGCGCTGGCG GTGAGCGCTT CCCGGTGCTG ATCTTCGAGG CGCCGCTGGT GGGTCCCGAC GGGCTGCAGA GCGGCTGGAT GAGTGCGGTG CTCGATCTCA GCGCGCAGCG CCGCGTCGAG GAGCGCGCGC GCCAGCAGCA GGAGCAGCTG CAGGCCACCG CCCGGCTTGC CAGCGTCGGC GAGATGGCCT CGCTGCTGAG CCACGAGCTC AATCAGCCGC TCGCGGCCAT TGCCAGTTAC GCGACCGGCT CGCTGAACCT GATCGACGAT GCCGGCGAGC GACCCGATCC GCCGAGCTTG GCGATGATCC GCGAGGCCAC GCAGCACATC GCCGAGCAGG CCGAGCGCGC CGGCCGCGTC ATCCGCAGCG TGCACGACTT CGTGCGTCGC CGCGAGCAGA GCCGCGAGAG CCTGAGCTGC GACCAGCTCG TGGAGGCGGT GATGCCGCTG CTGCGCCTGC AGGCGCGCAA GTGCGGCGCG CGCATCGAGT TCGAGTTCGG CAGCCCGCCA CCGCGTGTCG TGTGCGACCG CACGATGGTC GAGCAGGTGC TGCTCAACCT GACCCGTAAC GCGCTGCAGG CAATGGTGGG CGAGCCGGCG GAGCGCCGCG TGGTGCGGCT CGGCGCCCGC ACCCACGGCG CCTGGGTGCG GCTGAGCGTG ACCGATCACG GCCCCGGCAT CGAGCCCGAG GTCGCAGCCC GGCTGTTCAC GCCCTTCTTC ACCACCAAGG CCGAAGGGAT GGGGCTGGGC CTGAGCCTGT GCCGGACGGT CGTCGAGCAG CACGGCGGGG CGCTCGACTT CACGACGCTG CGCGACGGCG AGGGCCGCGT GCGCGGCACC TGTTTCGAGT TCACACTTCC GGCTGCCGGG CCCGCTTCCT CCCCGGCGGC AGCCCCTCCG GGCGTTCTCG ATGGAGTCTC TGCATGA
|
Protein sequence | MRETAYLFRL RALDTEAMAD VDAPVVPTPL ASRWRRLLPW ALLVLLLGVA QTLLLALTVQ HENNREQERV EMAAASAAAN VRQLLLRDVQ SLQALLWNDP QPAQWRSDAA DLLRARREML RIEWRSGANV IETAADSPYG GPLFVSLARS DLDLDTQAAC ADAQRLAAPA WSRSYFVPQP GGLGVEVIDV CLPIGPNGER GAMVATLALG GVIDEAVAPE LARSHELSFV EGDGTRLARG GLTRGAGIYV TERLVNLSSL TLALRLDSAA GAPRLIPNVA TSLVLGLSLA LAAVVALLAR DVRRRAAAEQ RLGEELTLRR AMEDSLVTGL RARDRQGRVT YVNPAFCKMV GFSAEQLVGQ TTPPYWPPEM LSAYEARQVQ RMAQGVPAHE SRDGFETTFM RAGGERFPVL IFEAPLVGPD GLQSGWMSAV LDLSAQRRVE ERARQQQEQL QATARLASVG EMASLLSHEL NQPLAAIASY ATGSLNLIDD AGERPDPPSL AMIREATQHI AEQAERAGRV IRSVHDFVRR REQSRESLSC DQLVEAVMPL LRLQARKCGA RIEFEFGSPP PRVVCDRTMV EQVLLNLTRN ALQAMVGEPA ERRVVRLGAR THGAWVRLSV TDHGPGIEPE VAARLFTPFF TTKAEGMGLG LSLCRTVVEQ HGGALDFTTL RDGEGRVRGT CFEFTLPAAG PASSPAAAPP GVLDGVSA
|
| |