Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A2858 |
Symbol | |
ID | 4785552 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 3043379 |
End bp | 3046384 |
Gene Length | 3006 bp |
Protein Length | 1001 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640091429 |
Product | signal transduction histidine kinase-like protein |
Protein accession | YP_001022047 |
Protein GI | 124268043 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.920291 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAACG AGCACGTCGA CTCGCAGTTC GAGCGTGCCG CGGCCGAGGC GCGCGCCGCG GTCGATCGCC GGGTGCTGCG GCCGCTGCGC GATCTGGCGG TGGCGCGCAA GCTCGCGTTG ATCGTGTTCC TGCTGGTCGG CGTGGTCGTC GGCCTCGTCT ACCTCAGCAA GCTCAGCTCC GACATCCTGT CCGGCGTGCG CGCCTACGTG GCCGGGGACG GGCTGTGGGC CAAAGGCAAC CGCGACGCGG TGTTCTACCT GGTGCGCTAT GCACAGACCC ACGAGGAAGC TGATTACCGC CAGTACGCAG CGGCGTTGTC CGTGACGCTG GGTGACCGGC AGGCGCGACT GGAACTGGAG AAGCCCGATT TCGACTACGA CGTGGCGTAT GCGGGCTTCT TGGCAGGGCG CAACCACCCC GACGACATCG ACAGCCTGAT CTGGTTGTTC CGGACCTTCC GCAATCTGAG CTACATGGAC CGGGTGATCC AGCTCTGGAC CCGCGGCGAC GAGGAAATGG CGCTGCTGCG CAGCTACGGC GACACCATCC GCACGCAAGT GCTGAACGGC AGCCTGAGCC CGGCGCACGC TGCCCTCCTG ATCGGCGAGA TCGAGGTGCT CAATCGCCGC CTCGTGCCGA TCGAGGACGC CTTCTCCACC GTGCTGGGGG AGGCCAGCCG CTGGATCGCC CAGCTGTTGT TCGCTCTGAT GCTGGTGACG GCCTGCGGTT TGCTCACGCT CGGGCTGGTG GTCGCCGGCT ATCTGACGCG CCGCATCAAC CGTCAGATCG ACGGGCTGCG CGACGGGGCG TTGCGCATGG CGGACGACGA CTTCGAGCAA CCGGTGGAGA TCGTCTCCGA CGACGAACTG GGCCGGCTCG CCGCCACCTT CAACAGCATG CAGGCCCGGC TGCGGGAGCA TCGCAGTGCC ATCGAGGCCA GCGCCGCCGA GTTGCAGCAG GCCACGGCTG CGGCCCAGGC GCTGGCGCTG CAGGCCGAGA CGGCGAGCCA GGCGAAGAGC CAGTTCATGG CGACGATGAG CCACGAGATC CGCACGCCGA TGAACGGCGT GCTCGGCATG ACCGAACTCC TGCTGGGCAC CGCGCTCGAT TCGCGGCAGC GACGCTTTGC CCAGGCGGTG TACCGCTCCG GCGAAAGCCT GCTCGAGATC ATCAACGACA TCCTCGATTT CTCGAAGATC GAGGCCGGCA AGCTGGAACT GGCGCCGGCC GACTTCACGC CGCGAGCCCT GGTCGAGGAC GTGTTGGAAC TGTTGGCGCC TCGGGCGCAG GAGCGCGGGC TGGAGTTGAG CTTCCGGGAG GAGCCCGGCC TGCCGCCGGC ACTGCATGGC GATGCGCTGC GGCTGCGCCA GGTGCTCACG AACCTGGTCG CCAACGCGAT CAAATTCACC GAGCATGGCG AGGTGGTCGT CGAGATGCGC CGGGTCGAGC CGACCGCGGC AGAGACGGCC CTGGCCACCG GGGACCGGCT GTGGGTCGAG CTGTGCGTGC GAGATACCGG GATCGGCATT CCCCCCGAGG CCTTGTCACG GCTCTTCATC GCCTTCAGCC AGGCCAGCAG CGGCATGGCG CGACGCTACG GCGGCACCGG TCTGGGGCTG GCGATCTCGC GCCAGCTGGT CGAACTGATG TCCGGATCGA TCACGGTCCG CAGCCAGCCG GGGGTGGGCT CGCGGTTCTG CGTTCGGCTG CCCCTGTCGC CGGCCTCCAG CGACGTCGAT GTCGACATGC TCGAACTGCA TGACATGCCG GCCCTGCGCG TGCTCGTCGT CGATGACAAC GAGACCAACC GGACGGTGCT CGAAAACCTG CTCGGGGCCT GGGGCATGGA GGTCGTGGTG GCGAACGACG GCGTGCATGC GCTGGAGCGG CTGCATGCGG AGCGCGATGC CGCACGCAGC TTCGACATTG CGCTGATCGA CATGCAGATG CCGCGGCTCG ATGGCCTGCA ACTGGCCGAC CGGATCTCGG CCGAGCCGGA CTTCGCGGAC GTGAAGCTGA TCATGCTGTC GTCCGTGAGC TCGCCCGACG ACGCCAAGCG CGCGCAGGCC GTGGGCTTCA AGCGCTTCGT CAACAAGCCG GTGCGGAAGG CCGAACTGCG CCAGGCGATC CTCGGCGTGT CGGGTGTTGC CGGCGCCGGC GGCGGTTCGT CGCGCAAGAT CGGCGCCCAT ATCCTGGTGG TCGAGGACAA TCCCGTGAAC CAGGAAGTGA TCGGGCAGAT GCTGCGCCAC TTCGGTTGCC GCGTGCAGCT CGCCTCGTCG GCGCTCGAAG GGCTGCGTGC CTTGTGTGCC GAGCGCTTCG ACCTGATCAT GATGGACATC CAGATGCCGG GCATGGACGG TGTCGAGGCG CTCGGCTGGT TCCGCCGAGG ACCTGGCGAG CGCTTCGCCT TCCGCACTCC ACCCACCACG CCGGTGGTCG CGGTGACCGC CAACGCATTG GGGGGTGACC GCGAGCGATT CCTCGGCCTC GGATTCGATG AATACCTTTC CAAACCCTTC CGGCAAAGCC AGTTGCACAC CATGCTGTCA CAACGCCTGA ACATTCCCGA CACCGGGGCG GGAGAGCTGA CGCCGGCTCC GGCCGAGCCA GCGTTGGCTG GAGCTCCGGC GATCCCCCCT GCGGCGACGG CAGGAGCTCT GGATGCCCAG GCTCTGCAGC GGCTGCGCGA CCTCGATCCC ACCGGGGCGA ACCGGCTGCT GGAGCGCGTC GTGCAGGCGT TCGAGACGTC CACGGGGCGT TTGCTGCCGC AGCTCGACGA AGCGCATGCA GCGGGCGACC TCGACGGCGT GAAGCATGTC GCCCATACGC TGAAATCCTC GTCGGCCAGC ATCGGGGCGC TCAAGTTGTC GGCCCTGTGT GCCGACATCG AGGGCATGAT TCGCAACAAC GAGGTGCAGG CCCTGGGGCC GCGCGTCGCG GCGCTGCGCG CCGAGATCGC GTCGGTTCGT GGCAGCCTGC ACGCTCTGCT GCTGCCTGCC GCCTGA
|
Protein sequence | MSNEHVDSQF ERAAAEARAA VDRRVLRPLR DLAVARKLAL IVFLLVGVVV GLVYLSKLSS DILSGVRAYV AGDGLWAKGN RDAVFYLVRY AQTHEEADYR QYAAALSVTL GDRQARLELE KPDFDYDVAY AGFLAGRNHP DDIDSLIWLF RTFRNLSYMD RVIQLWTRGD EEMALLRSYG DTIRTQVLNG SLSPAHAALL IGEIEVLNRR LVPIEDAFST VLGEASRWIA QLLFALMLVT ACGLLTLGLV VAGYLTRRIN RQIDGLRDGA LRMADDDFEQ PVEIVSDDEL GRLAATFNSM QARLREHRSA IEASAAELQQ ATAAAQALAL QAETASQAKS QFMATMSHEI RTPMNGVLGM TELLLGTALD SRQRRFAQAV YRSGESLLEI INDILDFSKI EAGKLELAPA DFTPRALVED VLELLAPRAQ ERGLELSFRE EPGLPPALHG DALRLRQVLT NLVANAIKFT EHGEVVVEMR RVEPTAAETA LATGDRLWVE LCVRDTGIGI PPEALSRLFI AFSQASSGMA RRYGGTGLGL AISRQLVELM SGSITVRSQP GVGSRFCVRL PLSPASSDVD VDMLELHDMP ALRVLVVDDN ETNRTVLENL LGAWGMEVVV ANDGVHALER LHAERDAARS FDIALIDMQM PRLDGLQLAD RISAEPDFAD VKLIMLSSVS SPDDAKRAQA VGFKRFVNKP VRKAELRQAI LGVSGVAGAG GGSSRKIGAH ILVVEDNPVN QEVIGQMLRH FGCRVQLASS ALEGLRALCA ERFDLIMMDI QMPGMDGVEA LGWFRRGPGE RFAFRTPPTT PVVAVTANAL GGDRERFLGL GFDEYLSKPF RQSQLHTMLS QRLNIPDTGA GELTPAPAEP ALAGAPAIPP AATAGALDAQ ALQRLRDLDP TGANRLLERV VQAFETSTGR LLPQLDEAHA AGDLDGVKHV AHTLKSSSAS IGALKLSALC ADIEGMIRNN EVQALGPRVA ALRAEIASVR GSLHALLLPA A
|
| |