Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A0812 |
Symbol | |
ID | 4784496 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 852540 |
End bp | 855467 |
Gene Length | 2928 bp |
Protein Length | 975 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640089373 |
Product | PAS/PAC sensor hybrid histidine kinase |
Protein accession | YP_001020009 |
Protein GI | 124266005 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.817585 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCAATA TCAACCGCCA GCTTTTTGCC GCCTCTGATT CGCTCTGCGT TCGCGATGTC ACATTGCGCG CCACGGACGA TGTGCAGACT CACCGCGAGA AGCTCGCCCG TATCGTGCTC GACGAACTCT ACGAATTTGT CGGCCTGCTC GATGCCCACG GAACGACGCT CGAGATCAAT CGTGCAGCAC TGGAAGGTGC CGGCATCGCG CTCGACGACA TCCAAGGCAG ACCGTTCTGG GAGGCACGCT GGTGGGCAAC CTCTCCCGAG GTTCGGAGGG AGCAACGTGA GGTCATCCGC CGAGCTGGCG AAGGCGAGTT CGTTCGCCGT GATTTCGAGA TCTATGGGCA ACAAGGTGGT CAGGAGACGA TCCTGATCGA CTATTCCTTG CTGCCCATCC GAGACAACTC CGGGAAGATC GTGTTCCTGT TGCCAGAAGG CAGAAATATC ACAGACAAGA AGCGCGCCGA AGCGGAGATA GCGCGCAAGA ACCGCGAACT GCAAAGATTG CTCGACAAGA TCCAGCGTCT CGACGATGCC AAGAGCGACT TCTTCGCGAA CGTCAGCCAC GAACTGCGGA CTCCGCTCGC GCTGATACTC GGCCCGTCAG AGTCGCTACT CGCAACAAGC GAAGGGCTCA GCGACGCTCA GCGGCGCGAT CTTCGGGTCA TCCAGCGCAA TGCGGCGATG CTGATGAAGC ATGTCAACGA CCTTCTGGAT CTGGCGAAGT TCGATGCCGG AAAGATGGCG CTTCGTTATA CCCGCGTCGA CCTGGCTGCG GAGGTTCGCA CTCTTGCCGC GCACTTCGAG GCCGTCGCGG CGGAGCGCTC GTTGTCTTAC GTTGTTCAGG CTCCCGCGGC GCTGGAAGTC GAGGTCGACC AACAGATGTT CGAACGGATC CTGCTCAATC TGCTGTCGAA CGCATTCAAG TTCACACCGG ATTTCGGCCG TATCCGCTGC TCCCTCGAAG CCAACCCGGA CCACAGCATC CAGCTTGTCG TCGAGGACAG TGGATGCGGC GTCAGGGCCG ATCTGCGAGA GGAGATCTTC GAGCGATTTC ATCAGGCGCA GAGTGGAACC ACGCGAAGCT TCAGCGGGAC CGGTCTCGGC CTGGCCATTG CCAAGGAGTT CGTGGACCTT CACACCGGAA CGATCTCGGT TTCCGATGCC ATCGGCGGAG GTGCACAGTT TCGAGTCGAG CTCCCTTCGC GAGCGCCATT GGGCGCTTAC ATCAGGTCCG TCGACTCGCC CCTCGGGAAC CGAAATCGCG GACAGATCGT CGGAACCATA GAGGAGCTGC AGCGGGCCGA GTTCGATGCC GTCTCGGATC TGTCTGGATC AGAAAAGCCT CTCGTGCTGG TCGCCGAGGA CAACGCCGAC ATGAGGCGCT TTATCGTCGA GGTCCTCTCC AGCGATTTCA GGGTCGTACA CGCTGCAGAC GGCTTGCAAG CGCTCACCCA GGCCCGGGCG CAGGCGCCCG ATGCCATCAT CACAGATCTG ATGATGCCGA AGCTCGGCGG CGACAAGCTG GTGTCGGAGC TCCGGTCGAC TCCAGAACTC GCGCATATTC CCGTTCTCGT GTTGTCAGCC AAGGCGGATG AGTCCCTTCG CCTGAAGCTG CTCTCCGACT CCGTCCAGGA CTACATCGTC AAGCCATTTT CATCGCGCGA GTTGCTCGTT CGAGTACGAA ACATCGTCAC GATGAAGCTG GCCCGCGAGG CGCTGCAGAA GGAGCTGGCG TCGCAGAACG AGGATCTTGC TCAGTTGACG CAGCAACTCA TTGCGAGCAA GCAGGGGCTT CAACGGAGCC ACGATGCATT GAAGGAATCA GAGCGGCGCT GGCGGGCTGT CTACGAAAAC ACCGCCGTCG GCGTATCACT GAGCGACCTA CAGGGAAACA TGCACGCTGC AAATCCCGCG CTCCAGGAGA TGCTGGGATA CACCGAAAGT GAGCTGATCG GTCTCGGCAA CCTGATGACA GATGCCGAAG CAGGCCATGA GGATCGTCGC CTTCAACTCG AACGGCTCGT CAACGGCAGC CAGGTAGAGA TGCGACAGCA AAGGAGGTAC CGGCATCGCA ATGGCATGAC GATCCTGGCG AACGTTCGTG AATCGCTCAT TCCCGGCACC TCGGACCTGC CTCCCACATT GATCACGGTT GTCGAGGACA TCACAACGCA GAAGCGCGCG GAGGTGGAAC TCGCTCAGAC CAAGGATGCG CTCGCACGGG TTTCGCGGGT GACAACGATG GGTGAACTCG CAGCTTCGAT TGCGCATGAA GTCAACCAGC CCCTCACGGC CGTCGTTGTC AACGGCCACG CCTGCCTGCG CTGGCTCTCG ACGGAACCAC GAAACGATCT CGAAGTTCAG GATGCGATAC AGCGGATCGT TCGAGATGCC AATCGGGCCA GCGAGGTTAT CGCCAGGATC AGGGGTTTCC TCAAGCGGAG CAAGACAGAT CGAACCATGG TCTGCATGGA CAACGTCGTT GAAGATGTCA TAGGCCTGGC GCGTGATTCG CTCAGATCCG CTGGCGTCCA GTTGATCAAG CACGTCGACT CCGATCTGCC TCGCGTGTTC GCGGACAGCG TCCAGCTCCA GCAGGTGATC CTCAATCTGA TGATGAATGG CATCGAGGCC ATGGGCTCCT GCGCGACACT TGAGCGTCAG CTGGAGCTGC GCGTCGTGAA GCACGGTGGA GATATCGATG TTTCGGTCAG CGACTCAGGG ACAGGACTGG TGACTGCTGA TTTCGAGCGG ATATTCGAAG CGTTCTACAC CACCAAGCCC GACGGCATGG GTATGGGACT GGCGATTTGC CGATCCATCG TCGAGGCACA TGGTGGACGG TTGTGGGCCC AGGCGAACAA GACGCAAGGA TTGACGCTGC AGTTCCGTCT GCCGATCGCA GAGCACGCCG AACCATGA
|
Protein sequence | MPNINRQLFA ASDSLCVRDV TLRATDDVQT HREKLARIVL DELYEFVGLL DAHGTTLEIN RAALEGAGIA LDDIQGRPFW EARWWATSPE VRREQREVIR RAGEGEFVRR DFEIYGQQGG QETILIDYSL LPIRDNSGKI VFLLPEGRNI TDKKRAEAEI ARKNRELQRL LDKIQRLDDA KSDFFANVSH ELRTPLALIL GPSESLLATS EGLSDAQRRD LRVIQRNAAM LMKHVNDLLD LAKFDAGKMA LRYTRVDLAA EVRTLAAHFE AVAAERSLSY VVQAPAALEV EVDQQMFERI LLNLLSNAFK FTPDFGRIRC SLEANPDHSI QLVVEDSGCG VRADLREEIF ERFHQAQSGT TRSFSGTGLG LAIAKEFVDL HTGTISVSDA IGGGAQFRVE LPSRAPLGAY IRSVDSPLGN RNRGQIVGTI EELQRAEFDA VSDLSGSEKP LVLVAEDNAD MRRFIVEVLS SDFRVVHAAD GLQALTQARA QAPDAIITDL MMPKLGGDKL VSELRSTPEL AHIPVLVLSA KADESLRLKL LSDSVQDYIV KPFSSRELLV RVRNIVTMKL AREALQKELA SQNEDLAQLT QQLIASKQGL QRSHDALKES ERRWRAVYEN TAVGVSLSDL QGNMHAANPA LQEMLGYTES ELIGLGNLMT DAEAGHEDRR LQLERLVNGS QVEMRQQRRY RHRNGMTILA NVRESLIPGT SDLPPTLITV VEDITTQKRA EVELAQTKDA LARVSRVTTM GELAASIAHE VNQPLTAVVV NGHACLRWLS TEPRNDLEVQ DAIQRIVRDA NRASEVIARI RGFLKRSKTD RTMVCMDNVV EDVIGLARDS LRSAGVQLIK HVDSDLPRVF ADSVQLQQVI LNLMMNGIEA MGSCATLERQ LELRVVKHGG DIDVSVSDSG TGLVTADFER IFEAFYTTKP DGMGMGLAIC RSIVEAHGGR LWAQANKTQG LTLQFRLPIA EHAEP
|
| |