Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_4823 |
Symbol | |
ID | 3973527 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 5381441 |
End bp | 5384950 |
Gene Length | 3510 bp |
Protein Length | 1169 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637927935 |
Product | periplasmic sensor hybrid histidine kinase |
Protein accession | YP_534664 |
Protein GI | 90426294 |
COG category | [E] Amino acid transport and metabolism [K] Transcription [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG0591] Na+/proline symporter [COG0642] Signal transduction histidine kinase [COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGCACG ATTGGGGCGT GATCGCGGCC GCGTTCGGCT ATATCGGCTT TCTGTTCCTG GTCGCCAGCT ACGGCGATCG CCTCTCGCAG GCGCAACGCC AGCGGTTCGG CACCCTGATC TACCCGCTGT CGCTGGCGAT CTACTGCACC TCGTGGACGT TCTTCGGCTC GGTCGGCGTC GCCAGCCGCT TTAGCGTCGA TTTCCTGGCG ATCTATATCG GCCCGATCCT GATGGTGGCG TTCGGCGCGC CGCTGCTGCG CCGGGTGATC GATCTCGCCA AATCTCAGAA CATCACCTCG ATCGCCGATT TCATCGCCGC GCGCTACGGC AAGAGCCAGG CGGTGGCGGC CACCGTGGCA ATCATCGCCA TCGTCGGCTC GGTGCCCTAT ATCGCGCTGC AGCTGAAAGC GGTGGCGTCC TCGCTGCAGA CCATCCTGGG CGACGACCAA GTGATGTCGA GCATGCCGGT CGCCGGCGAC ATCGCGCTGA TCGTGGCGCT GACCATGGCG CTGTTCGCGG TGCTGTTCGG CACCAGGCAG ACTAACGCCA CCGAACATCA GCACGGCTTG ATGCTCGCGG TTGCCACCGA ATCCATCGTC AAGCTGGTTG CTTTCATTGC CGCGGGTGCC TTCGTCACCT TCTGGATGTT CAGCCCGCAC GAGCTGATCG AACGGGCGAT GAAGACCCCG GAGGCGGTCC GCGCCATCGA CTACGTGCCG TCCAGCGGCA ATTTCCTCAC CATGGTGCTG CTGTCGTTCT GCGCCATCAT GCTGCTGCCG CGGCAATTTC ACGTCAGCGT GGTGGAGAAT TCCAGTAGCG CCGAGGTCAG CCGCGCCCGC TGGCTGTTCC CGCTGTATCT GATCGCGATC AATCTGTTCG TGATTCCGAT CGCGCTGGCC GGATTGATCA GTTTCCCGTT CGGCGCGGTC GACAGCGACA TGTACGTGCT GGCGTTGCCG CTGGAGGCCA ACGCGCCCTA TCTCAGCGTC GCGGTGTTCA TCGGCGGGCT GTCGGCGGCG ACCGCGATGG TGATCGTCGA ATGCGTGGCG TTGGCGGTGA TGGTGTCCAA CGACCTGGTG ATGCCGCTGG TGCTGAAACG CGGCGTCACC CCGCGCGACG ACGAGCGCAG CTACGGTTCG TTCCTGCTCA CGGTCCGCCG GGTGGCGATC TTCGCCATCC TGATCATGGC CTATCTGTAT TTCCGCGCGC TCGGCAATAC CCAGCTGGCG GCGATCGGGC TGTTGTCGTT CGCCGCCATC GCGCAGTTCG CGCCGGCGTT TTTCGGCGGG TTGATCTGGC GCCGGGCGAC CGCGCGCGGC GCCATCGGCG GCATGCTGAT CGGCTTCGCG GTCTGGGCCT ATACGCTGTT CATGCCGAGC TTCCTCGACG GCAACACCGC AGGCGTGCTG TTGCTGCAGC ACGGCCCATT CGGCATCGAA GCGCTGCGGC CGCGGGCGCT GTTCGGCGCC GATCTGCCGC CGCTGCTGCA CGGCGTGCTG TGGAGCCTGT CGCTCAACAT CCTGGCCTAT GTGGTGCTGT CGCTGTGGAA GCAGCCGTCC TCGATCGAAC GGCTGCAAGC CGCGGTGTTC GTGCCGGCGG CGCTGAAGCC GATGGCGCCG GCGTTTCGGC GCTGGCGCAC GACGGTGACG GTGCAGGACA TTCTCGGCGC GGTCGCGCAA TATCTCGGGC CGGAGCGGGC CCGCGAAGCG TTTCGAACTT TCGCCGCCAG CCGCCGGATC AACATCGACC CGGCGGCGCC GGCGGATTTC GAATTGCTGC AGCATGCCGA GCACCTGATC GCATCGTCGA TCGGCGCGGC GTCGTCGCGG CTGGTGCTGT CGCTGTTGTT GCGCAAGCGC ACGGTGTCGG CGGAAGCGGC GCTGAAGCTG CTCGACGATT CCCACGCCGC GCTGCATTTC AACCGCGAGA TCCTGCAGAC CGCGCTCAAC CACGTCCGCC AGGGCATCGC GGTGTTCAAT CCGGATTTGC AGCTGATCTG CTCGAACCGG CAGTTCGGCG ACATCCTCGG GCTGCCGCCG CATATCGTGC AGATCGGCAT TCCGCTCACC GAGATCCTGG AATTCCTCTC GCTCAGCGCG CCCGGCTTCG GCGACAGCGA CATGCAGACG CAAATCCGGC TGGCCGCCTA TACCACCGAG GGCGCGCCCT ATATCGAGCG GATGCAGGAT CGCCATCTGG TGATCGAGGT CCGCGCCAAC CGGATGCCCG ACGGCGGCCT GGTGATCACT TTCTCCGACG TCACCCCGAG CTTCGAGGCC GCCGAGGCGC TGGAGCGGGC CAACGCCACG CTGGAAAAGC GGGTGCGCGA CCGCACCGAG GAACTGACAA GGCTGAATTC GGCGCTGGCC TTGGCCAAGA GCACGGCGGA AGAGGCCAAT ATCTCCAAGA CCCGGTTCCT CGCCGCCGCC AGCCACGACA TCCTGCAGCC GCTCAATGCC GCGCGGCTCT ATGTCACCAG CCTGGTCGAA CGCCAGAGCG GCGGCGACGA CGCCAAGCTG GTGGAGAACA TCGACGAATC GCTGGAGGCG ATCGAGGAGA TCATCGGCGC GCTGCTGGAT ATCTCGCGGC TCGACGCCGG TGCGATGACC CCTTCGCTGT CCAGCTTCGT GATCGGCGAT CTGATGCGCT CGCTGGAAAT CGAATTCGCG CCGATCGCCC GGGCCAAGGG GCTGAAGCTG ACCTTCGTGC CGTGCCTGCT GCCGGTGGAG TCCGACCGGC TGCTGTTGCG CCGGCTGATG CAGAATCTGA TCTCCAACGC CATCAAATAC ACCCCGCAGG GCAAGGTGTT GGTGGGCTGC CGGCGTCGCG GCCAGTCGCT GCAGATCGGC ATCTACGATA CCGGCGTCGG CATCCCGATT CTCAAGCGCG GCGAAATCTT CAAGGAATTC CACAGGCTGG AGCAGGGCGC GCGGATCGCC CGCGGGCTGG GCCTTGGGCT CTCGATCGTC GAGCGGCTGG CGCGGGTGCT GAACCACGGC ATCGCGATCG ATTCCAACAA GAGCGGCGGC TCGTTCTTCT CGGTCACGGT GCAGATCGCC AACATGGTCA ATCACACCGC CGCGGTGACC AGCGCCACGC CGCTGTCGCG GGCGTCGATG GCCGGGATCC TGGTGGCCTG TATCGAAAAT GATCCGGCGA TCCTCGACGG CATGAAGACG CTGCTCACCA CCTGGGGCGC AAGCGTCATC GCGGTGGCCG ATCCGGAAGC CGCCATCAAG GCGATCGAGT CCGCCGGGCG GCCGGTCACC GGTCTTCTGG TCGATTATCA TCTCGATCGT GGCAACGGCA TCGCCGGGAT CCGGGATATC CGCCACCGCT TCGGCGAGGA GATTCCGGCG ATCCTGATCA CCGCCGATCG CAGCCCACGG GTCCGGGCGG CGGCGCGCGA CGATAATATC TCGGTGCTGA ACAAGCCGGT GAAGGCGGCG TCGCTGCGCG CGCTGCTCGG GCAATGGCGC GCGCAGCAGA ACGTTGCCGC CGCGGAATAG
|
Protein sequence | MLHDWGVIAA AFGYIGFLFL VASYGDRLSQ AQRQRFGTLI YPLSLAIYCT SWTFFGSVGV ASRFSVDFLA IYIGPILMVA FGAPLLRRVI DLAKSQNITS IADFIAARYG KSQAVAATVA IIAIVGSVPY IALQLKAVAS SLQTILGDDQ VMSSMPVAGD IALIVALTMA LFAVLFGTRQ TNATEHQHGL MLAVATESIV KLVAFIAAGA FVTFWMFSPH ELIERAMKTP EAVRAIDYVP SSGNFLTMVL LSFCAIMLLP RQFHVSVVEN SSSAEVSRAR WLFPLYLIAI NLFVIPIALA GLISFPFGAV DSDMYVLALP LEANAPYLSV AVFIGGLSAA TAMVIVECVA LAVMVSNDLV MPLVLKRGVT PRDDERSYGS FLLTVRRVAI FAILIMAYLY FRALGNTQLA AIGLLSFAAI AQFAPAFFGG LIWRRATARG AIGGMLIGFA VWAYTLFMPS FLDGNTAGVL LLQHGPFGIE ALRPRALFGA DLPPLLHGVL WSLSLNILAY VVLSLWKQPS SIERLQAAVF VPAALKPMAP AFRRWRTTVT VQDILGAVAQ YLGPERAREA FRTFAASRRI NIDPAAPADF ELLQHAEHLI ASSIGAASSR LVLSLLLRKR TVSAEAALKL LDDSHAALHF NREILQTALN HVRQGIAVFN PDLQLICSNR QFGDILGLPP HIVQIGIPLT EILEFLSLSA PGFGDSDMQT QIRLAAYTTE GAPYIERMQD RHLVIEVRAN RMPDGGLVIT FSDVTPSFEA AEALERANAT LEKRVRDRTE ELTRLNSALA LAKSTAEEAN ISKTRFLAAA SHDILQPLNA ARLYVTSLVE RQSGGDDAKL VENIDESLEA IEEIIGALLD ISRLDAGAMT PSLSSFVIGD LMRSLEIEFA PIARAKGLKL TFVPCLLPVE SDRLLLRRLM QNLISNAIKY TPQGKVLVGC RRRGQSLQIG IYDTGVGIPI LKRGEIFKEF HRLEQGARIA RGLGLGLSIV ERLARVLNHG IAIDSNKSGG SFFSVTVQIA NMVNHTAAVT SATPLSRASM AGILVACIEN DPAILDGMKT LLTTWGASVI AVADPEAAIK AIESAGRPVT GLLVDYHLDR GNGIAGIRDI RHRFGEEIPA ILITADRSPR VRAAARDDNI SVLNKPVKAA SLRALLGQWR AQQNVAAAE
|
| |