Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_1872 |
Symbol | |
ID | 4022354 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 2096442 |
End bp | 2099252 |
Gene Length | 2811 bp |
Protein Length | 936 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637962065 |
Product | sensor histidine kinase |
Protein accession | YP_569008 |
Protein GI | 91976349 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG5000] Signal transduction histidine kinase involved in nitrogen fixation and metabolism regulation |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.291512 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.670925 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCGAAACG GATTGAATTT ATCGACGCGG CTTACGATTG CGATCGTCCC GCTCGTGGTG CTGACCGCCG CGAGCGTCGG CTATCTCGGC TATCGGAACC TCGCGACTGT TGCGATCGGG CGCACACTGG CGGCGATCGA TACCACTGCG ACCTCGCGGG CAATCGAGCT CGCGAGCCTG GTCAAGAACG TTCGTGCCGA CGTGACGGCT TTTCGCGCTG CCATCGGCCT CGGCGAAATG ACAACACTCA GTCGCAATCC TTCGCTGCAG ACGACCCGCG GCTGGACGCT AGCAGAATGG AGCGCCGGGG TCGGACAACG GCTCGCTGGC GAACTCGAAG CCAAGCCCGA TCTCCTCAAG TACCGCCTGA TCGGATTGGC GGATGGAGGC CGCGAACTCG TTCGGGTCGA ACGTCAGCGC AACGGCGCCG TTCGCGTCGT TCCGGACGAA GAGTTGCAGC GCGCCAGCGA GCGCGATTTA TTTGCGCAGG CGATCAACGT TGCCGAGGGA GAGAGCATCG TCTCGCCGGT CGAACTCGAT CAGGATCATG GCGCGACGAC AAAGCCACAT GTGCCGGTGA TCCGCGTCTC GGCGCCGGTG TTTGGGCCGG ACGGAACGCA GTTCGGTTTG ATCGTCGCCC ATATCGATCT GCGCGCCGCC TTCGACCGGG TGACCGGTCT CGCGCAGCAG GGGCGGGTCG TCTACGTGAT CAACAACAAC GGCGACTATC TGCTCCATCC CGACAAGACG CGCGAGTTCG GCTTCGAACT GGGCAGGCCG GCCCGCATCC AGGACGACTT CCCCGCTCTC GTTGAAGCGA TTGCCAAGAA CCGGGAACGA ACGTCGATTG TCGAGGATCG CAACGGAACG CCGTTCGGCG TGGCGCTCGA ACATGCCGAT GGCGTGGCAC TGTCGATCGT CGAGACCATC CCACAGCGGA TCATTCTCGA CGCGATCATG ACGGCTTGGC TGAGTTCCAC CTTACTCGGC TGCGCTTTCG CGGTGCTGAT CGCGATCGGG CTCGCGCTGG TCATGGCCCG CACCATGACC AGACCGCTGT CGCAGATGAC CGCCGCGGTC TCGTCCTTCG CAGATGATCG GCCGATCGAC CTACCGCTCG ACACCGGCGG CGAAATCGGT GTCCTGGCGC GTGCTTTTCA GAAAATGGCG CTCGATTCGC GTGACAAAAC CGCGGCGATC CGCCGCGAGA AGGAGATTTT CGAGCGGATC ATGAACGCGA TGGCCGAGGC GGTCCTGCTG ATCGACAGGA AGGGACAGGT CATCTACGAG AGTCCCGGCG CAGTGAAGCT GAGGTCGCCG ACGCCCGGCC GCCCGGTGCG GCCCTGGGCA GAGGCGATCG ACTCCTTCCT CGAAGACGGG GTGACGCCGC TGCCTCCGGA TCGGCGCCCA GGCCAGCGCG CTTTGCAAGG CGAAACCGTC GACCAGATCG AACTGGTCCT GCACGTTCGC GACGCCGGCC GCAACGTCGA GGTCATCGGC AACGCCCAGC CTATCCGGAA CGCCGCAGGC CGGATCAACG GCGCGGTCGT CGTTTACAAG GACGTCACCG AATTGAAGGA AGCCGAGCGC CGACTGCATC AGGCGCAGAA GCTGGAGGCG ATCGGTCAGC TCACCGGCGG CGTCGCGCAC GACTTCAACA ACATGCTGAC CGTCATCAGC GGGACCGCAG AGATCCTGAT TGAAGAACTC ACCGACCGAC CGAACCTCAG CAACATCGCC AAGATGATCG AACAAGCTGC AGAGCGTGGC GCCGACCTCA CGCGGCAGTT GCTCGCCTTC GCTCGCAAAC AGCCGCTGCA GCCGCGCAAT GTCGACGTCA ACGCGATCGT TCTCGAAACC GAGCAATTGC TGCGGGCGAC CATCGGCGAA CACATCCAGG TCGACGTCCG GCTGGAGCAG GACGTTGATG CGGCGCGGAT CGACCCGTCG CAGCTCTCAT CGGCACTGCT CAACCTCGCG GTGAATGCGC GCGACGCGAT GCCGATCAGC GGCAAGCTGC TGCTCGAAAC CGGCGGCGTG GTGCTCGACA ACGACTACGC CCAGCAAAAT CCAGACGTGC GCCCCGGGCG CTACGTGATG ATCGCGGTCA GCGACACCGG CACCGGCATT CCCGCGGAGA TGCGCGATAA GGTGTTCGAG CCGTTCTTCA CCACGAAGAG CCTTGGCAAC GGCACCGGCC TCGGGCTCAG CATCGTGTAC GGTTTCGTCA AACAGTCGGG CGGCCACGTC AAGATCTACA GCGAGGAAAA TCAGGGCACC ACGATCAAGC TGTATCTTCC GCGCACCGAC GCCGACATCG ACGGCGCGCC GATCGCAGCG CCCGTTGTGG GCGGTAGCGA GACCATCTTG CTGGTGGAGG ATGACGAACT GGTCCGCAAA TTCGCGATCG CCCAGCTCCA GGGCCTCGGC TATCGCACCA TCGCGGTATG CGACGGTCCC TCGGCTCTGA AAGAGGTCGA GCGCGGCGCC GCGTTCGATT TGCTGTTCAC CGACGTGATC ATGCCCGGCG GGCTGAATGG CCCGCAACTC GCCGAAGCGG TCGCGCGGAT CAGGCCGGTC CGGGTGCTGT TCACGTCCGG CTACACCGAG AACGCGATCT TGCATCATGG CCGGCTCGAT CCCGGTGCGC TGCTGTTGAG CAAGCCGTAT CGCCGGTCGG ATCTGGCGCG GATGGTGCGC GCCGCTCTCG ATCAGGAATA CTACGTTCCC GCCGAGCCGT CGGCCTGCGC GGTCGCGGCT AAATCACCCG ACCAGCTTTG GCCCAATACG GATCGCGTAG CCGGCGCTTG A
|
Protein sequence | MRNGLNLSTR LTIAIVPLVV LTAASVGYLG YRNLATVAIG RTLAAIDTTA TSRAIELASL VKNVRADVTA FRAAIGLGEM TTLSRNPSLQ TTRGWTLAEW SAGVGQRLAG ELEAKPDLLK YRLIGLADGG RELVRVERQR NGAVRVVPDE ELQRASERDL FAQAINVAEG ESIVSPVELD QDHGATTKPH VPVIRVSAPV FGPDGTQFGL IVAHIDLRAA FDRVTGLAQQ GRVVYVINNN GDYLLHPDKT REFGFELGRP ARIQDDFPAL VEAIAKNRER TSIVEDRNGT PFGVALEHAD GVALSIVETI PQRIILDAIM TAWLSSTLLG CAFAVLIAIG LALVMARTMT RPLSQMTAAV SSFADDRPID LPLDTGGEIG VLARAFQKMA LDSRDKTAAI RREKEIFERI MNAMAEAVLL IDRKGQVIYE SPGAVKLRSP TPGRPVRPWA EAIDSFLEDG VTPLPPDRRP GQRALQGETV DQIELVLHVR DAGRNVEVIG NAQPIRNAAG RINGAVVVYK DVTELKEAER RLHQAQKLEA IGQLTGGVAH DFNNMLTVIS GTAEILIEEL TDRPNLSNIA KMIEQAAERG ADLTRQLLAF ARKQPLQPRN VDVNAIVLET EQLLRATIGE HIQVDVRLEQ DVDAARIDPS QLSSALLNLA VNARDAMPIS GKLLLETGGV VLDNDYAQQN PDVRPGRYVM IAVSDTGTGI PAEMRDKVFE PFFTTKSLGN GTGLGLSIVY GFVKQSGGHV KIYSEENQGT TIKLYLPRTD ADIDGAPIAA PVVGGSETIL LVEDDELVRK FAIAQLQGLG YRTIAVCDGP SALKEVERGA AFDLLFTDVI MPGGLNGPQL AEAVARIRPV RVLFTSGYTE NAILHHGRLD PGALLLSKPY RRSDLARMVR AALDQEYYVP AEPSACAVAA KSPDQLWPNT DRVAGA
|
| |