Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rru_A0521 |
Symbol | |
ID | 3834489 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | + |
Start bp | 614264 |
End bp | 617092 |
Gene Length | 2829 bp |
Protein Length | 942 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637824605 |
Product | CheA Signal transduction histidine Kinases (STHK) |
Protein accession | YP_425612 |
Protein GI | 83591860 |
COG category | [K] Transcription [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0643] Chemotaxis protein histidine kinase and related kinases [COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.58429 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGATC TGCTCAGCGA GTTCCTCACG GAAACCTCGG AAAGCATTGC CACGCTTGAC GTGGAACTCG TGAATTTGGA GCGGAATCCA AACGACAAGG GTATCCTTTC GAACATTTTC CGTCTGGTCC ACACCATCAA GGGAACCTGT GGTTTCCTTG GGCTGCCCCG CCTTGAAAGC TTGGCCCATG CCGGCGAGAA CGTGCTAGGC AAGTTCCGTG ACGGCGATCT TGAAGTGACC CCGGCGGCGG TGACGGTGAT TTTGCACACC ATCGACCGCA TCAAGGAAAT CCTGGGTCAT CTGGAAGCCA ATGAGGTCGA GCCCGAGGGC GACGACAAGG ATTTGAAAGC CCATCTGAAC GCCTTCGCCG AGGGCAAGCA GCCGAGCGTG GCGGCGATGG CCCCGGCCCC GGCCCGCGCC GCGGGCTCGG GTCCGGCGAT TTCGGAAGGC GGCTATCCGG TCGCCGCCGA GCTGCTGGCC GAGGTGAGCG AGGCCGTCGC CAAGGGCAAG CGCGCCGCGA CCGAGGCCGA GTTGGCCGCC GAGCTGGCGG CCGAACTGGC GGCGGCGGAG AAAGCCGAAC AGGCGGTCGC CGCGCCCGTT CCCGAAGTCA TTCCCGAGGT CAAGGCCGCG ACTCCGGTGG TCCAGCCGGC CAAGCCGCCG GCGGTGACCG CCGCCCATGA CACCAACGGC CCCAACGGCG GTGGTGGCGG CGAGCAGAAG GAAGGCTCGG TCGCCTCGCA GTCGATCCGC GTCAATGTGG AACTGCTTGA AAACCTGATG ACCCTGGTGT CTGAACTGGT GCTGACGCGC AACCAGTTGC TCCAGATGGT ACGCGGCAGC GACGATTCGG AATTCGTCGC GCCGCTCCAG CGGCTTTCCC ATATCACCAC CGACCTTCAG GAAGGGGTGA TGAAAACCCG CATGCAGCCG ATCGGCAACG CCTGGGCCAA GCTGCCGCGC ATCGTGCGCG ATCTGTCGAT CGAAATGCAC AAGAAGATCG ATCTGCAGAT GTACGGGGCC GATACGGAAC TTGACCGTCA GGTTCTCGAG ATGATCAAGG ACCCGCTGAC CCACATGGTG CGCAATTCCG GCGATCACGG CCTGGAATTC CCCGACGAGC GGGCGGCGGC GGGCAAGCCC GAGACCGGCG TCATCAAGCT CAACGCCTAT CACGAGGGCG GCCACATCAT CATCGAGATT AGCGATGATG GGCGCGGCCT CAATCTTGAA CGCATCCGCG CCAAGGCGCT CTCCAACGGC CTCGCCACCG AGGCCGAACT GGAGAATATG ACCGATCAGC AGATCGCCCA GTACATCTTC CGCGCCGGGC TTTCCACCGC CGAAAAGGTC ACGGCCGTAT CGGGCCGCGG CGTCGGCATG GACGTGGTCA AGACCAATAT CGAAAAGATC GGCGGCACGG TCGAGCTGAA GACTTGGCCG GGCAAGGGCT CGCGCTTCGT CATCAAGATT CCGCTGACCC TGGCCATCGT CTCCGCCCTG ATCGTCGAGG CCTCGGGCGA GCGCTTCGCC ATCCCGCAGA TCTCGGTGCT TGAACTGGTG CGGGTGACGG CCAATTCGGA AACCACCATC GAGCAGATCA ACACCGCGCC GGTCTTGCGC CTGCGCGATC GCCTGATGCC GCTGGTGTCG CTTTCGGCCC TGTTGCGCCT GGATGACGGC GATGACGAGG AACTGGGCAA GACCGCCGAC GCCACGGCCG GACGGCGCGA TGAAACCTTC ATCGTCGTCA GTCAGGTTGG CACCTATACC TTCGGCATCA TCGTCGACCG CGTCTTCGAC ACCGAGGAAA TCGTCGTCAA GCCGGTCGCT CCGATCCTGC GCCATGTGTC GATGTTCTCA GGCAACACCA TCCTGGGCGA CGGCAGCGTG ATCATGATCC TTGATCCCAA CGGCATCGCC AGCGCCACCG GCGAGGTGAC CATGGGGTCG GCGTCGGGGA CCACCGAAGC CGCCCAGTCC CACGAGTTCG TCGGCGAGGA TCGCACCTCG CTGCTGGTCT TCCGCGCCGG GGGCAAGGAT CTCAAGGCCG TGCCGCTGGC CCTGGTCGCC CGCCTCGAGG AAATCGAGAC CGACAAGATC GAGCATTCCT TCGGCAAGCC GGTGGTTCAG TACCGGGGCC AGTTGATGCC GCTGGTCGGC ATCCACGATG AATTCACCCT TGCCGGCGAA GGCCGCCAGC CGGTTCTGGT CTTCTCGGAC CGTGACCGCA CCATGGGTCT GGTCGTCGAT GAAATCGTTG ATATCGTCGA AGACCACTTG AAGGTGGAAT TGCGCGCCGA TCTGCGCGGC GTCGTCGGAA CGGCGGTGGT CAACGGCAAG GCCACCGATA TCATCGATAC CGGGTATTAT CTGACCAAGG CCTTCGGCGA TTGGTTCGGC ACGATGAAGA GCGATGCCTT CGGCGAGCAA AAGAGCGCGA TCCGGGTTCT TCTGGTCGAC GACAGCCCGT TCTTCCGCAA TCTGCTGACG CCGCTGCTGT CGGTTTCGGG CTATGCGGTG ACCGCCGTCG AATCCGCTGA AAAGGCCCTG GAACTGCGTG AAAAGGGCCA TTCCTTCGAA GCCATCATCA GCGATATCGA GATGGCCGGC ATGGATGGCT TCTCCTTCGC CGCCGCCATC CGCGCCGATG GCCGGTGGGG CAATCTGCCG CTGATCGCCC TGTCGAGCCA CGCCACCGAG CGCGATCTTC AGCGCGGCCG GGAAGCCGGC TTCGATGACT ATGTCGCCAA GTTCGACCGG GACAGCCTGC TTGAGGTCCT CGGGCAACTG GTTGGCGGAC AGCCCGCTCT GGTGGCCCAG GAGGGCTGA
|
Protein sequence | MDDLLSEFLT ETSESIATLD VELVNLERNP NDKGILSNIF RLVHTIKGTC GFLGLPRLES LAHAGENVLG KFRDGDLEVT PAAVTVILHT IDRIKEILGH LEANEVEPEG DDKDLKAHLN AFAEGKQPSV AAMAPAPARA AGSGPAISEG GYPVAAELLA EVSEAVAKGK RAATEAELAA ELAAELAAAE KAEQAVAAPV PEVIPEVKAA TPVVQPAKPP AVTAAHDTNG PNGGGGGEQK EGSVASQSIR VNVELLENLM TLVSELVLTR NQLLQMVRGS DDSEFVAPLQ RLSHITTDLQ EGVMKTRMQP IGNAWAKLPR IVRDLSIEMH KKIDLQMYGA DTELDRQVLE MIKDPLTHMV RNSGDHGLEF PDERAAAGKP ETGVIKLNAY HEGGHIIIEI SDDGRGLNLE RIRAKALSNG LATEAELENM TDQQIAQYIF RAGLSTAEKV TAVSGRGVGM DVVKTNIEKI GGTVELKTWP GKGSRFVIKI PLTLAIVSAL IVEASGERFA IPQISVLELV RVTANSETTI EQINTAPVLR LRDRLMPLVS LSALLRLDDG DDEELGKTAD ATAGRRDETF IVVSQVGTYT FGIIVDRVFD TEEIVVKPVA PILRHVSMFS GNTILGDGSV IMILDPNGIA SATGEVTMGS ASGTTEAAQS HEFVGEDRTS LLVFRAGGKD LKAVPLALVA RLEEIETDKI EHSFGKPVVQ YRGQLMPLVG IHDEFTLAGE GRQPVLVFSD RDRTMGLVVD EIVDIVEDHL KVELRADLRG VVGTAVVNGK ATDIIDTGYY LTKAFGDWFG TMKSDAFGEQ KSAIRVLLVD DSPFFRNLLT PLLSVSGYAV TAVESAEKAL ELREKGHSFE AIISDIEMAG MDGFSFAAAI RADGRWGNLP LIALSSHATE RDLQRGREAG FDDYVAKFDR DSLLEVLGQL VGGQPALVAQ EG
|
| |