Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3436 |
Symbol | |
ID | 3911238 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 3935569 |
End bp | 3937884 |
Gene Length | 2316 bp |
Protein Length | 771 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637885339 |
Product | histidine kinase |
Protein accession | YP_487043 |
Protein GI | 86750547 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.416045 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0284383 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCGCG TGCAGGGCGC GAGCGCCTGC GCTCAATCCG ATTCGATCAA AGGATTGGCG CAGTCGATCG CTAAACCGGC CTACCATCGA CTGTTGACGG CCGAGCCTGT GCTCCGCCGC GCCGTCCCGA TCCTCATCAT CGCCTTCCTC ATCACGATCT GCTTCGGCGC CGCCGTGCAG TTCATCGATC AGAGCCGGCA GAAGCGCGCT GCGCTCAATC GCGATCTCTG CGCCCTCGCC GATCTGCTCG CCGAGCGGCT CGAACATATC GGCGTCGTGC GGCTCGATCG TCCGGCTTCG ATCGAACGCC TGCAGAATTT GCTACCAGGC CTGATCCCGT CCTGGGGCAT CGCCGCCGGG CGCCACGTCA TCGTCACCAG CGCCGATCAG CGCGTCCTGG CGCGCGTACC CGGCGACGAA GGCACCAGCG CCGCCAGCAG TTTCCTCGAC ATCATCAGCG CCACCCCGCC GCTGACACGG TCCGCGCAGC AAGGCACCAT CACCCAGATC ATCCTGCCGA GCGGCGCCTC GGCGCTGGCG ACGTTGCACA CCGTCAAGTC GCTGCCCGGC CAAGTCATCG TCATTCACGA AGACACCGGG TCGATCCTGC GTTCCGATGC GGCGCTGCAG ATCACGCTGT CGGCTACCAC CGGCTTCGTC GTGCTGATCC TCGGCTTCGC CTTTCACTGG CAGTCGACCC GCGCCCAGGA GGGCGACCTC ATCAACGATG CCGTGCGCAG CCGGATCGAC ACCGCGCTCA ATCGCGGTCG CTGCGGGCTG TGGGACTGGG ACCTGTCGCG CGGCCGGATC TTCTGGTCGC AATCGATGTT CACTTTGCTC GGCCTGGAGA GTCGCAACGA CCTCCTGACC TTCGGCGAGG TCAACGCGCT GGTGAACAGC GACGACATCG ACCTGTTCGC GATCGGCGAC CAGTTGATCG CCGGCAACAC CGATCACATC GACCACAGCT TCCGGATGCG CCACGCCAAC GGCCACTGGA TCTGGCTGCG GATGCGCTGC GAACTCAGCC AGGAAACCCA ATCCGACAAC AAGCACCTGA TCGGCATCGC GGTCGACGTC ACCGAGCAGA AGAGCCTCGC CGAGCGCACC GTCGAAGCCG ATCTGCGGCT GCGCGACGCG ATCGAGACCA TCCCCGAGGC CTTCGTACTG TGGGACGCCG ACAATCGCCT GGTGCTGTGC AATTCGCACT TCCAGCGGCT GCACAAGCTG CCCGACATCG CGGTCGCGCC CGGCACCTCC TACGAGACGG TGATCGAGGT CGGACAGATG CCCGAGATCC GCACCCGTCT CTGCGACAAC AGCGGAGGAT CGTCCCCCGG CGCGCGGACC TTCGAGGCCC AGCTCGCCGA CGGCAGCTGG CTGCACATCA GCGAACGCCG CACCAAGGAC GGCGGCTACG TCGCGGTCGG CACCGACATC ACCCGGATCA AGGCTCACGA ACAGAAGCTC GTCGACAACG ACCTGCGGCT GCGCGCCACC GTCGCCGACC TCAAGATCTC GCAATACAAG CTGGAGCGTC AGGCGATCGA ACTCGCCGAT CTGGCGCGGA AATATTCCGA GGAGAAGACC CGCGCCGAGG AAGCCAATCA GACCAAATCC AAATTCCTCG CCAATATGAG CCACGAGCTG CGCACGCCGC TCAACGCCAT CATCGGTTTT TCCGAGATCA TGGGCAGCGG CATGTTCGGC ACGCTGGGCT CGGAGAAGTA TCAGGAATAC TGCCACGACA TCATGACCAG CGGGCACTAT CTGCTGGAAG TGATCAACGA CATTCTCGAC ATGTCGAAGA TCGAGGCAGG TCGGATGCGG CTCGACATGG AGGAGCTCGA TCTCGCTCGC ACCCTCGGCG AATCCCTCAA GGTCGTCGCC GGCCGCGCCG AGCACAAGCA GCTCGAACTG CTCTCGGAGA TCGAGGACGA CATTCCGATC GTGGCGGACC GCCGGGCGGT CAAGCAGATC CTGATCAATT TGTTGTCCAA CGCCGTGAAA TTCACCCCCG ACGGCGGACG GGTCACGGTC CGCAGCCGGA CGCTGCCGAA CGCCATCGTG TTGATGATCG CCGATTCCGG CATCGGCATC GCGCCGCAAT CGCTGCGCCG TCTCGGCCAG CCGTTCGAGC AGGTCGAGAG CCAGCTCAGC AAGACCTATC ATGGCTCCGG CCTGGGTCTG GCGATCGCCA AATCGCTCAC CACGCTGCAC GGCGGCTCGA TGCGGCTGCG CTCCACGCTC GGCGCCGGCA CGGTGGTGAT GGTGACGCTG CCGCGCGACT GCCAGCGCCG CCGGCTGGCG GCCTGA
|
Protein sequence | MARVQGASAC AQSDSIKGLA QSIAKPAYHR LLTAEPVLRR AVPILIIAFL ITICFGAAVQ FIDQSRQKRA ALNRDLCALA DLLAERLEHI GVVRLDRPAS IERLQNLLPG LIPSWGIAAG RHVIVTSADQ RVLARVPGDE GTSAASSFLD IISATPPLTR SAQQGTITQI ILPSGASALA TLHTVKSLPG QVIVIHEDTG SILRSDAALQ ITLSATTGFV VLILGFAFHW QSTRAQEGDL INDAVRSRID TALNRGRCGL WDWDLSRGRI FWSQSMFTLL GLESRNDLLT FGEVNALVNS DDIDLFAIGD QLIAGNTDHI DHSFRMRHAN GHWIWLRMRC ELSQETQSDN KHLIGIAVDV TEQKSLAERT VEADLRLRDA IETIPEAFVL WDADNRLVLC NSHFQRLHKL PDIAVAPGTS YETVIEVGQM PEIRTRLCDN SGGSSPGART FEAQLADGSW LHISERRTKD GGYVAVGTDI TRIKAHEQKL VDNDLRLRAT VADLKISQYK LERQAIELAD LARKYSEEKT RAEEANQTKS KFLANMSHEL RTPLNAIIGF SEIMGSGMFG TLGSEKYQEY CHDIMTSGHY LLEVINDILD MSKIEAGRMR LDMEELDLAR TLGESLKVVA GRAEHKQLEL LSEIEDDIPI VADRRAVKQI LINLLSNAVK FTPDGGRVTV RSRTLPNAIV LMIADSGIGI APQSLRRLGQ PFEQVESQLS KTYHGSGLGL AIAKSLTTLH GGSMRLRSTL GAGTVVMVTL PRDCQRRRLA A
|
| |