Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_1681 |
Symbol | |
ID | 3972741 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 1816067 |
End bp | 1818847 |
Gene Length | 2781 bp |
Protein Length | 926 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637924795 |
Product | multi-sensor hybrid histidine kinase |
Protein accession | YP_531560 |
Protein GI | 90423190 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [COG5000] Signal transduction histidine kinase involved in nitrogen fixation and metabolism regulation |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.270935 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCCAGG GATTGAGCCT TTCGACCCGC CTGACGATCG CCATCGTGGC GCTGGTCGTC GCCACCGCCG GCACGGTGGG CTATCTGAGC TATCGCAACA TCGCTGCCAT CGCGGTGCCG CGCGCCCTGG TGCGGCTCGA CGCCCATGCC CAATCGGTCG CCGTGGATCT CGCCAACATC GTCAAGAACG CTCGCGCCGA TGTGAAAGGG TTTCGCGACG TGATCGGACT CGAGGAGATC ATCGCCCTCA GTCGCGATCC GTCGCTCGCG AGATCGGGTG GCTTGACACC GGCGCAATGG CGGGCCCGGA TCGCACGGCG CTTCGCCGCC GAACTGGAGG CCAAACAGCA CTATGCGCAA TTCCGGATCG TTGGAATCGC CGATGGCGGC CGCGAGCTGA TCCGGGTCGA CCGTCGCTCG GGTGACGGCA AGGTGCGCAT CGTCGCCGAC GCCGACTTGC AGCAGAAAGC CGACCGCGGT TACTTCAAGC AGGCCGCCGC AGCGCCCGAC CGCAGCATCA TCGTGTCGCC GATCGAACTC AATCAGGAGA ACGGCGCGAT TGAACTCCCG CCGCAGCCGG TGGTGCGCAT CTCCTCGCCG ATCTTCGCAC CGGACGGCCA ATTGTTCGGA CTGCTGATCG TCAATATCGA TCTGCGGCCG GCCTTCGCGA CGCTCAAAGC GCGCGCCGCT CCCGACACCA CCATCTATGT GGTGAATGAG CGCGGCGACT ACCTGGCGCA TCCGGATTCG GAGCGCGAAT TCGGCTTCGA GTTCGGCACG CCGTTCCGCG TCCAGGACGA TTTTCCGGCG GCTGCGCGGG CCCTGGCCGA GGGGGATCCG CACCCCGGAC TGATCGAACA CCGCGGCGGC AAACGCTACG GCCTGGCGAT GGCTTCGCTG CGGCTCGGCG ATGGCCCGCC GGTGTCGGTG GTCGAGGTGA TCCCCGAGGA CAAGATCATC GCCCTCGCGC TGAAGGCGCT GCGCGACTCC AGCCTGCTCG GCGGCACCAT CGCGGTGCTC GGCGCGATGC TGCTGGGGTT CATTCTCGCC CGCACCCTGA CCCGGCAATT GACGCAGATG ACCGCGGCGG TCTCAGGCTT TGCCGATGGC AAACCGCTGG TGGTGCCGCT CAATGCCGGC GGCGAGATCG GCGTTCTGGC GCGGGCATTC CAGAACATGG CGCGAGAGGT CGACGAGAAG AACGCCGCGA TCCGGCGCGA AAAGGACATC TTCGAGGGCA TCATGTCGGC GATGGCCGAG GCCGTACTGC TGATCGATGC CGACGGCATC ATCGTCTATG CCAATCGCGC CAACCAGGAA CTGCTCGGGC CGATCGCGAC GGACGGCACC ACATGGCGCG AACTCTACGA CATCTATCTA CCCGATGGCA CCACGCTGCT GCCGAGGCAG CAATGGCCCT CGGCGCGATG CCTGCGCGGC GAACAGGTCG ACGGCTTCGA ACTGGTGTAC CGGCGTCGCG ACAGCGGCAA GACCGTGCAT GTGATGGGCA GTGCGCATCC GATCCGCGGC GCGGAAGGTG CGCGCGCCGG CGTGGTCGTG GTCTATCGCG ACGTCACCGC CACCAAGGAG ATCGAGCGCC AACTGCACCA GTCGCAGAAG CTCGACGCCA TCGGCCAGCT GACCGGCGGC GTCGCCCACG ACTTCAACAA CACGCTGACG GTGATCACCG GCACCGCCGA AATCCTGTTC GAGAGCCTCG CCGACCGGCC CAACCTGCAG CAGATCGCCA AGATGATCGA CGACGCCGCC GGCCGCGGCG CCGAGCTTAC CAAGCATCTG CTGGCCTTTG CCCGCCGGCA GCCGCTGCAG CCGCGCAATG TCGACGTCAA CACCTTGGTG CTCAACACCG CGCAATTGCT GCGGCCGACG CTGGGCGAGC AGATCGAGAT CGAATCGATG CTCGGCAACG ACGCCGAACC GGCGCATATC GATCCCTCGC AGCTGTCGAC CGCGCTGCTC AACCTTGCGG TCAACGCCCG CGACGCGATG CCGAACGGCG GCAAGCTGAC GCTGGAGACC GGCAACGTGG TGCTCGACGA GACTTACGCG CACGCCAACC CCGAGGTGAC GCCCGGGCCC TATGTGATGA TCGCGGTCAG CGACAGCGGC ACCGGCATCT CGGCGGCCAT CCTCGACAAG GTGTTCGAGC CGTTCTTCAC CACCAAGGAC GTCGGCAAGG GCACCGGGCT CGGGCTCAGC ATGGTGTACG GTTTCGTCAA GCAGTCCAAC GGACATATCA AGATCTACAG CGAGGAAGGC TACGGCACCA CGATCAAGCT GTATCTGCCG CGCGCCAGCG CCGACGCCGA AGAGCTCGCG CCCACCGCGC CGATCAAAGG CGGCAGTGAA ACCGTACTGG TGGTGGAAGA CGACGCCATG GTGCGCAATT TCGTCGTCAC CCAGCTGCGC AGCCTCGGCT ACAAGACGCT GACGGCGGCG AACGGCGCCG AAACGCTGGC GCAGATCGAC GCCGGCGCGA CGTTCGACCT GCTGTTCACC GACGTCATCA TGCCCGGCCT CAACGGCCGG CAACTGGCCG AGGCCGTCAA GCAGCGGCGG CCCGCCACCA AGGTGCTCTA CACTTCGGGC TACACCGAGA ACGCCATCGT GCATCACGGC CGGCTGGATC CCGGGGTGTT GCTGCTGCCG AAGCCGTACC GCAAATCCGA ACTGGCGCGA CTGATCCGCG CGGCGCTGAA TTCCGGCATC GCTCCTGCCG ACGACATCCA CATCGGACTA TCAGGGCGCG TGTCGTCCTG A
|
Protein sequence | MSQGLSLSTR LTIAIVALVV ATAGTVGYLS YRNIAAIAVP RALVRLDAHA QSVAVDLANI VKNARADVKG FRDVIGLEEI IALSRDPSLA RSGGLTPAQW RARIARRFAA ELEAKQHYAQ FRIVGIADGG RELIRVDRRS GDGKVRIVAD ADLQQKADRG YFKQAAAAPD RSIIVSPIEL NQENGAIELP PQPVVRISSP IFAPDGQLFG LLIVNIDLRP AFATLKARAA PDTTIYVVNE RGDYLAHPDS EREFGFEFGT PFRVQDDFPA AARALAEGDP HPGLIEHRGG KRYGLAMASL RLGDGPPVSV VEVIPEDKII ALALKALRDS SLLGGTIAVL GAMLLGFILA RTLTRQLTQM TAAVSGFADG KPLVVPLNAG GEIGVLARAF QNMAREVDEK NAAIRREKDI FEGIMSAMAE AVLLIDADGI IVYANRANQE LLGPIATDGT TWRELYDIYL PDGTTLLPRQ QWPSARCLRG EQVDGFELVY RRRDSGKTVH VMGSAHPIRG AEGARAGVVV VYRDVTATKE IERQLHQSQK LDAIGQLTGG VAHDFNNTLT VITGTAEILF ESLADRPNLQ QIAKMIDDAA GRGAELTKHL LAFARRQPLQ PRNVDVNTLV LNTAQLLRPT LGEQIEIESM LGNDAEPAHI DPSQLSTALL NLAVNARDAM PNGGKLTLET GNVVLDETYA HANPEVTPGP YVMIAVSDSG TGISAAILDK VFEPFFTTKD VGKGTGLGLS MVYGFVKQSN GHIKIYSEEG YGTTIKLYLP RASADAEELA PTAPIKGGSE TVLVVEDDAM VRNFVVTQLR SLGYKTLTAA NGAETLAQID AGATFDLLFT DVIMPGLNGR QLAEAVKQRR PATKVLYTSG YTENAIVHHG RLDPGVLLLP KPYRKSELAR LIRAALNSGI APADDIHIGL SGRVSS
|
| |