Gene Information |
Locus tag | RPB_3596 |
Symbol | |
ID | 3911398 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4123837 |
End bp | 4126617 |
Gene Length | 2781 bp |
Protein Length | 926 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637885498 |
Product | multi-sensor hybrid histidine kinase |
Protein accession | YP_487202 |
Protein GI | 86750706 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG5002] Signal transduction histidine kinase |
TIGRFAM ID | [TIGR00229] PAS domain S-box; [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
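The coordinate and length fields in the record above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions, and that the final codon of the CDS is a stop codon that is not counted in the protein length; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 4123837
end_bp = 4126617
gene_length_bp = 2781
protein_length_aa = 926

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is the stop codon and encodes no residue.
expected_protein_length = codons - 1

print(span, codons, expected_protein_length)  # prints: 2781 927 926
```

Under these assumptions the three fields agree: 2781 bp is 927 codons, one of which is the stop, leaving 926 residues.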
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCCCCGAC GATTGAACTT ATCCACCCGG CTCACCCTCG CCATCGTGCC TCTCGTGGCG CTGACCGCGG CGACCGTCGG TTATCTGGGG TACCGAAACC TCGCGGCCAT TGCGATCGAA CGCACGCTGG CCGGGCTGGA TGCTACTGCG CGGTCGCGAG CTGTGGAACT CGCGAGCCAG ATTCGGAACG TCAGCGCCGA CGTCGCGAGC TTTCGCACGA TGATCGGCCT GGGCGAATTG ATCGCGCTCA GCCACGACGC GACGCTCCGG ACCGCCGGCG GCCGGACGCT GGCGGAGTGG CGCGCGCGGA TCGAGCAGCG ATTCGCCGAC GAACTCGGAG CGAAAGCCTA TCTGATCCGA TACCGCCTGA TCGGAGCGAG CAACGACGGC CGCGAGATCA TCCGGGTCGA GCGACGGAAC GATACGGTCC GGATTGTTCC GGACGACGAG TTGCGCGGGC AGAGCGAATA CGCCTTCTTC GAACAGGCCA TCCGAGCGGC CGGAAGCGAG GTGGTGGTCT CGCCGGTGGA ACTCGCCCGG ACCGACGGCG CGATCCTGCA ACCTCCGATG CCGCTGATCC GCGTGTCGGC CGCGCTGTTT GCGACCGACG GTACGATGTT CGGGCTGATC ATCGCCGATG TCGGTCTGCG CCCCGCCTTC GCGACGGCCA CCGCAAAAAC GCGAAAAGGC CGCACCGTCT TCATCATCAA CGACCGCGGC GACTACCTGC TGCATCCCGA CAAGTCTCGC GAGTTCGGTT TCGAATTCGA TCGGCCCGCC CGCATCCAGG ACGACTTTCC AAGCCTCGCC ACCGCGATCA CCAGCGGCAA GGATCAGACG GCGATCGTCG AGGACCGCAA CGGCGTGCCG ATCGGGGTTG CGATCGACCG TGTCGAAGGG GCGCCCCTGG CCATCGTCGA GACCGTGCCG CAGCAATTCA TTCTCGACGA CATCATGACC GCGTGGCTGG ATTCGACCTT GACCGGCGGC TCGGTCGCCG TGCTGACCGC CGTTCTGCTG GGTTTCGTCA TGGCCCGGAC CCTGATCAAG CCGCTGTCGC AGATGACGAA GGCGGTGGCG GGATTTGCCG AGGACGCGCC GCCGAAGATG CCGGTCGCGG CCAGTGGCGA AATCGGCGTG CTGGCGCGGG CGTTCGACAC CATGGTGCAG GACGTGCAGG CGAAGACCGC CGCGATCCGG CACGAGAAGG AGCTGTTCGA GAGCATCATG ACCACGATGG CCGAGTGTGT CGTGCTGATC GACCGCAACG GCGAGGCCAT CTATCAGAAC CGCGCCAACC GGGAACTGCT CAGCGCACTC GATATCAGGG TCGACCAGTG GCAGGAGCTC TACGACATCT ACACGCCGGA CGGCTCGACC CGGCTGTCCG CCGACCATTG GCCCTCCGCC CGCGTCCTGC GCGGCGAGAC CGTCGATAAT TACGAGATCG TCTGCCGAAG GCGCGATTCC GGCAAGACGG TTCATCTGAT GGGGAGCGCG CGGCCGTTGT GGGAAGCCGC GGGCACGCAA ACCGGAGCGG TCGTGGTGTT CCGCGACGTC ACCGAGATGC GGGCGACCGA GCACCGGCTG CATCAGTCAC AGAAGCTGGA AGCGATCGGC CAGCTCACCG GCGGCGTCGC GCACGACTTC AACAACATGC TGACGGTGAT CAACGGCACC GCCGAGATCC TGCTCGACGA ACTTGCCGAC CGGCCGGACC TCTGCAGCAT CGCCAGGATG ATCGAGCAGG CCGCCGGGCG CGGCGCCGAC CTGACGCGGC AACTGCTCGC CTTCGCCCGC 
AGGCAGCCGC TGCAACCGCG CAATATCGAC GTCAACGCCA TCGTGCTGAA CACCCAGCAA TTGCTGAAAG CGACGATCGG CGAACACATC GACGTCGAAG TCAGGCTGGC GCAGGACGTC GATGCGGCGC GGGTCGATCC GTCGCAACTC TCGTCGGCGC TGCTCAACCT CGCGGTGAAT GCGCGCGACG CGATGCCGAA CGGCGGCAAG CTGATGCTCG AAACCGCCGA CGTGGTGCTC GACGCCGCCT ACGGGCAGCA CAATCCCGAC GTCCAGCCCG GCCGCTACGT GATGATCGCG GTCAGCGACA CCGGCACAGG AATTCCAGCC GAGTTGTGCG ACAAGGTGTT CGAGCCGTTC TTCACGACCA AGAGCGCCGG CCAGGGCACC GGCCTCGGCC TCAGCATGGT CTATGGCTTC GTCAAGCAAT CGGGCGGGCA CATCAACATC TACAGCGAGG AGGGCCACGG CACCACGCTC AAGCTGTATC TGCCGCAGGC CGATTCCGAC CCGGCCGTCG ACAGCGCACC GGACGCCGGC CCGGCGACCG AGGGCGGCAG CGAAACCATC CTGCTGGTCG AGGACGACGA GTTGGTGCGC AAATTCGCGA TCGCCCAGCT CGCGGGTCTC GGTTATCGCA CCATCGCGAT GTGCGACGGC CAGGCGGCGC TGCGTGAGGC GGAGCGCGGC ACCGCGTTCG ATCTGCTGTT CACCGACGTG ATCATGCCGG GCGGCCTGAA CGGCCCGCAA CTCGCCGACG CGATCGCCCG GGTCCGGCCG GTGCGGGTGC TGTACACCTC GGGCTACACC GAGAACGCGA TCGTGCATCA CGACCGGCTC GACAGCGGCG CGCTGCTGCT GACCAAGCCG TATCGCAGGT CGGATCTGGC CCGGATGGTC CGCGCCGCAC TCGGCAAGGA CGTGCACGTC CCGCCGACCG GGATCGCGGC GGCACCCTCG TCGCGCGCCA GCGCCCGTTA G
|
Protein sequence | MPRRLNLSTR LTLAIVPLVA LTAATVGYLG YRNLAAIAIE RTLAGLDATA RSRAVELASQ IRNVSADVAS FRTMIGLGEL IALSHDATLR TAGGRTLAEW RARIEQRFAD ELGAKAYLIR YRLIGASNDG REIIRVERRN DTVRIVPDDE LRGQSEYAFF EQAIRAAGSE VVVSPVELAR TDGAILQPPM PLIRVSAALF ATDGTMFGLI IADVGLRPAF ATATAKTRKG RTVFIINDRG DYLLHPDKSR EFGFEFDRPA RIQDDFPSLA TAITSGKDQT AIVEDRNGVP IGVAIDRVEG APLAIVETVP QQFILDDIMT AWLDSTLTGG SVAVLTAVLL GFVMARTLIK PLSQMTKAVA GFAEDAPPKM PVAASGEIGV LARAFDTMVQ DVQAKTAAIR HEKELFESIM TTMAECVVLI DRNGEAIYQN RANRELLSAL DIRVDQWQEL YDIYTPDGST RLSADHWPSA RVLRGETVDN YEIVCRRRDS GKTVHLMGSA RPLWEAAGTQ TGAVVVFRDV TEMRATEHRL HQSQKLEAIG QLTGGVAHDF NNMLTVINGT AEILLDELAD RPDLCSIARM IEQAAGRGAD LTRQLLAFAR RQPLQPRNID VNAIVLNTQQ LLKATIGEHI DVEVRLAQDV DAARVDPSQL SSALLNLAVN ARDAMPNGGK LMLETADVVL DAAYGQHNPD VQPGRYVMIA VSDTGTGIPA ELCDKVFEPF FTTKSAGQGT GLGLSMVYGF VKQSGGHINI YSEEGHGTTL KLYLPQADSD PAVDSAPDAG PATEGGSETI LLVEDDELVR KFAIAQLAGL GYRTIAMCDG QAALREAERG TAFDLLFTDV IMPGGLNGPQ LADAIARVRP VRVLYTSGYT ENAIVHHDRL DSGALLLTKP YRRSDLARMV RAALGKDVHV PPTGIAAAPS SRASAR
|
| |
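The sequences above are displayed in space-separated blocks of 10 characters, so they need normalizing before any computation such as the GC content field. A minimal sketch follows; the helper names `normalize` and `gc_percent` are hypothetical, and the example input is only the first two blocks of the gene sequence, so its result is not expected to match the 67% reported for the whole gene.

```python
def normalize(seq: str) -> str:
    """Drop the spaces and line breaks used for display grouping."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    bases = normalize(seq)
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two 10-base blocks of the gene sequence in the record.
prefix = "TTGCCCCGAC GATTGAACTT"
print(len(normalize(prefix)), gc_percent(prefix))
```

Passing the full Gene sequence field to `gc_percent` would be the way to compare against the reported GC content, since the helper ignores the block spacing and the line break inside the field.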