Gene Information |
Locus tag | Rpal_4904 |
Symbol | |
ID | 6412590 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 5272912 |
End bp | 5275533 |
Gene Length | 2622 bp |
Protein Length | 873 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 642714781 |
Product | multi-sensor hybrid histidine kinase |
Protein accession | YP_001993868 |
Protein GI | 192293263 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase; [COG0784] FOG: CheY-like receiver |
TIGRFAM ID | [TIGR00229] PAS domain S-box; [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.683005 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGGCC TGTGGCAATC GATGACAACG GCCTGGATGT CCCGAACCGA TCGGTCGGCT CTGCTCGACT TCTGGCTCGG CGGCCGCGGC GCCGGCACGC TGATCGCGAA CACCGATATC GACGAGCAGG AAATGCGGCG GATCCGCGCC GCGCAGATCA ACAGCGTCAC CCGGCTGACG CCGGTGACGA TGAGTATCAA CGTCGCCAAC GCTGCGCTGA TCCTGTTCAC CTTCTGGGAC AACGACGCCC GCCCGCAACT GCTGGCGTGG ACGGCGATGA TCGTGTTCGC GGCGGGATCG GCGGTTCAGT CGTGGCTGCG CAACCGCTTC GAGCCCCGGC TCGAGGCCTC CGCCCATGCG ATCCGGCGGA TGACGCTGCA CGCGTTGCTG CTCGGATTGA TCTGGGGCGC GATGCCCGTG ATGGTGTTCG TCAAGGCCGA TCCGGGTGAT CAGTTGATCG TCGCCTGCGT GGCGGCCGGC ATGATCTCCG GCGGCGCGTT TACGCTGTCC ACGGTGCCAC GCGCCGGCCT CGCTTATACT TGGTCACTGG CCGCTGGATC GGCGCTGTCG CTGGCGCTGT GCGATGGCAT GGCTTACCGC ATCACCGCGG CCTTCCTGGT GCTGTACGCG GTGTTCATGT CGCGCAACCT GATCTCGCAC GGCGAGATGT TCTTCGACAA TTTGCGGGCG AAGTTCGAAC TCGAACGGCA GACTGAAATC ATCTCGCTGC TGCTGAAGGA TTTCCAGGCC AACGCCAGCG ACTGGCTGTG GCAGACCGAT GCCCATGGCC GGCTGGTGCA TGTGCCGGAG CGCTTCGTCG AAGTCGCCAA GCTGCCGCCG TCGCTGCTGA TCGGGGCGCC GCTCGCGGAC GTGATCGGCA TGCTGTGCCC GGAGGACGGC CGCTGCGCCA TGGGTGTCGC TGCGAAGATG GCGCAGCGCG AGCCGATGAA CGATCTCGTG GTGCATGTGG TGATCGGCGG CACGCCGCGT CTGTGGTCGC TCACCGCCAA GCCGATGTTC GATGCCGCCG GCGAGTTCGC AGGCTATCGC GGCGTCGGCC GTGACGTCAC CGAGCGCTGG CGCGCCGAGC AGGCCGAGGC CGAAAACCGC GCCAAGTCCA GCTTCCTGGC GATGATGAGT CACGAGATCC GCACGCCGAT GAACGGTGTG CTGGGCCTCG CCAACTCACT GCTCGAAACC AAGCTCGATC CCGAGCAGCA ACAGGCCGTC ACCACGATCC GCGATTCCGG TGACGACCTG CTGCGCATCC TCAACGATAT CCTCGATCTG TCGAAACTCG AGGCCGGCCG GCTGGAGTTC GAGCAGGCCG ACTTCTCGCC CACCACCCTG GTCGAGTCGG TCCGCGCGAT TATCGAGCCG GAAGTTCGAG GCAAGGGCAT CGAACTGAAG GTCGATATCG ATCCGCGGCT ACCGCCATCG TTGAACGGTG ACGCGGCACG AATCCGCCAG GTGCTGCTCA ACCTCGCTGC GAACGCGGTG AAGTTCACCG AGCAAGGCTC GATTGCCATC GTGCTGACGT GCGTAAAGCG CAACGACAGC CACGCCACCG TCGAGTGGCA GGTGACCGAT ACCGGCATCG GCATTTCGCC GGATCGTGTC GGCAGCCTGT TCACCGACTT TGCCCAGGCC GATGTCTCGA TCAATCGCCG GTTCGGCGGC ACCGGGCTTG GGCTCGCGAT CAGCCGGCGG ATCGTCGAAC AGATGGGCGG CGATATCGCC GTCACCTCCC GCGAAGGCGA GGGCTCGACC TTCCGCTTCA GCCTCGACCT GCCGTGGAGC 
AATGCCTGGA TTGCCGACCA CCGGCTCGAC CGGCTCGGCA GCGACGATCT CCGCACCCGC ATCGCGATGT TGGGACGGCC GCTACGCGTG CTGATCGCCG AAGACGACTC CACCAACCAG ATGGTGGTGA TGAAGATGCT GCAGGAATTC GCCGCCGAAA TCACCGTGGT GTCCGACGGC ACCGACGCGG TGCAGGCGGC AGGTGAGGGC GAGTTCGACG TCGTACTGAT GGACGTGCGG ATGCCCAACA TGGACGGCCT CGCCGCCACC CGGGCGATCC GCGGCAAGGG TGGCGCGCTC GCCAAACTGC CGATTATCGC GCTCACCGCG AACGCTTTCC CCGACGACGT CAAAGTTTGC CGCGACGCCG GCATGAATGA CTTCCTGGCA AAGCCGCTGC GCAAGCCGGC GCTGGTTGCG GCCGTGCTGC GGGCGCTGCG CGGCGCATCG ACACCGGTGA CGATGCCGTC GCCGCCGATG CTGGTCGATC TCGACACGCT CGCCGAACTC ACCGCCGAGA TCGGCCAGGA TCAGGTGAAC GAGATGGTGG CGCTGTTCTT CGCCGAAACC GAGCGGCGGA TCGCGCTGTT CCGGCGCTTT GCCGAACACA TGGACCGCCA CGACCTTGAG GTCGAGGCGC ATTCGCTCAA GGGCGGGGCC CGCACACTCG GCTTCCACCC GATCGCCGAG ATCGCACGTT CGATCGAGCG CGAAGCGGGA TCGATCTCGC CCGAGGCGCT CGATCTGTTC ACCGGTCAGC TCAACAAGGC GCTGATCGAG CTGCGCCGGC AATGCGAAGG CAGCCTGAAG CTGGCGAGCT GA
|
Protein sequence | MSGLWQSMTT AWMSRTDRSA LLDFWLGGRG AGTLIANTDI DEQEMRRIRA AQINSVTRLT PVTMSINVAN AALILFTFWD NDARPQLLAW TAMIVFAAGS AVQSWLRNRF EPRLEASAHA IRRMTLHALL LGLIWGAMPV MVFVKADPGD QLIVACVAAG MISGGAFTLS TVPRAGLAYT WSLAAGSALS LALCDGMAYR ITAAFLVLYA VFMSRNLISH GEMFFDNLRA KFELERQTEI ISLLLKDFQA NASDWLWQTD AHGRLVHVPE RFVEVAKLPP SLLIGAPLAD VIGMLCPEDG RCAMGVAAKM AQREPMNDLV VHVVIGGTPR LWSLTAKPMF DAAGEFAGYR GVGRDVTERW RAEQAEAENR AKSSFLAMMS HEIRTPMNGV LGLANSLLET KLDPEQQQAV TTIRDSGDDL LRILNDILDL SKLEAGRLEF EQADFSPTTL VESVRAIIEP EVRGKGIELK VDIDPRLPPS LNGDAARIRQ VLLNLAANAV KFTEQGSIAI VLTCVKRNDS HATVEWQVTD TGIGISPDRV GSLFTDFAQA DVSINRRFGG TGLGLAISRR IVEQMGGDIA VTSREGEGST FRFSLDLPWS NAWIADHRLD RLGSDDLRTR IAMLGRPLRV LIAEDDSTNQ MVVMKMLQEF AAEITVVSDG TDAVQAAGEG EFDVVLMDVR MPNMDGLAAT RAIRGKGGAL AKLPIIALTA NAFPDDVKVC RDAGMNDFLA KPLRKPALVA AVLRALRGAS TPVTMPSPPM LVDLDTLAEL TAEIGQDQVN EMVALFFAET ERRIALFRRF AEHMDRHDLE VEAHSLKGGA RTLGFHPIAE IARSIEREAG SISPEALDLF TGQLNKALIE LRRQCEGSLK LAS
|
| |
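The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic, which the sketch below checks. It assumes the coordinates are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length; both assumptions are consistent with the listed values but are not stated in the record. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for this illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information table of this record.
START_BP = 5272912
END_BP = 5275533

length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the Gene Length (2622 bp) and Protein Length (873 aa) fields. Strand does not affect the length calculation, only the direction in which the sequence is read.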