Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_1045 |
Symbol | |
ID | 3908897 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 1201886 |
End bp | 1202869 |
Gene Length | 984 bp |
Protein Length | 327 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637882938 |
Product | thiosulphate-binding protein |
Protein accession | YP_484666 |
Protein GI | 86748170 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1613] ABC-type sulfate transport system, periplasmic component |
TIGRFAM ID | [TIGR00971] sulfate/thiosulfate-binding protein |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTCGCC GAATTGTTCC GCTGCTCGCC GGCCTGATGG TCGCGACCTC CGCGCAGGCC GCCGACGTGT CGCTGCTCAA CGTCTCGTAC GACCCGACCC GCGAACTCTA CAGCGAGTTC AACAAATCCT TCGCCGCCGC GTATCAGAAG GAAACCGGCG ACACCGTCAC GATCAAGCAG TCGCACGGCG GCTCCGGCTC GCAGGCGCGC TCGGTGATCG ACGGTCTGCA GGCCGACGTC GTGACGCTGG CGCTGGCCTA CGACATCGAC GCGATCGCCA ACAAGGGGCT GCTGACCAAG GACTGGCAGA AGCGTCTGCC GCAGAATGCG TCGCCCTACA CCTCGACCAT CGTGTTCCTG GTGCGCAAGG GCAATCCGAA GGGCATCAAG GACTGGCACG ATCTGATCAG GCCCGGAATC AGCGTGATCA CGCCGAACCC GAAGACCTCC GGCGGCGCGC GCTGGAATTA TCTGGCGGCC TGGGGCTACG CGCTGAAGAC GGAGGGATCG GAGGACAAGG CGCGCGACTT CGTCGGGAAC ATTTACAAGA ATGTGCCGGT GCTGGACACC GGCGCCCGCG GCGCGACCAT GACCTTCGTC CAGCGTGGCG TCGGCGACGT GCTGCTGGCG TGGGAGAACG AGGCATTCCT GGCGGTCAAG GAATTCGGCA AGGACAGATT CGAGATCGTG GTGCCGTCGA TCTCGATTCG CGCCGAGCCG CCGGTGGCGC TGGTCGACAG CGTGGTCGAC AAGAAAGGTA CCCGGGCAGT GGCCGAAGCC TATCTGCAGT ATTGGTACAC CAAGGAAGGT CAGGAAATCG CCGCACGGAA CTTCTATCGT CCGCGCGATT CGGAGATTGC CAACAAGCAC GCCTTCGCGA AGGTCGAGTT GTTCACCATC GACGAATTGT TCGGCGGCTG GACCAAGGCG CAGACGACGC ACTTCACCGA CGGTGGGGTG TTCGACAAGA TCTACAAGAA CTGA
|
Protein sequence | MFRRIVPLLA GLMVATSAQA ADVSLLNVSY DPTRELYSEF NKSFAAAYQK ETGDTVTIKQ SHGGSGSQAR SVIDGLQADV VTLALAYDID AIANKGLLTK DWQKRLPQNA SPYTSTIVFL VRKGNPKGIK DWHDLIRPGI SVITPNPKTS GGARWNYLAA WGYALKTEGS EDKARDFVGN IYKNVPVLDT GARGATMTFV QRGVGDVLLA WENEAFLAVK EFGKDRFEIV VPSISIRAEP PVALVDSVVD KKGTRAVAEA YLQYWYTKEG QEIAARNFYR PRDSEIANKH AFAKVELFTI DELFGGWTKA QTTHFTDGGV FDKIYKN
|
| |