Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_4010 |
Symbol | |
ID | 3969200 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 4458316 |
End bp | 4459305 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637927114 |
Product | thiosulphate-binding protein |
Protein accession | YP_533855 |
Protein GI | 90425485 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1613] ABC-type sulfate transport system, periplasmic component |
TIGRFAM ID | [TIGR00971] sulfate/thiosulfate-binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.311837 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCCGTC GTTTTCTTAC CGTGATCGCG GGATTGATCT GGGCAGGCTC CGCGATGGCC GCTGACGTCA CGCTGCTCAA CGTGTCGTAC GATCCGACCC GCGAACTTTA TTCCGACTTC AACAAGGCGT TCGCCGCCGC CTATCAGAAG GACGCCGGCA AGAGCGTCGA GATCAAGCAG TCGCATGGCG GCTCCGGCGC GCAGGCGCGT TCGGTGATCG ACGGGCTGCA GGCCGACGTC GTCACCTTGG CGCTCGCCTA CGACATCGAC GCCATCGCCA ACAAGGGGCT GATCGCCAAG GATTGGCAGG CCAAGCTGCC GCAGAACTCA TCGCCCTACA CCTCGACCAT CGTGTTCCTG GTGCGCAAGG GCAACCCGAA GGGCATCAAG GACTGGGATG ATCTGACCAA ATCCGGCGTC AGCGTGATCA CGCCGAACCC GAAGACCTCG GGCGGCGCGC GCTGGAACTA TCTCGCCGCC TGGGGCTACG CGCTGAAGAA GTCGGGTTCC GAACAGGGGG CGCGGGAGTT CGTCGCCAAC ATCTATAAGA ACGTGCCGGT GCTGGATACC GGGGCGCGCG GCTCCACCGT CAGTTTCGTC GAGCGCGGCG TCGGCGACGT GTTGCTGGCC TGGGAAAACG AGGCGTTTCT CGCGGTGAAG GAATTCGGCA AGGACAGGTT CGAGATCGTG GCGCCATCGG TGTCGATCCT GGCGGAGCCG CCGATCGCGG TGGTCGATGG CGTTGCCGAC AAGAAAGGCA CCCGCTCTGC CGCGGAAGCT TATCTGAAAT ACTGGTACAC TCCAGAGGGC CAGGAAATCG CCGCGCGCAA CTTCTACCGC CCTCGCGATG CCGGAATTGC CAAGAAGTAT GCCGACTCGT TCGCCAAGGT TGAGCTGTTC ACCATCGACG ACGTGTTCGG CGGCTGGACC AAGGCGCAAA AGGAACACTT CAGCGACGGC GGTGTGTTCG ATAAGATATA TAAGGACTAA
|
Protein sequence | MVRRFLTVIA GLIWAGSAMA ADVTLLNVSY DPTRELYSDF NKAFAAAYQK DAGKSVEIKQ SHGGSGAQAR SVIDGLQADV VTLALAYDID AIANKGLIAK DWQAKLPQNS SPYTSTIVFL VRKGNPKGIK DWDDLTKSGV SVITPNPKTS GGARWNYLAA WGYALKKSGS EQGAREFVAN IYKNVPVLDT GARGSTVSFV ERGVGDVLLA WENEAFLAVK EFGKDRFEIV APSVSILAEP PIAVVDGVAD KKGTRSAAEA YLKYWYTPEG QEIAARNFYR PRDAGIAKKY ADSFAKVELF TIDDVFGGWT KAQKEHFSDG GVFDKIYKD
|
| |