Gene Information |
Locus tag | RPD_1156 |
Symbol | |
ID | 4021632 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 1315777 |
End bp | 1316766 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637961348 |
Product | thiosulphate-binding protein |
Protein accession | YP_568295 |
Protein GI | 91975636 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1613] ABC-type sulfate transport system, periplasmic component |
TIGRFAM ID | [TIGR00971] sulfate/thiosulfate-binding protein; [TIGR01564] S-layer protein, MJ0822 family |
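The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon. The record does not state either convention explicitly, although its values are consistent with both. The function names are hypothetical and not part of any database API.

```python
def feature_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length_bp: int) -> int:
    """Protein length, assuming the CDS length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record for RPD_1156.
gene_length = feature_length_bp(1315777, 1316766)
protein_length = expected_protein_length_aa(gene_length)
print(gene_length, protein_length)  # 990 329
```

Both results agree with the Gene Length (990 bp) and Protein Length (329 aa) fields.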
| |
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.464796 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTCGCC GAATAATTCC TCTGCTCGCC GGACTTCTGG TTGCGACGTC CGCGCACGCC GCCGACGTTT CGTTGCTCAA CGTGTCGTAC GATCCGACCC GCGAACTCTA TGCCGCGTTC AACAAGTCGT TTGCCGCCGC GCATCAAAAA GAGTTCGGCA AGAGCGTCGA GATCAAGCAG TCGCATGGCG GCTCGGGCTC GCAGGCGCGC TCGGTGATCG ACGGGCTGCA GGCCGACGTC GTTACGCTGG CGCTCGCCTA TGACATCGAT GCGATCGCCA ACAAGGGTCT GCTGTCGAAG GATTGGCAGA AGCGGCTGCC GCAGAATGCG TCGCCCTACA CCTCGACCAT CGTGTTCCTG GTGCGCAAGG GCAATCCGAA AGGCATCAAG GACTGGGACG ATCTGATCAA GTCCGGCGTC AGTGTCGTGA CGCCGAACCC GAAGACCTCG GGCGGCGCGC GTTGGAACTA TCTTGCGGCC TGGGGCTATG CCGAGAAGAA GCTGGGCTCC GCCGAGGCGG CGCGGGAATT CGTCGGTAAG CTCTACAAGA ACGTTCCGGT GCTCGATACC GGCGCGCGCG GTTCGACCGT CACCTTCGTC GAGCGCGGCG TCGGCGACGT GTTGCTGGCG TGGGAGAACG AGGCTTATCT TGCGGTCAAG GAATTCGGCA AGGACAAGTT CGAGATCGTT GCCCCGTCAG TGTCGATCCT GGCCGAGCCG CCGGTGACGA TCGTCGATAC CGTCGTCGAC AAGAAGGGAA CCCGGGCCGC GGCCGAGGCC TATCTGAAAT ATCTCTACAG CAAGGACGGT CAGGAAATTG CCGCACGGAA CTTCTACCGT CCGCGCGACC CGGAGGTCGC CAAGACGTAT GAAGGCTCGT TCGCCAAGGT CGACCTGTTC ACGATCGACG ATGCATTCGG AGGCTGGACT AAGGCGCAGG CCGAGCACTT CGCCGAAAAC GGTGTGTTCG ACAAGATCTA CAAGAACTAG
|
Protein sequence | MFRRIIPLLA GLLVATSAHA ADVSLLNVSY DPTRELYAAF NKSFAAAHQK EFGKSVEIKQ SHGGSGSQAR SVIDGLQADV VTLALAYDID AIANKGLLSK DWQKRLPQNA SPYTSTIVFL VRKGNPKGIK DWDDLIKSGV SVVTPNPKTS GGARWNYLAA WGYAEKKLGS AEAAREFVGK LYKNVPVLDT GARGSTVTFV ERGVGDVLLA WENEAYLAVK EFGKDKFEIV APSVSILAEP PVTIVDTVVD KKGTRAAAEA YLKYLYSKDG QEIAARNFYR PRDPEVAKTY EGSFAKVDLF TIDDAFGGWT KAQAEHFAEN GVFDKIYKN
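As a rough consistency check between the two sequence fields, the gene sequence can be translated and its GC fraction computed with the standard library alone. This is a minimal sketch, not the pipeline that produced this record. It relies on translation table 11 assigning the same amino acids to codons as the standard code, and it does not apply table 11's alternative start codons (not needed here, since the gene begins with ATG). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments of the standard code,
# which translation table 11 shares (table 11 differs in start codons only).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGTTTCGCC GAATAATTCC TCTGCTCGCC"))  # MFRRIIPLLA
```

The printed peptide matches the first block of the Protein sequence field. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the GC content field.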
|
| |