Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_1398 |
Symbol | |
ID | 3908348 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 1589039 |
End bp | 1590049 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637883292 |
Product | thiosulphate-binding protein |
Protein accession | YP_485019 |
Protein GI | 86748523 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4150] ABC-type sulfate transport system, periplasmic component |
TIGRFAM ID | [TIGR00971] sulfate/thiosulfate-binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACCGCT TTGCTTTCGC CCTCGCCGCC GGCCTCGCCG CATTCGCGAC CGCCATGCCG GCGCAAGCGC AGACGCCACC CGCACCGCTG CTCAATGTCT CCTACGACAT CGCGCGGGAA CTCTATGCCG AGATCAACGC TGCATTCATC CCGCACTGGA AGCAGACGAC CGGCCAGGAC ATCAGCATCA ACCAGTCGCA TGGCGGCTCG TCGCGGCAGG CGCGCTCGAT CCTCGAAGGG CTCGAAGCCG ACGTGGTGAC CTTCAATCAG GTCACCGACG TCCAGGTGCT GCACGACAAG GGCAAGCTGA TCCCGGCCGA TTGGGCGAAG CGGCTGCCCA ATAATTCATC GCCCTACTAT TCGCTGCCGG CGTTCCTGGT GCGCGCCGGC AATCCGAAGG GCATCAAGGA TTGGGACGAT CTGGTGAAGC CGGGCGTCAA GGTGATCTTC CCCAACCCGA AGACCTCCGG CAATGCACGC TACACCTATC TCGCGGCCTA CGCCTTCGCC AAGCATAAAT ACGGCAGCGA GGCCGAGGCC GACGCTTTTG TTAAAAAGCT GTTCGCCAAC GTGCCGGTGT TCGACACCGG CGGCCGCGCC GCGACCACCA CCTTCGTCGA GCGCCAGACC GGCGACGTGC TGATCACCTT CGAGGCCGAG ACCAGCGCGA TCCGCGACCT CGCCGGCGCC GACAAGTATC AAGTCGTGGT GCCGCCGACC AGTGTGCTGG CCGAATTCCC CGTCAGCGTC GTCGACAAAT ACGCCGACAA GCACGGCACC CGCGCGCTCG CGACCGCCTA TCTCGAATAT CTGTATTCAC CCGAGGGCCA GACCATCCTC GCCAAGGCCT ATAACCGCGT CCACGACAAG GCCGTGATCG CGCAATTCAA GGACAAGTTC CCCGAGGTCA AGCTGTATCG CGTCGAGGAC GAATTCGGCG GCTGGGACAA GCTCAACGCC GAACACCTCG CCTCCGGCGC GAAGCTCGAT CAGCTGTTCG GAGGACGATA A
|
Protein sequence | MNRFAFALAA GLAAFATAMP AQAQTPPAPL LNVSYDIARE LYAEINAAFI PHWKQTTGQD ISINQSHGGS SRQARSILEG LEADVVTFNQ VTDVQVLHDK GKLIPADWAK RLPNNSSPYY SLPAFLVRAG NPKGIKDWDD LVKPGVKVIF PNPKTSGNAR YTYLAAYAFA KHKYGSEAEA DAFVKKLFAN VPVFDTGGRA ATTTFVERQT GDVLITFEAE TSAIRDLAGA DKYQVVVPPT SVLAEFPVSV VDKYADKHGT RALATAYLEY LYSPEGQTIL AKAYNRVHDK AVIAQFKDKF PEVKLYRVED EFGGWDKLNA EHLASGAKLD QLFGGR
|
| |