Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E2797 |
Symbol | cysP |
ID | 6272837 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 2594873 |
End bp | 2595889 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641726751 |
Product | thiosulfate transporter subunit |
Protein accession | YP_001881224 |
Protein GI | 187732231 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4150] ABC-type sulfate transport system, periplasmic component |
TIGRFAM ID | [TIGR00971] sulfate/thiosulfate-binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGTTA ACTTACTGAA AAAGAACTCA CTCGCGCTGG TCGCTTCTCT GCTGCTGGCG GGCCATGTAC AGGCAACGGA ACTGCTGAAC AGTTCTTATG ACGTCTCCCG CGAGCTGTTT GCCGCCCTGA ATCCGCCGTT TGAACAGCAA TGGGCAAAAG ATAACGGCGG CGACAAACTG ACGATAAAAC AATCTCATGC CGGGTCATCA AAACAGGTGC TGGCAATTTT GCAGGGCTTA AAAACCGACG TTGTCACTTA TAACCAGGTG ACCGACGTAC AAATTCTGCA CGACAAAGGC AAGCTGATCC CGGCTGACTG GCAGTCGCGC CTGCCGAATA ATAGCTCGCC GTTCTACTCC ACCATGGGCT TCCTGGTGCG TAAGGGCAAC CCGAAGAATA TCCACGACTG GAACGACCTG GTGCGCTCCG ACGTGAAGCT GATTTTCCCG AACCCGAAAA CGTCGGGTAA CGCGCGTTAT ACCTATCTGG CGGCATGGGG CGCAGCGGAT AAAGCTGACG GTGGCGACAA AGCCAAAACC GAACAGTTTA TGACCCAGTT CCTGAAAAAC GTTGAAGTGT TCGATACTGG CGGTCGTGGC GCGACCACCA CTTTTGCCGA GCGCGGCCTG GGCGATGTAC TGATCAGCTT CGAGTCGGAA GTGAACAACA TCCGTAAACA GTATGAAGCG CAAGGCTTTG AAGTGGTGAT TCCGAAAACC AACATTCTGG CGGAATTCCC GGTGGCGTGG GTCGATAAAA ACGTGCAGGC CAACGGTACG GAAAAAGCAG CAAAAGCCTA CCTGAACTGG CTCTACAGCC CGCAGGCGCA GACCATCATC ACCGACTATT ACTACCGCGT AAATAACCCG GAAGTCATGG ACAAACTGAA AGATAAATTC CCGCAGACCG AGCTGTTCCG CGTGGAAGAC AAATTTGGCT CCTGGCCGGA AGTGATGAAA ACCCACTTCA CCAGCGGCGG CGAGTTAGAC AAGCTGTTAG CGGCGGGGCG TAACTGA
|
Protein sequence | MAVNLLKKNS LALVASLLLA GHVQATELLN SSYDVSRELF AALNPPFEQQ WAKDNGGDKL TIKQSHAGSS KQVLAILQGL KTDVVTYNQV TDVQILHDKG KLIPADWQSR LPNNSSPFYS TMGFLVRKGN PKNIHDWNDL VRSDVKLIFP NPKTSGNARY TYLAAWGAAD KADGGDKAKT EQFMTQFLKN VEVFDTGGRG ATTTFAERGL GDVLISFESE VNNIRKQYEA QGFEVVIPKT NILAEFPVAW VDKNVQANGT EKAAKAYLNW LYSPQAQTII TDYYYRVNNP EVMDKLKDKF PQTELFRVED KFGSWPEVMK THFTSGGELD KLLAAGRN
|
| |