Gene Information |
Locus tag | SeD_A2810 |
Symbol | cysP |
ID | 6874517 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 2681670 |
End bp | 2682686 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642785864 |
Product | thiosulfate transporter subunit |
Protein accession | YP_002216514 |
Protein GI | 198242952 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4150] ABC-type sulfate transport system, periplasmic component |
TIGRFAM ID | [TIGR00971] sulfate/thiosulfate-binding protein |
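The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched in a few lines. This is an illustrative check only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 2681670
END_BP = 2682686


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1017
print(protein_length(length_bp))  # 338
```

Both results match the table: 1017 bp is 339 codons, of which the last is the stop codon, leaving 338 amino acids.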
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 90 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCCGTTA ACTTACTGAA AAAGAGACCC CTGACGCTGG CAGCAATGCT ATTACTGGCA GGGCAGGCGC AGGCAACGGA GCTGCTGAAC AGCTCATACG ATGTCTCCCG CGAGCTGTTT GCCGCCCTTA ACCCGCCGTT TGAGCAACAA TGGGCGAAGG ATAACGGCGG CGATAAGCTG ACGATTAAGC AGTCTCATGC CGGGTCATCA AAACAGGCGC TGGCGATTTT GCAGGGACTG AAGGCAGACG TCGTGACCTA CAATCAGGTG ACCGACGTAC AGATTCTCCA TGATAAAGGC AAACTGATCC CTGCCGACTG GCAAAGCCGT CTGCCGAACA ACAGTTCGCC ATTCTATTCC ACGATGGGCT TCCTGGTGCG CAAGGGGAAC CCCAAAAATA TTCACGACTG GAGCGATCTT GTACGTTCCG ACGTGAAGCT GATTTTCCCT AACCCGAAAA CCTCCGGCAA CGCCCGTTAC ACGTATCTGG CGGCATGGGG CGCGGCGGAT AACGCGGACG GCAGCGATAA AGCCAAAACC GAACAGTTTA TGACCCAGTT CCTGAAAAAC GTCGAAGTGT TTGATACCGG CGGCCGCGGC GCTACGACTA CCTTTGCCGA GCGTGGTCTG GGCGATGTGC TGATTAGTTT TGAATCGGAA GTGAACAACA TCCGCAAACA ATATGAAGCC CAGGGATTTG AAGTGGTGAT CCCGAAAACG AACATTCTTG CTGAATTCCC GGTTGCCTGG GTAGATAAAA ACGTGCAGGC CAACGGCACA GAAAAAGCCG CCAAAGCTTA CCTGAACTGG CTGTATAGCC CGCAGGCGCA GACCATCATT ACCCATTACT ACTACCGCGT GAATAACCCG GAAATCATGG GCAAGCAAGC AGATAAATTC CCGCAGACCG AACTGTTCCG CGTGGAAGAC AAGTTTGGTT CCTGGCCGGA AGTGATGAAA ACGCACTTTG CCAGTGGCGG CGAGCTGGAC AAACTGTTGG CGGCGGGGCG TAAGTAA
Protein sequence | MAVNLLKKRP LTLAAMLLLA GQAQATELLN SSYDVSRELF AALNPPFEQQ WAKDNGGDKL TIKQSHAGSS KQALAILQGL KADVVTYNQV TDVQILHDKG KLIPADWQSR LPNNSSPFYS TMGFLVRKGN PKNIHDWSDL VRSDVKLIFP NPKTSGNARY TYLAAWGAAD NADGSDKAKT EQFMTQFLKN VEVFDTGGRG ATTTFAERGL GDVLISFESE VNNIRKQYEA QGFEVVIPKT NILAEFPVAW VDKNVQANGT EKAAKAYLNW LYSPQAQTII THYYYRVNNP EIMGKQADKF PQTELFRVED KFGSWPEVMK THFASGGELD KLLAAGRK
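The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The sketch below strips the whitespace, computes GC content, and translates the first codons. It rests on two assumptions: the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand), and the amino acid assignments of translation table 11 are the same as those of the standard code. All helper names are hypothetical.

```python
from itertools import product

# Codons in NCBI order (TCAG) mapped to the amino acids of the standard code;
# translation table 11 is assumed to use the same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the spacing between 10-character blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(raw: str) -> str:
    """Translate in frame 1 and stop at the first stop codon.

    Raises KeyError on codons containing anything other than A, C, G, T.
    """
    seq = clean_sequence(raw)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence, copied from the record.
first_blocks = "ATGGCCGTTA ACTTACTGAA AAAGAGACCC"
print(translate(first_blocks))                # MAVNLLKKRP
print(gc_percent("ATGGCCGTTA ACTTACTGAA"))    # 40.0
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would give a way to cross-check the GC content and protein length reported in the record.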