Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A2638 |
Symbol | cysP |
ID | 6485491 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 2554463 |
End bp | 2555479 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642737971 |
Product | thiosulfate transporter subunit |
Protein accession | YP_002041705 |
Protein GI | 194444437 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4150] ABC-type sulfate transport system, periplasmic component |
TIGRFAM ID | [TIGR00971] sulfate/thiosulfate-binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.461248 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 84 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGTTA ACTTCCTGAA AAAAAATGCT CTGACGCTGG CGGCGTCTCT GTTACTGGTC GGTCAGGTAC AGGCGACGGA ACTGCTGAAC AGCTCATACG ATGTCTCCCG CGAGCTGTTT GCCGCCCTTA ACCCACCGTT TGAGCAACAA TGGGCGAAGG ATAACGGCGG CGATAAGCTG ACGATTAAGC AGTCTCATGC CGGGTCATCA AAACAGGCGC TGGCGATTTT GCAGGGACTG AAGGCAGACG TCGTGACCTA CAATCAGGTG ACCGATGTAC AGATTCTCCA TGATAAAGGC AAACTGATCC CTGCCGACTG GCAAAGCCGT CTGCCGAACA ACAGTTCACC ATTCTATTCC ACGATGGGCT TCCTGGTGCG CAAGGGAAAC CCGAAAAATA TTCACGACTG GAGCGACCTT GTACGTTCCG ACGTGAAGCT GATTTTCCCG AACCCGAAAA CCTCCGGCAA CGCCCGCTAT ACGTATCTGG CGGCATGGGG CGCGGCGGAT AACGCGGACG GCGGCGATAA AGCCAAAACC GAACAGTTTA TGACCCAGTT CCTGAAAAAC GTCGAAGTGT TTGATACCGG CGGTCGCGGC GCTACGACTA CCTTTGCCGA GCGTGGTCTG GGCGATGTGC TGATTAGTTT TGAGTCGGAA GTGAACAATA TCCGCAAACA ATATGAAGCC CAGGGATTTG AAGTGGTGAT CCCGAAAACG AACATTCTTG CTGAATTCCC GGTTGCCTGG GTGGATAAAA ACGTGCAGGC CAACGGCACA GAAAAAGCCG CCAAAGCTTA CCTGAACTGG CTGTATAGCC CGCAGGCGCA GACCATCATC ACCCATTACT ACTACCGCGT GAATAACCCG GAAATCATGG GCAAGCAAGC AGATAAATTC CCGCAGACCG AACTGTTCCG CGTGGAAGAA AAGTTTGGTT CCTGGCCGGA AGTGATGAAA ACGCACTTTG CCAGCGGCGG CGAGCTGGAC AAACTGTTGG CGGCGGGGCG TAAGTAA
|
Protein sequence | MAVNFLKKNA LTLAASLLLV GQVQATELLN SSYDVSRELF AALNPPFEQQ WAKDNGGDKL TIKQSHAGSS KQALAILQGL KADVVTYNQV TDVQILHDKG KLIPADWQSR LPNNSSPFYS TMGFLVRKGN PKNIHDWSDL VRSDVKLIFP NPKTSGNARY TYLAAWGAAD NADGGDKAKT EQFMTQFLKN VEVFDTGGRG ATTTFAERGL GDVLISFESE VNNIRKQYEA QGFEVVIPKT NILAEFPVAW VDKNVQANGT EKAAKAYLNW LYSPQAQTII THYYYRVNNP EIMGKQADKF PQTELFRVEE KFGSWPEVMK THFASGGELD KLLAAGRK
|
| |