Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeHA_C2704 |
Symbol | cysP |
ID | 6489557 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | - |
Start bp | 2613656 |
End bp | 2614672 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642742882 |
Product | thiosulfate transporter subunit |
Protein accession | YP_002046509 |
Protein GI | 194448416 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4150] ABC-type sulfate transport system, periplasmic component |
TIGRFAM ID | [TIGR00971] sulfate/thiosulfate-binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 96 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGTTA ACTTACTGAA AAAGAGACCC CTGACGCTGG CAGCAATGCT GTTACTGGCA GGGCAGGCGC AGGCAACGGA GCTGCTGAAC AGCTCATACG ATGTCTCCCG CGAGCTGTTT GCCGCCCTTA ACCCGCCGTT TGAGCAACAA TGGGCGAAGG ATAACGGCGG CGATAAGCTG ACGATTAAGC AGTCTCATGC CGGGTCATCA AAACAGGCGC TGGCGATTTT GCAGGGACTG AAGGCAGACG TCGTGACCTA CAATCAGGTG ACCGACGTAC AGATTCTTCA TGATAAAGGC AAACTGATCC CTGCCGATTG GCAAAGCCGT CTGCCGAACA ACAGTTCGCC ATTCTATTCC ACGATGGGCT TCCTGGTGCG CAAGGGGAAC CCGAAAAATA TTCACGACTG GAGCGATCTT GTACGTTCCG ACGTGAAGCT GATTTTCCCT AACCCGAAAA CCTCCGGCAA CGCCCGTTAC ACGTATCTGG CGGCATGGGG CGCGGCGGAT AACGCGGACG GCGGCGATAA AGCCAAAACC GAACAGTTTA TGACCCAGTT CCTGAAAAAC GTCGAAGTGT TTGATACCGG CGGTCGCGGC GCTACGACTA CCTTTGCCGA GCGTGGTCTG GGCGATGTGC TGATTAGTTT TGAATCGGAA GTGAACAACA TCCGCAAACA ATATGAAGCC CAGGGATTTG AAGTGGTGAT CCCGAAAACG AACATTCTTG CTGAATTCCC GGTTGCCTGG GTGGATAAAA ACGTGCAGGC CAACGGCACA GAAAAAGCCG CCAAAGCTTA CCTGAACTGG CTGTATAGCC CGCAGGCGCA GACCATCATC ACCCATTACT ACTACCGCGT GAATAACCCG GAAATCATGG GCAAGCAAGC AGATAAATTC CCGCAGACCG AACTGTTCCG CGTGGAAGAC AAGTTTGGTT CCTGGCCGGA AGTGATGAAA ACGCACTTTG CCAGCGGCGG CGAGCTGGAC AAACTGTTGG CGGCGGGGCG TAAGTAA
|
Protein sequence | MAVNLLKKRP LTLAAMLLLA GQAQATELLN SSYDVSRELF AALNPPFEQQ WAKDNGGDKL TIKQSHAGSS KQALAILQGL KADVVTYNQV TDVQILHDKG KLIPADWQSR LPNNSSPFYS TMGFLVRKGN PKNIHDWSDL VRSDVKLIFP NPKTSGNARY TYLAAWGAAD NADGGDKAKT EQFMTQFLKN VEVFDTGGRG ATTTFAERGL GDVLISFESE VNNIRKQYEA QGFEVVIPKT NILAEFPVAW VDKNVQANGT EKAAKAYLNW LYSPQAQTII THYYYRVNNP EIMGKQADKF PQTELFRVED KFGSWPEVMK THFASGGELD KLLAAGRK
|
| |