Gene Information |
Locus tag | ECH74115_5372 |
Symbol | sbp |
ID | 6971262 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 5011906 |
End bp | 5012895 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643389026 |
Product | sulfate transporter subunit |
Protein accession | YP_002273435 |
Protein GI | 209400686 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1613] ABC-type sulfate transport system, periplasmic component |
TIGRFAM ID | [TIGR00971] sulfate/thiosulfate-binding protein |
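The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 5011906
END_BP = 5012895


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues in a complete CDS: codon count minus one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints: 990 329
```

Under these assumptions the computed values agree with the Gene Length (990 bp) and Protein Length (329 aa) fields.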
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 68 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAACAAGT GGGGCGTAGG GTTAACATTT TTGCTGGCGG CAACCAGCGT TATGGCAAAG GATATTCAGC TTCTTAACGT TTCATATGAT CCAACGCGCG AATTGTACGA ACAGTACAAC AAGGCATTCA GCGCCCACTG GAAACAGCAA ACTGGCGATA ACGTGGTGAT TCGTCAGTCA CACGGTGGTT CAGGCAAACA AGCAACGTCG GTAATCAACG GTATTGAAGC TGATGTTGTC ACGCTGGCTC TGGCCTATGA CGTGGACGCG ATTGCGGAAC GCGGGCGGAT TGATAAAGAG TGGATCAAAC GTCTGCCGGA TAACTCCGCA CCGTACACTT CCACCATTGT TTTCCTGGTA CGTAAGGGAA ATCCGAAGCA GATCCATGAC TGGAACGATC TGATTAAACT GGGTGTTTCG GTGATCACGC CTAATCCGAA AAGCTCTGGT GGCGCGCGCT GGAACTACCT GGCAGCCTGG GGCTACGCGC TGCATCACAA CAACAACGAT CAGGCAAAAG CACAGGATTT TGTTCGGGCA CTGTATAAAA ACGTCGAAGT TCTGGATTCT GGCGCGCGCG GCTCCACTAA CACTTTTGTC GAGCGCGGAA TTGGCGATGT ACTGATTTCC TGGGAAAACG AAGCTCTGCT GGCAGCGAAT GAACTGGGGA AAGATAAATT CGAAATCGTC ACGCCGAGTG AGTCTATCCT CGCAGAGCCA ACCGTGTCGG TGGTCGATAA AGTGGTCGAG AAAAAAGGTA CTAAAGAGGT GGCGGAAGCC TACCTGAAAT ATCTCTACTC GCCAGAAGGT CAGGAAATTG CCGCGAAAAA CTACTACCGT CCGCGCGACG CTGAGGTGGC GAAAAAGTAC GAAAATGCGT TTCCAAAGCT GAAGTTATTC ACCATTGATG AAGAGTTCGG CGGCTGGACG AAAGCGCAAA AAGAGCATTT TGCTAACGGC GGTACGTTCG ATCAGATCAG CAAACGCTGA
Protein sequence | MNKWGVGLTF LLAATSVMAK DIQLLNVSYD PTRELYEQYN KAFSAHWKQQ TGDNVVIRQS HGGSGKQATS VINGIEADVV TLALAYDVDA IAERGRIDKE WIKRLPDNSA PYTSTIVFLV RKGNPKQIHD WNDLIKLGVS VITPNPKSSG GARWNYLAAW GYALHHNNND QAKAQDFVRA LYKNVEVLDS GARGSTNTFV ERGIGDVLIS WENEALLAAN ELGKDKFEIV TPSESILAEP TVSVVDKVVE KKGTKEVAEA YLKYLYSPEG QEIAAKNYYR PRDAEVAKKY ENAFPKLKLF TIDEEFGGWT KAQKEHFANG GTFDQISKR
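The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short sketch. It builds the codon-to-amino-acid mapping used by translation table 11, which assigns the same amino acids as the standard code; the table's alternative start codons are not handled here, which does not matter for this gene because it begins with ATG. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and the example uses only the first three 10-base blocks of the gene sequence shown above.

```python
from itertools import product

# NCBI ordering: first base varies slowest, third base fastest, over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


prefix = "ATGAACAAGT GGGGCGTAGG GTTAACATTT"
print(translate(prefix))  # prints: MNKWGVGLTF
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and multiplying by 100 gives a value that can be compared with the GC content field.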