Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A4148 |
Symbol | sbp |
ID | 5591651 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 4136843 |
End bp | 4137832 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640923250 |
Product | sulfate transporter subunit |
Protein accession | YP_001460709 |
Protein GI | 157163391 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1613] ABC-type sulfate transport system, periplasmic component |
TIGRFAM ID | [TIGR00971] sulfate/thiosulfate-binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 58 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAAGT GGGGCGTAGG GTTAACATTT TTGCTGGCGG CAACCAGCGT TATGGCAAAG GATATTCAGC TTCTTAACGT TTCATATGAT CCAACGCGCG AATTGTACGA ACAGTACAAC AAGGCATTCA GCGCCCACTG GAAACAGCAA ACTGGTGATA ACGTGGTGAT TCGTCAGTCA CACGGTGGCT CAGGTAAACA AGCGACGTCG GTAATCAACG GTATTGAAGC TGATGTTGTC ACGCTGGCTC TGGCCTATGA CGTGGACGCA ATTGCGGAAC GCGGGCGGAT TGATAAAGAG TGGATCAAAC GTCTGCCGGA TAACTCCGCA CCGTACACTT CCACCATTGT TTTCCTGGTA CGTAAGGGAA ATCCGAAGCA GATCCATGAC TGGAACGATC TGATTAAACC GGGTGTTTCG GTGATCACGC CTAATCCGAA AAGCTCTGGT GGCGCGCGCT GGAATTACTT GGCAGCCTGG GGCTACGCGC TGCATCACAA CAACAACGAT CAGGCAAAAG CACAGGATTT TGTTCGGGCA CTGTATAAAA ACGTCGAAGT TCTGGATTCT GGCGCGCGCG GCTCCACTAA CACTTTTGTC GAGCGCGGAA TTGGCGATGT ACTGATTGCC TGGGAAAACG AAGCTCTGTT GGCAGCGAAT GAACTGGGGA AAGATAAATT CGAAATCGTC ACGCCGAGTG AGTCTATCCT CGCGGAACCA ACCGTGTCGG TGGTCGATAA AGTGGTCGAG AAAAAAGGTA CTAAAGAGGT GGCGGAAGCC TACCTGAAAT ATCTCTACTC GCCAGAAGGT CAGGAAATTG CCGCGAAAAA CTACTACCGT CCGCGCGACG CTGAGGTGGC GAAAAAGTAC GAAAATGCGT TTCCAAAGCT GAAGTTATTC ACCATTGATG AAGAGTTCGG CGGCTGGACG AAAGCGCAAA AAGAGCATTT TGCTAACGGC GGTACGTTCG ATCAGATCAG CAAACGCTGA
|
Protein sequence | MNKWGVGLTF LLAATSVMAK DIQLLNVSYD PTRELYEQYN KAFSAHWKQQ TGDNVVIRQS HGGSGKQATS VINGIEADVV TLALAYDVDA IAERGRIDKE WIKRLPDNSA PYTSTIVFLV RKGNPKQIHD WNDLIKPGVS VITPNPKSSG GARWNYLAAW GYALHHNNND QAKAQDFVRA LYKNVEVLDS GARGSTNTFV ERGIGDVLIA WENEALLAAN ELGKDKFEIV TPSESILAEP TVSVVDKVVE KKGTKEVAEA YLKYLYSPEG QEIAAKNYYR PRDAEVAKKY ENAFPKLKLF TIDEEFGGWT KAQKEHFANG GTFDQISKR
|
| |