Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1098 |
Symbol | |
ID | 6969690 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 1128813 |
End bp | 1129772 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643385109 |
Product | alkanesulfonate transporter substrate-binding subunit |
Protein accession | YP_002269608 |
Protein GI | 209396569 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0897398 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.248761 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTAACA TCATTAAACT GGCGCTGGTG GGATTGCTTA GCGTCTCTAC GATTGTGGTT GCTGCAGAAT CCTCGCCTGA AGCGTTACGT ATAGGCTATC AGAAAGGCAG TATTGGTATG GTATTGGCAA AAAGCCACCA GTTACTGGAA AAACGCTATC CGCAATCAAA AATCTCGTGG GTGGAGTTCC CCGCTGGCCC GCAAATGCTT GAGGCGTTAA ACGTTGGCAG TATTGATCTC GGCAGTACCG GGGATATTCC GCCAATCTTC GCCCAGGCTG CCGGGGCTGA TTTGCTGTAC GTGGGCGTCG AGCCGCCGAA GCCAAAAGCC GAGGTGATTC TGGTGGCAGA AAACAGCCCG ATCAAAACCG CAGCCGATCT TAAAGGCCAC AAAGTTGCGT TCCAGAAAGG TTCCAGTTCA CACAATCTTT TACTGCGCGC ACTACGCCAG GCCGGACTTA AGTTTACTGA CATCCAGCCC ACTTATCTGA CGCCAGCTGA TGCCCGCGCC GCGTTCCAGC AAGGTAACGT TGACGCCTGG GCTATCTGGG ATCCCTACTA CTCCGCTGCA TTATTACAGG GCGGCGTGCG GGTGTTGAAA GACGGCACCG ATCTCAATCA AACCGGATCG TTTTATCTGG CAGCTCGCCC CTATGCAGAA AAAAACGGCG CTTTTATTCA GGGTGTACTG GCAACCTTTA GTGAGGCCGA TGCGTTAACC CGCAGCCAGC GCGAGCAAAG TATCGCTTTA CTGGCAAAAA CGATGGGCTT ACCGGCACCG GTGATTGCCT CTTACTTAGA TCATCGCCCT CCTACCACCA TCAAACCGGT TAACGCCGAG GTTGCCGCCT TACAGCAGCA AACGGCAGAT CTGTTTTATG AAAATCGTCT GGTGCCGAAA AAAGTCGATA TTCGCCAACG CATCTGGCAG CCCACTCAAC TGGAAGGAAA ACAATTATGA
|
Protein sequence | MRNIIKLALV GLLSVSTIVV AAESSPEALR IGYQKGSIGM VLAKSHQLLE KRYPQSKISW VEFPAGPQML EALNVGSIDL GSTGDIPPIF AQAAGADLLY VGVEPPKPKA EVILVAENSP IKTAADLKGH KVAFQKGSSS HNLLLRALRQ AGLKFTDIQP TYLTPADARA AFQQGNVDAW AIWDPYYSAA LLQGGVRVLK DGTDLNQTGS FYLAARPYAE KNGAFIQGVL ATFSEADALT RSQREQSIAL LAKTMGLPAP VIASYLDHRP PTTIKPVNAE VAALQQQTAD LFYENRLVPK KVDIRQRIWQ PTQLEGKQL
|
| |