Gene Information |
Locus tag | ECH74115_3923 |
Symbol | proX |
ID | 6967264 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 3635519 |
End bp | 3636511 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643387697 |
Product | glycine betaine transporter periplasmic subunit |
Protein accession | YP_002272145 |
Protein GI | 209399990 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components |
TIGRFAM ID | |
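The coordinate and length fields in this record agree with each other, which a little arithmetic can confirm. The sketch below assumes that Start bp and End bp are 1-based and inclusive (the record does not say so, but the numbers are consistent with it) and that the protein length excludes the stop codon. The function names are illustrative only and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


n = gene_length(3635519, 3636511)
print(n, protein_length(n))  # prints "993 330"
```

Both values match the Gene Length (993 bp) and Protein Length (330 aa) fields.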
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 0.334646 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGACATA GCGTACTTTT TGCGACAGCG TTTGCCACGC TTATCTCTAC ACAAACTTTT GCTGCCGACC TGCCGGGCAA AGGCATTACT GTTAATCCGG TTCAGAGCAC CATCACCGAA GAAACTTTCC AGACGCTGCT GGTCAGCCGC GCGCTGGAGA AATTAGGTTA TACCGTCAAT AAACCCAGCG AAGTGGATTA CAACGTTGGC TACACCTCGC TTGCTTCCGG CGATGCAACC TTCACCGCCG TGAACTGGAC GCCACTGCAT GACAACATGT ACGAAGCTGC CGGTGGCGAT AAGAAATTTT ATCGTGAAGG GGTATTTGTT AACGGCGCGG CACAGGGTTA CCTGATCGAT AAGAAAACCG CCGACCAGTA CAAAATCACC AACATCGCAC AACTGAAAGA TCCGAAGATC GCCAAACTGT TCGATACCAA CGGCGACGGA AAAGCGGATT TAACCGGTTG TAACCCTGGC TGGGGCTGCG AAGGTGCGAT CAACCACCAG CTTGCCGCGT ATGAACTGAC CAATACCGTG ACGCATAATC AGGGGAACTA CGCGGCGATG ATGGCCGACA CCATCAGTCG TTACAAAGAG GGCAAACCGG TGTTTTACTA CACCTGGACG CCGTACTGGG TGAGTAATGA GCTGAAGCCA GGCAAAGATG TGGTCTGGTT GCAGGTGCCG TTCTCTGCAC TGCCGGGCGA TAAAAATGCC GATACCAAAC TGCCGAATGG CGCGAATTAC GGCTTCCCGG TCAGCACCAT GCATATCGTT GCCAATAAAG CCTGGGCCGA GAAAAACCCG GCAGCAGCGA AACTGTTTGC CATTATGCAG TTGCCAGTGG CAGATATTAA CGCCCAGAAC GCCATTATGC ATGACGGCAA AGCCTCAGAA GGCGATATTC AGGGCCACGT TGATGGCTGG ATCAAAGCCC ACCAGCAACA GTTCGATGGC TGGGTGAATG AGGCGTTGGC AGCGCAGAAG TAA
|
Protein sequence | MRHSVLFATA FATLISTQTF AADLPGKGIT VNPVQSTITE ETFQTLLVSR ALEKLGYTVN KPSEVDYNVG YTSLASGDAT FTAVNWTPLH DNMYEAAGGD KKFYREGVFV NGAAQGYLID KKTADQYKIT NIAQLKDPKI AKLFDTNGDG KADLTGCNPG WGCEGAINHQ LAAYELTNTV THNQGNYAAM MADTISRYKE GKPVFYYTWT PYWVSNELKP GKDVVWLQVP FSALPGDKNA DTKLPNGANY GFPVSTMHIV ANKAWAEKNP AAAKLFAIMQ LPVADINAQN AIMHDGKASE GDIQGHVDGW IKAHQQQFDG WVNEALAAQK
|
| |
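The sequences above are printed in space-separated blocks of ten, so the spaces need to be removed before any analysis. The following is a minimal sketch of that cleanup, together with a GC fraction and a translation. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ in which codons may act as start codons). The helper names `clean`, `gc_fraction` and `translate` are hypothetical, and only the first 30 bases of the gene are used as input.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order: first base T, C, A, G,
# then second base, then third base. "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block-of-ten spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


first_30_bases = "ATGCGACATA GCGTACTTTT TGCGACAGCG"
print(translate(first_30_bases))  # prints "MRHSVLFATA"
```

The output matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to the same functions should give values comparable to the Protein Length and GC content fields.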