Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3256 |
Symbol | |
ID | 6968401 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2988799 |
End bp | 2989716 |
Gene Length | 918 bp |
Protein Length | 305 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643387069 |
Product | ABC transporter, quaternary amine uptake transporter (QAT) family, substrate-binding protein |
Protein accession | YP_002271533 |
Protein GI | 209399202 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1732] Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.00281885 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCACTCT CAAAGGTCTG GGCAGGTTCA CTGGTTTTGT TGGCAGCCGT GAGCCTGCCG CTGCACGCGG CTTCCCCCGT TAAAGTCGGT TCAAAAATCG ATACCGAAGG CGCGCTGCTC GGCAATATCA TTTTGCAAGT ACTCGAAAGC CACGGAGTAC CAACGGTCAA TAAAGTGCAA CTTGGAACGA CTCCTGTGGT GCGCGGGGCG ATTACTTCCG GTGAACTGGA TATCTATCCG GAATATACCG GCAATGGCGC GTTTTTCTTT AAAGATGAAA ACGATGCAGC GTGGAAAAAC GCGCAGCAAG GTTACGAGAA AGTCAAAAAA CTCGATGCAG AGCAAAACAA GTTAATCTGG CTGACGCCCG CACCTGCAAA TAACACCTGG ACCATCGCCG TGCGTCAGGA TGTGGCAGAG AAAAACAAAC TCACTTCGCT TGCTGACCTG AGGCGTTATC TGAAAGAGGG CGGCACCTTC AAACTGGCAG CCTCGGCTGA GTTTATCGAA CGCGCCGATG CGTTACCCGC GTTTGAAAAA GCCTACGACT TTAAACTCGA TCAGGATCAG TTACTGTCAC TGGCTGGCGG CGACACGGCG GTAACGATTA AAGCCGCTGC CCAGCAAACT TCTGGCGTTA ATGCCGCAAT GGCTTACGGC ACTGACGGTC CGGTCGCGGC GCAGGGGCTG CAAACCTTAA GCGATCCGCA AGGCGTGCAA CCTATCTACG CGCCTGCACC AGTGGTGCGT GAGTCGGTGC TGAAAGAGTA TCCGCAAATG GCACAGTGGC TACAGCCAGT CTTCGCCAGC CTCGATGCAA AAACATTGCA GCAACTGAAT GCCAGCATTG CAGTGGAAGG ACTGGATGCC AAAAAAGTGG CTGCCGACTA CTTGAAACAA AAAGGGTGGA CGAAGTAA
|
Protein sequence | MPLSKVWAGS LVLLAAVSLP LHAASPVKVG SKIDTEGALL GNIILQVLES HGVPTVNKVQ LGTTPVVRGA ITSGELDIYP EYTGNGAFFF KDENDAAWKN AQQGYEKVKK LDAEQNKLIW LTPAPANNTW TIAVRQDVAE KNKLTSLADL RRYLKEGGTF KLAASAEFIE RADALPAFEK AYDFKLDQDQ LLSLAGGDTA VTIKAAAQQT SGVNAAMAYG TDGPVAAQGL QTLSDPQGVQ PIYAPAPVVR ESVLKEYPQM AQWLQPVFAS LDAKTLQQLN ASIAVEGLDA KKVAADYLKQ KGWTK
|
| |