Gene Information |
Locus tag | ECH74115_1935 |
Symbol | sapA |
ID | 6972225 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 1830415 |
End bp | 1832034 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643385865 |
Product | peptide ABC transporter, periplasmic peptide-binding protein |
Protein accession | YP_002270354 |
Protein GI | 209399749 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
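The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length; the function name `cds_lengths` is hypothetical and not part of the database.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1   # inclusive interval
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    protein_length = codons - 1           # stop codon is not translated
    return gene_length, protein_length


# Coordinates taken from the record above (Start bp, End bp).
gene_len, protein_len = cds_lengths(1830415, 1832034)
print(gene_len, protein_len)
```

Under these assumptions the result agrees with the listed Gene Length (1620 bp) and Protein Length (539 aa). The strand does not affect the arithmetic, only which end of the interval holds the start codon.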
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 0.101699 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGTGATTG CTGGACTTGT GAGTGGTCAG GCAATCGCCG CGCCTGAATC CCCCCCGCAT GCTGATATCC GCGACAGCGG TTTTGTCTAT TGCGTCAGCG GGCAAGTCAA CACCTTTAAC CCATCCAAAG CGAGCAGTGG GTTAATTGTC GATACCCTTG CCGCCCAGTT TTATGATCGA CTGCTGGATG TCGATCCCTA TACCTATCGC CTGATGCCAG AACTTGCCGA AAGCTGGGAA GTACTCGACA ACGGCGCGAC CTATCGCTTC CACCTGCGTC GCGATGTCCC GTTTCAAAAA ACCGCCTGGT TTACTCCCAC TCGCAAAATG AATGCCGACG ATGTGGTGTT TACCTTCCAG CGAATTTTTG ACCGCAACAA CCCGTGGCAT AACGTCAACG GCAGCAACTT CCCCTACTTC GACAGCCTGC AATTTGCCGA TAACGTGAAA AGCGTCCGTA AACTGGATAA TCATACCGTT GAGTTCCGTC TGGCTCAGCC GGATGCTTCT TTTTTGTGGC ACCTCGCAAC CCATTATGCT TCGGTCATGT CGGCAGAATA TGCCCGGAAG TTAGAGAAAG AAGATCGCCA GGAGCAACTC GACCGTCAAC CGGTCGGCAC TGGACCGTAT CAGTTGTCGG AATACCGCGC CGGGCAATTT ATTCGCCTAC AACGTCATGA TGACTTCTGG CGCGGTAAAC CGTTAATGCC GCAGGTGGTG GTGGATTTAG GCTCCGGCGG CACCGGACGT CTGTCGAAAC TCCTGACCGG GGAATGCGAC GTTCTGGCCT GGCCTGCTGC CAGCCAGCTA TCCATTTTGC GTGACGACCC GCGCTTGCGT TTAACGCTGC GTCCTGGGAT GAACGTCGCC TATCTGGCAT TTAACACCGC CAAACCGCCG CTAAATAATC CCGCTGTCCG CCATGCGCTG GCACTGGCGA TTAATAACCA GCGCCTGATG CAATCCATCT ATTATGGTAC GGCTGAAACG GCGGCCTCTA TTTTACCGCG CGCCTCGTGG GCCTATGACA ACGAGGCTAA AATTACTGAA TACAATCCGG CGAAATCGCG CGAACAGTTG AAGTTGTTGG GGCTGGAAAA TTTAACGCTG AAACTGTGGG TGCCCACACG TTCGCAGGCG TGGAACCCCA GTCCACTGAA AACTGCCGAA CTGATTCAGG CGGATATGGC GCAGGTTGGC GTAAAAGTGG TGATTGTGCC GGTAGAAGGT CGCTTTCAGG AGGCGCGGTT GATGGATATG AGCCATGATC TGACGTTATC CGGTTGGGCG ACGGACAGTA ACGACCCGGA CAGTTTCTTC CGTCCGTTAC TGAGCTGCGC GGCAATTCAT TCACAGACCA ACCTCGCCCA CTGGTGCGAT CCGAAATTCG ACAGCGTGTT GCGTAAGACG CTCTCCTCGC AGCAGCTGGC GGCGCGTATT GAAGCCTATG ACGAAGCGCA GAGTATTCTG GCGCAGGAAT TGCCCATTTT GCCGCTGGCG TCGTCATTGC GTTTGCAGGC CTATCGGTAC GATATCAAAG GTCTGGTACT TAGCCCGTTT GGTAACGCCT CCTTTGCTGG GGTGTATCGC GAGAAACAGG ATGAGGTGAA AAAACCATGA
|
Protein sequence | MVIAGLVSGQ AIAAPESPPH ADIRDSGFVY CVSGQVNTFN PSKASSGLIV DTLAAQFYDR LLDVDPYTYR LMPELAESWE VLDNGATYRF HLRRDVPFQK TAWFTPTRKM NADDVVFTFQ RIFDRNNPWH NVNGSNFPYF DSLQFADNVK SVRKLDNHTV EFRLAQPDAS FLWHLATHYA SVMSAEYARK LEKEDRQEQL DRQPVGTGPY QLSEYRAGQF IRLQRHDDFW RGKPLMPQVV VDLGSGGTGR LSKLLTGECD VLAWPAASQL SILRDDPRLR LTLRPGMNVA YLAFNTAKPP LNNPAVRHAL ALAINNQRLM QSIYYGTAET AASILPRASW AYDNEAKITE YNPAKSREQL KLLGLENLTL KLWVPTRSQA WNPSPLKTAE LIQADMAQVG VKVVIVPVEG RFQEARLMDM SHDLTLSGWA TDSNDPDSFF RPLLSCAAIH SQTNLAHWCD PKFDSVLRKT LSSQQLAARI EAYDEAQSIL AQELPILPLA SSLRLQAYRY DIKGLVLSPF GNASFAGVYR EKQDEVKKP
|
| |
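The sequence fields can be related to the GC content and translation table entries with a short script. This is a sketch rather than the database's own method: it assumes the gene sequence is given 5' to 3' on the coding strand in space-separated blocks, builds the standard codon table used by NCBI translation table 11, and treats a table 11 start codon in the first position as methionine (which is why the initial TTG above corresponds to M). The helper names `clean`, `gc_content` and `translate_cds` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments as the standard code).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
# Initiation codons permitted by NCBI translation table 11.
START_CODONS_T11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading an initial table 11 start codon as M."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        codon = s[i:i + 3]
        if i == 0 and codon in START_CODONS_T11:
            protein.append("M")       # alternative start codons initiate with Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":                 # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
prefix = "TTGGTGATTG CTGGACTTGT GAGTGGTCAG"
print(translate_cds(prefix), round(gc_content(prefix), 2))
```

Applied to these first 30 bases, the translation gives MVIAGLVSGQ, matching the first block of the protein sequence. Passing the full gene sequence to the same functions is a way to check the complete translation and the reported GC content.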