Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1900 |
Symbol | |
ID | 6972192 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1792780 |
End bp | 1794363 |
Gene Length | 1584 bp |
Protein Length | 527 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 643385833 |
Product | hypothetical protein |
Protein accession | YP_002270322 |
Protein GI | 209399841 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.00000762932 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCAGGGA AGTTTCGCTG CATTTTGCTG TTGATAGTTG GGCTTTTTTT CTCTTCGTTA AGTTATGCGA AAAACACGGA GATGCCTTCT TATGAAGAAG GGATCTCGCT CTTTGATGTT GAAGCCACTC TGCAACCGGA TGGGGTGCTC GACATCAAAG AAAATATTCA TTTTCAGGCG CGAAATCAGC AGATTAAGCA CGGATTTTAT CGTGATTTAC CACGACTATG GATGCAGCCT GATGGGGACG CTGCACTGCT GAACTATCAT ATTGTTGGCG TCACCCGTGA TGGTATTCCT GAACCCTGGC ATCTTGACTG GCATATCGGG TTAATGAGTA TTGTCGTGGG CGATAAACAA CGTTTCTTGC CTCAAGGCGA CTATCATTAT CAAATTCATT ATCAGGTTAA AAATGCTTTC CTGCGTGAGG GAGATTCAGA TCTGTTAATC TGGAACGTGA CTGGTAACCA CTGGCCGTTT GAAATCTATA AGACCCGATT TTCACTCAAG TTCCCTGATA TCGCGGGTAA TCCATTTAGC GAAATCGATC TCTTTACTGG AGAAGAGGGC GACACATATC GAAATGGCCG CATCCTTGAG GACGGAAGAA TTGAATCCAG CGATCCGTTT TATCGTGAAG ATTTCACGGT ACTCTACCGT TGGCCTCACT CGTTACTTAG CAATGCCCCG GCTCCACAAA CGACGAATAT TTTCAGCCAT CTTCTTTTAC CCTCCACGTC ATCGTTGTTA ATTTGGTTTC CGTGTCTATT CCTGGTTTGT GGATGGTTAT ATCTCTGGAA GCGCAGGCCG CAATTTACGT CGGTAGATGT TTTACGGATG CAGAAATTTT ATCTGCCGCG TAAAAAGTCT TCGTTTTACC GGCCTGATAC TTTTTTGCAA TGGGGTGGAC TGGCAATATT GGCGGTCATT CTTTACGGTA ACCTGAGTCC TGTAGGCTGG GCAGGAATGA GTCTGGTGGG CGATATGTTT ATTATGATCT GCTGGCTTCT TCCTTTTTTA TTTTGTTCCC TTGAGCTTTT GTTTGCCCGC GATGATGACA AGCCTTGCGT TAATCGTGTA ATCATCACTT TGTTTTTACC GCTGATTTGT TCAGGCGTGG CCTTTTATTC TCTCTATATC AATGTCGGAG ATGTATTCTT TTACTGGTAT ATGCCAGCGG GTTATTTTAG CGCTGTTTTC CTGACCGGTT ATCTCACTGG CATGGGGTAT ATTTTTCTGC CAAAGTTTAC CCAAACTGGG CAGCAACGTT ATGCCCACGG TGAAGCTATC GTTAACTATC TCGCGCGTAA AGAGGCAGCA ACACACAGTG GGCGTCGGCG GAAAGGGGAA ACACGGAAAC TGGATTACGC GTTGCTAGGT TGGGCTGTCT CAGCAAACCT TGGAAGAGAA TGGGTAGCAC GTATCACCCC ATCACTCACA GCGGCTGTTC GCGCCCCGGA AATTGCCCGT AGTGGCGTTT TGTTCTCATT ACAGATGCAC CTGAGTCTGG GGGCCAATAC CAGTTTATTG GGGCGAAGTT ATTCCGGTGG TGCTGCGGGT GGCGGAGGCG GTGGTGGCTG GTAA
|
Protein sequence | MAGKFRCILL LIVGLFFSSL SYAKNTEMPS YEEGISLFDV EATLQPDGVL DIKENIHFQA RNQQIKHGFY RDLPRLWMQP DGDAALLNYH IVGVTRDGIP EPWHLDWHIG LMSIVVGDKQ RFLPQGDYHY QIHYQVKNAF LREGDSDLLI WNVTGNHWPF EIYKTRFSLK FPDIAGNPFS EIDLFTGEEG DTYRNGRILE DGRIESSDPF YREDFTVLYR WPHSLLSNAP APQTTNIFSH LLLPSTSSLL IWFPCLFLVC GWLYLWKRRP QFTSVDVLRM QKFYLPRKKS SFYRPDTFLQ WGGLAILAVI LYGNLSPVGW AGMSLVGDMF IMICWLLPFL FCSLELLFAR DDDKPCVNRV IITLFLPLIC SGVAFYSLYI NVGDVFFYWY MPAGYFSAVF LTGYLTGMGY IFLPKFTQTG QQRYAHGEAI VNYLARKEAA THSGRRRKGE TRKLDYALLG WAVSANLGRE WVARITPSLT AAVRAPEIAR SGVLFSLQMH LSLGANTSLL GRSYSGGAAG GGGGGGW
|
| |