Gene Information |
Locus tag | ECH74115_2080 |
Symbol | |
ID | 6970802 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 1978202 |
End bp | 1979461 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 643385985 |
Product | leucine-rich repeat protein |
Protein accession | YP_002270474 |
Protein GI | 209397006 |
COG category | [S] Function unknown |
COG ID | [COG4886] Leucine-rich repeat (LRR) protein |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 66 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTCC CTTCAATATT TAACAAAATA AAACCACAAT CCATACAGCA ACATCCAGAA AAAAATCAAC TTAACTGGAT GCTCGAATTA AATAAATGGA AAGAAGAACG TATACTTACA GGTGAAATCC ATCGTCCGGA ATGTCGAAAC GAAGCCGCTA AAAGGATAAG CTGTGCTTTT TTGTCGAAAC AGAATGACAT TGATTTATCA GGACTTAATT TATCTACTCA ACCACCAGGG CTGCAAAACT TCACCTCTAT CAATCTTGAT AATAACCAAC TCACACATTT TGATGCAACC AACTACGATA GACTCGTAAA ACTTAGTCTG AATAGTAACA CTCTTGAGTC AATAAATATT CATCAAGGCA GAAATGTAAG CATTACACAT ATATCTATGA ATAATAATTG TCTCAGAAAT ATTGATATAG ATAGGCTTTC ATCAATTACT TATTTTAGTG CGGCACATAA TAAACTAGAG TTTGTGCAAT TAGAATCTTG CGAATGGCTG CAATACCTGA ATCTCAGCCA TAATCAATTA ACTGATATTG TTACAGGAAA TAAAGAAGAA CTCTTACTGC TGGATCTATC CCATAATAAA CTAGCAAGTT TACACAATGC CTTATTTCCC AACTTAAATA CGTTACTTAT CAACAACAAC TTGCTTTCTG AAATTAAAAT GTTTTATAGC AACTTCTGCA AAGTTCAGAC ATTAAACGCT GCTAACAATC AGTTGGAAAA AATAAACCTT CATTTCCTGA CTTATCTTTC ATCTATCAAA AGTTTAAGGC TGGACAATAA TAAAATAACT CGCATTGATA CTGAGAACAC ATCCGATATT AGAAGTTTAT TCCCCATAAT AAAGAAGAGC GAAAGCTTAA ATTTTTTAAA TATTTCTGGC GAGAACAATT GCCCTACTAT CCAGCTCATG TTATTTAATT TGTTTTCCCC AGCACTTAAG CTTAATACTG GCCTGGCAAT TCTTTCGCCT GGTGCATTTG AAGATCACTC TGACGGATTA GATGTGGATA ACGAATTGTT TCACTATACT ATTAATAAAG CATATACCCC ATATAATATA CATACTTATA AAACAGAAGA AGTTGTAAAC CAGAGGAATA TAAAAATTAA AAATATGACC TTAGATGAAA TAAACAATAC TTATTGTAAT AACGATTATT ACAATGAGGC AATAAGAGAG GAACCGATAG ACTTTCTGGA CAGATCGTTT TCCTCCAGCT CATGGCCTTT TTATCACTAA
|
Protein sequence | MKFPSIFNKI KPQSIQQHPE KNQLNWMLEL NKWKEERILT GEIHRPECRN EAAKRISCAF LSKQNDIDLS GLNLSTQPPG LQNFTSINLD NNQLTHFDAT NYDRLVKLSL NSNTLESINI HQGRNVSITH ISMNNNCLRN IDIDRLSSIT YFSAAHNKLE FVQLESCEWL QYLNLSHNQL TDIVTGNKEE LLLLDLSHNK LASLHNALFP NLNTLLINNN LLSEIKMFYS NFCKVQTLNA ANNQLEKINL HFLTYLSSIK SLRLDNNKIT RIDTENTSDI RSLFPIIKKS ESLNFLNISG ENNCPTIQLM LFNLFSPALK LNTGLAILSP GAFEDHSDGL DVDNELFHYT INKAYTPYNI HTYKTEEVVN QRNIKIKNMT LDEINNTYCN NDYYNEAIRE EPIDFLDRSF SSSSWPFYH
|
| |
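The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive positions and that Gene Length includes the stop codon, both of which agree with the values listed but are not stated by the record itself. The function names (`span_length`, `expected_protein_length`, `gc_percent`) are hypothetical helpers, not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of residues for a CDS whose length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide string; spaces and newlines are ignored."""
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Values copied from the record above.
start_bp, end_bp = 1978202, 1979461
gene_length_bp = 1260
protein_length_aa = 419

assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

The `gc_percent` helper accepts the Gene sequence field as printed, in space-separated blocks of ten bases, so the GC content entry could be checked the same way by passing that string in.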