Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1281 |
Symbol | |
ID | 6970364 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1292207 |
End bp | 1296019 |
Gene Length | 3813 bp |
Protein Length | 1270 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 643385269 |
Product | haemagglutination activity domain protein |
Protein accession | YP_002269764 |
Protein GI | 209399804 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01731] adhesin HecA family 20-residue repeat (two copies) [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.326078 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCGATGA TAAATTTAAG TAAGGAAGCA ACGGTGGGGA AAGCATTAAC CCCTATTGCT ATACTTATGA TGTTGTCTTT TCCTGTAGCT TCTCAAGCGG CGGGATTAGT CATAAAAAAT GGAACGGTAT ATAACGCCAA TGGTGTGCCA GTCGTTGACA TCAACAAACC TAACGGTAGC GGTTTATCTC ATAATATCTG GGATAACCTA AACGTTGATA AAAATGGTGT CGTTTTCAAT AATAGCGCTA ATGAATCCAG TACTTCACTT GCCGGAAATA TTCAGGGAAA CAGTAATCTG ACCTCCGGGT CGGCGAAGGT GATCCTGAAT GAGGTTACTT CCAAAAATCC TTCAACCATT AATGGGATGA TGGAAGTTGC AGGGGATAAA GCGGATCTGA TTATTGCCAA CCCGAATGGT ATTACTGTAA ACGGTGGCGG TTCAATCAAT ACAGGTAAAC TTACCTTAAC CACCGGGACG CCGGATATCC AGGATGACAA GCTGGCCGGT TACTCCGTGA ACGGCGGTAC CATTACGCTC GGTAAACTGG ATAACGCCAG CCCGACAGAA ATTCTGTCCC GTAACGTGGT AGTTAACGGC AAAGTGTCTG CCGATGAGCT GAACGTTGTT GCTGGCAATA ACTATGTTAA TGCCGCAGGC CAGGTGACCG GTAGCGTATC CGCCACGGGG TCCCGTAACG GTTACAGCGT AGATGTTGCC AAACTGGGCG GAATGTATGC GAACAAAATC AGTCTGGTCA GCACCGAGAA AGGTGTGGGG GTTCGCAACC TCGGCGTTAT TGCTGGGGGT GTTAATGGTG TCAGCATCGA TTCCAAAGGT AACCTGTTAA ACAGTAACGC CCAGATTCAG TCTGCAAGCA CGATCAACCT GACAACAAAT GGTACTCTGG ATAACACCAC CGGTACGGTG ACATCTGTAG GCACTATCTC GCTTAATACC AACAAGAATA CTATCGTGAA TACCCGTGCG GGTAACATCT CTACGATGGG CGATATCTAC GTTAACAGCG GTACGATTGA CAATACTAAC GGCAAGCTTG CGGCTGCAGG AATGCTGGCG GTTGATACCA ATAACGCCAC GCTGATTAAC TCTGGTAAAG GGAGTTCTGT CGGGATTGAA GCGGGGCTCG TGGCGCTGAA AACCGGAACG CTCAACAACA GCAATGGTCA GATTCGCGGT GGCTATGTGG GTCTTGAATC CGCTGCGCTG AATAACAACA ACGGTGATAT CCAGACCACC GGCGATATCG CCATTATCAG TAACGGTAAT GTGGATAACA ACAAAGGTCT GATCCGTTCG TCCACCGGGC ATATCGTTAT TGGCGCGGCA GGTAGCGTAA ATAATGGTTC AACCAAAACC GCCGATACCG GCAGTTCTGA CTCTCTGGGC ATTATTGCAG ATACCGGCGT AGAAATTGGT GCGAACAACA TCAATAACAA CGGCGGACAG ATTGCGTCTA ATGGCAACGT CTCCCTGTCA AGTTACAGCA CGATCGACGA CTATGCGGGC AAAATTCTGT CCAACAGCAA AGTGATTATC AAGGGAAGCT CTCTGCGTAA CGATACCGGG GGGATCAGCG GTAAGCAGGG TATTGAAGTC GCCGTTGGCG GCAGCCTGAC CAATAATATT GGCGTGATCA GCTCTGAAGA GGGTGATATC TCCCTGTTAG CCAACTCCGT GGATAACCAC GGCGGCTTCA TGATGGGGCA GAACATCACG ATGGAGTCGA TGTCTGGCGT CAATAACAAC ACAGCGCTGA TCGTGGCCAG CAAAAAACTG AAGATAAATG CGCGCGGCAG TATCGAAAAC CGCGATGGCA ATAACTTCGG TAATGCTTAT GGTCTGTACT TCGGCATGCC TCAGCAAACG GGTGGAATGG TCGGCAAGGA AGGCATCGAG CTTTCCGGGC AGAACATCTA TAACAACAAC AGCCGTCTTA TCGCTGAGGA TGGTCCTCTG ACTCTGCAGG CGCAGAACAC GTTCGACAAC ACGCGTGCTC TGGTCACCAG CGGGGCGGAT GCATCTATTC AGGTTGGCGG AACGTATTAT AACAACTACG CTACCACCTG GAGTGCGGGC AACCTGGATA TCGACGCGAC CACGCTGCAA AACAGCAGCA GCGGTACGAT GATCGATAAC AATGCGACCG GGTTCATAGC ATCTGATAAA AACCTGTCAC TGGAAGTGGT GAATAGCCTT ACCAACTACG GCTGGATCAG CGGTAAAGGC GATGTTGATG TCACGGTGAA TAACGGCAAC CTGTATAACC GCAATACCAT TGCGGCTGAA AAGGGGCTGG ATATTGCCGC GTTGAACGGT ATTGAAAACT GGAAGGATAT TTCTGCTGGC GGCGACCTGA CGATGAACAC CAATCGCCAT GTGACCAACA ACTCCAACAG CAATATGGTG GGGCAGAATA TTGTTATTAA CGCGGTTAAC GATATCAACA ACCGTGGCAA CATTGTCAGT GACGCTGACC TGAACGTGAC GACCAAAGGC AACCTGTATA ACTATCTCTA TATGGTAGGG TATGGGGATA TCGCATTGTC GGCAAATAGC GTGGCGAACA ATAACGCGAC CATCGAAGCG ACAGGCGATC TGATTATCGA TTCGAAGGGT AACGTGGGTA ACAACCGCGG TAATCTGCAT GCGTTGAACG GCGTGTTGTC TGTTAAAGGC AACAATCTGA ACAACGATAA CGGTGAAATT CGTGGTTATG GCGATGTCAC GCTGGCACTG ACGGGCAACT ACGACAGCTA TAAGGGTTCG CTGACCTCTG AAACGGGCGA CGTGACTCTG ACGGCGAACA TTGTAGACAA CGCCTATGGT TTGATTGCCG GTGAGAATGT TTCTGTCGAT GCTAAATCGA CGATTTACAA CAACACTGCG CTGATCGCGG CGAATAAAAA GCTGGTTATT AACGCTGGCG GCAACCTCGA AAACCGCGAC GGGAATAACT TCCTGCGTAA TAACGGCGCG CTGTTTGGAA TTACCGACAA CGTTGGCGGC ATCGTAGGTA AAGAAGGTGT CACGCTTTCT GCTCAGAACG TCTACAACAA TAACAGCAGC ATCATCGCTG AAAATGGTCC GCTTAATCTG CTGTCCAGGG GAACGCTGGA TAATACCCGC GCGCTTCTTA GCAGTGGGGC TGATGCCATC ATCCGTGCGG CAGGGACGTT CTACAACAAC TATGCCACCA CGTACAGCGC CGGTAATCTC GACGTTTATG CGGCGTCGTT GAACAACGCC AGCGATGGTC GCCTGGAAGA CAATACCGCC ACGGGCGTGA TTGCGTCTGA CAAAAACCTG GATCTGAGCG TTGATAACAG TGTCACTAAC TATGGTTGGA TCAGCGGTAA AGGAGATGTG CATTTCAATG TTCTGAAAGG CACGCTGTAT AACCGTAATG CCATCGCGGC GGACAACGCG CTGACCATTA ATGCCCTGAA CGGTGTTGAG AACTTTAAAG ACATTGTGGC GGGTACTGCG CTGACTATTG ATACGCAGAA GTATGTTACC AACAACAGCA ACAGTAATAT GTTGGGACAA ACCATCGCGA TCAATGCCGT GAATGACATT AATAACCGTG GAAATATTGT GGGTGATTAT TCTCTGGGTG TTAAAACCAC CGGTAATATT TATAACTACC TCAATATGCT GAGTTATGGT GTCGCTGGCG TATCGGCAAA TAAGGTTACG AATAGCGGTA AAGACGCTGT TCTCGGTGGC TTCTACGGTT TAGCGTTAGA AGCAAACGAA ACTGATAACA CCGGTACTAT TGTCGGCATG TAA
|
Protein sequence | MAMINLSKEA TVGKALTPIA ILMMLSFPVA SQAAGLVIKN GTVYNANGVP VVDINKPNGS GLSHNIWDNL NVDKNGVVFN NSANESSTSL AGNIQGNSNL TSGSAKVILN EVTSKNPSTI NGMMEVAGDK ADLIIANPNG ITVNGGGSIN TGKLTLTTGT PDIQDDKLAG YSVNGGTITL GKLDNASPTE ILSRNVVVNG KVSADELNVV AGNNYVNAAG QVTGSVSATG SRNGYSVDVA KLGGMYANKI SLVSTEKGVG VRNLGVIAGG VNGVSIDSKG NLLNSNAQIQ SASTINLTTN GTLDNTTGTV TSVGTISLNT NKNTIVNTRA GNISTMGDIY VNSGTIDNTN GKLAAAGMLA VDTNNATLIN SGKGSSVGIE AGLVALKTGT LNNSNGQIRG GYVGLESAAL NNNNGDIQTT GDIAIISNGN VDNNKGLIRS STGHIVIGAA GSVNNGSTKT ADTGSSDSLG IIADTGVEIG ANNINNNGGQ IASNGNVSLS SYSTIDDYAG KILSNSKVII KGSSLRNDTG GISGKQGIEV AVGGSLTNNI GVISSEEGDI SLLANSVDNH GGFMMGQNIT MESMSGVNNN TALIVASKKL KINARGSIEN RDGNNFGNAY GLYFGMPQQT GGMVGKEGIE LSGQNIYNNN SRLIAEDGPL TLQAQNTFDN TRALVTSGAD ASIQVGGTYY NNYATTWSAG NLDIDATTLQ NSSSGTMIDN NATGFIASDK NLSLEVVNSL TNYGWISGKG DVDVTVNNGN LYNRNTIAAE KGLDIAALNG IENWKDISAG GDLTMNTNRH VTNNSNSNMV GQNIVINAVN DINNRGNIVS DADLNVTTKG NLYNYLYMVG YGDIALSANS VANNNATIEA TGDLIIDSKG NVGNNRGNLH ALNGVLSVKG NNLNNDNGEI RGYGDVTLAL TGNYDSYKGS LTSETGDVTL TANIVDNAYG LIAGENVSVD AKSTIYNNTA LIAANKKLVI NAGGNLENRD GNNFLRNNGA LFGITDNVGG IVGKEGVTLS AQNVYNNNSS IIAENGPLNL LSRGTLDNTR ALLSSGADAI IRAAGTFYNN YATTYSAGNL DVYAASLNNA SDGRLEDNTA TGVIASDKNL DLSVDNSVTN YGWISGKGDV HFNVLKGTLY NRNAIAADNA LTINALNGVE NFKDIVAGTA LTIDTQKYVT NNSNSNMLGQ TIAINAVNDI NNRGNIVGDY SLGVKTTGNI YNYLNMLSYG VAGVSANKVT NSGKDAVLGG FYGLALEANE TDNTGTIVGM
|
| |