Gene ECH74115_1281 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1281 
Symbol 
ID6970364 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1292207 
End bp1296019 
Gene Length3813 bp 
Protein Length1270 aa 
Translation table11 
GC content49% 
IMG OID643385269 
Producthaemagglutination activity domain protein 
Protein accessionYP_002269764 
Protein GI209399804 
COG category 
COG ID 
TIGRFAM ID[TIGR01731] adhesin HecA family 20-residue repeat (two copies)
[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.326078 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGATGA TAAATTTAAG TAAGGAAGCA ACGGTGGGGA AAGCATTAAC CCCTATTGCT 
ATACTTATGA TGTTGTCTTT TCCTGTAGCT TCTCAAGCGG CGGGATTAGT CATAAAAAAT
GGAACGGTAT ATAACGCCAA TGGTGTGCCA GTCGTTGACA TCAACAAACC TAACGGTAGC
GGTTTATCTC ATAATATCTG GGATAACCTA AACGTTGATA AAAATGGTGT CGTTTTCAAT
AATAGCGCTA ATGAATCCAG TACTTCACTT GCCGGAAATA TTCAGGGAAA CAGTAATCTG
ACCTCCGGGT CGGCGAAGGT GATCCTGAAT GAGGTTACTT CCAAAAATCC TTCAACCATT
AATGGGATGA TGGAAGTTGC AGGGGATAAA GCGGATCTGA TTATTGCCAA CCCGAATGGT
ATTACTGTAA ACGGTGGCGG TTCAATCAAT ACAGGTAAAC TTACCTTAAC CACCGGGACG
CCGGATATCC AGGATGACAA GCTGGCCGGT TACTCCGTGA ACGGCGGTAC CATTACGCTC
GGTAAACTGG ATAACGCCAG CCCGACAGAA ATTCTGTCCC GTAACGTGGT AGTTAACGGC
AAAGTGTCTG CCGATGAGCT GAACGTTGTT GCTGGCAATA ACTATGTTAA TGCCGCAGGC
CAGGTGACCG GTAGCGTATC CGCCACGGGG TCCCGTAACG GTTACAGCGT AGATGTTGCC
AAACTGGGCG GAATGTATGC GAACAAAATC AGTCTGGTCA GCACCGAGAA AGGTGTGGGG
GTTCGCAACC TCGGCGTTAT TGCTGGGGGT GTTAATGGTG TCAGCATCGA TTCCAAAGGT
AACCTGTTAA ACAGTAACGC CCAGATTCAG TCTGCAAGCA CGATCAACCT GACAACAAAT
GGTACTCTGG ATAACACCAC CGGTACGGTG ACATCTGTAG GCACTATCTC GCTTAATACC
AACAAGAATA CTATCGTGAA TACCCGTGCG GGTAACATCT CTACGATGGG CGATATCTAC
GTTAACAGCG GTACGATTGA CAATACTAAC GGCAAGCTTG CGGCTGCAGG AATGCTGGCG
GTTGATACCA ATAACGCCAC GCTGATTAAC TCTGGTAAAG GGAGTTCTGT CGGGATTGAA
GCGGGGCTCG TGGCGCTGAA AACCGGAACG CTCAACAACA GCAATGGTCA GATTCGCGGT
GGCTATGTGG GTCTTGAATC CGCTGCGCTG AATAACAACA ACGGTGATAT CCAGACCACC
GGCGATATCG CCATTATCAG TAACGGTAAT GTGGATAACA ACAAAGGTCT GATCCGTTCG
TCCACCGGGC ATATCGTTAT TGGCGCGGCA GGTAGCGTAA ATAATGGTTC AACCAAAACC
GCCGATACCG GCAGTTCTGA CTCTCTGGGC ATTATTGCAG ATACCGGCGT AGAAATTGGT
GCGAACAACA TCAATAACAA CGGCGGACAG ATTGCGTCTA ATGGCAACGT CTCCCTGTCA
AGTTACAGCA CGATCGACGA CTATGCGGGC AAAATTCTGT CCAACAGCAA AGTGATTATC
AAGGGAAGCT CTCTGCGTAA CGATACCGGG GGGATCAGCG GTAAGCAGGG TATTGAAGTC
GCCGTTGGCG GCAGCCTGAC CAATAATATT GGCGTGATCA GCTCTGAAGA GGGTGATATC
TCCCTGTTAG CCAACTCCGT GGATAACCAC GGCGGCTTCA TGATGGGGCA GAACATCACG
ATGGAGTCGA TGTCTGGCGT CAATAACAAC ACAGCGCTGA TCGTGGCCAG CAAAAAACTG
AAGATAAATG CGCGCGGCAG TATCGAAAAC CGCGATGGCA ATAACTTCGG TAATGCTTAT
GGTCTGTACT TCGGCATGCC TCAGCAAACG GGTGGAATGG TCGGCAAGGA AGGCATCGAG
CTTTCCGGGC AGAACATCTA TAACAACAAC AGCCGTCTTA TCGCTGAGGA TGGTCCTCTG
ACTCTGCAGG CGCAGAACAC GTTCGACAAC ACGCGTGCTC TGGTCACCAG CGGGGCGGAT
GCATCTATTC AGGTTGGCGG AACGTATTAT AACAACTACG CTACCACCTG GAGTGCGGGC
AACCTGGATA TCGACGCGAC CACGCTGCAA AACAGCAGCA GCGGTACGAT GATCGATAAC
AATGCGACCG GGTTCATAGC ATCTGATAAA AACCTGTCAC TGGAAGTGGT GAATAGCCTT
ACCAACTACG GCTGGATCAG CGGTAAAGGC GATGTTGATG TCACGGTGAA TAACGGCAAC
CTGTATAACC GCAATACCAT TGCGGCTGAA AAGGGGCTGG ATATTGCCGC GTTGAACGGT
ATTGAAAACT GGAAGGATAT TTCTGCTGGC GGCGACCTGA CGATGAACAC CAATCGCCAT
GTGACCAACA ACTCCAACAG CAATATGGTG GGGCAGAATA TTGTTATTAA CGCGGTTAAC
GATATCAACA ACCGTGGCAA CATTGTCAGT GACGCTGACC TGAACGTGAC GACCAAAGGC
AACCTGTATA ACTATCTCTA TATGGTAGGG TATGGGGATA TCGCATTGTC GGCAAATAGC
GTGGCGAACA ATAACGCGAC CATCGAAGCG ACAGGCGATC TGATTATCGA TTCGAAGGGT
AACGTGGGTA ACAACCGCGG TAATCTGCAT GCGTTGAACG GCGTGTTGTC TGTTAAAGGC
AACAATCTGA ACAACGATAA CGGTGAAATT CGTGGTTATG GCGATGTCAC GCTGGCACTG
ACGGGCAACT ACGACAGCTA TAAGGGTTCG CTGACCTCTG AAACGGGCGA CGTGACTCTG
ACGGCGAACA TTGTAGACAA CGCCTATGGT TTGATTGCCG GTGAGAATGT TTCTGTCGAT
GCTAAATCGA CGATTTACAA CAACACTGCG CTGATCGCGG CGAATAAAAA GCTGGTTATT
AACGCTGGCG GCAACCTCGA AAACCGCGAC GGGAATAACT TCCTGCGTAA TAACGGCGCG
CTGTTTGGAA TTACCGACAA CGTTGGCGGC ATCGTAGGTA AAGAAGGTGT CACGCTTTCT
GCTCAGAACG TCTACAACAA TAACAGCAGC ATCATCGCTG AAAATGGTCC GCTTAATCTG
CTGTCCAGGG GAACGCTGGA TAATACCCGC GCGCTTCTTA GCAGTGGGGC TGATGCCATC
ATCCGTGCGG CAGGGACGTT CTACAACAAC TATGCCACCA CGTACAGCGC CGGTAATCTC
GACGTTTATG CGGCGTCGTT GAACAACGCC AGCGATGGTC GCCTGGAAGA CAATACCGCC
ACGGGCGTGA TTGCGTCTGA CAAAAACCTG GATCTGAGCG TTGATAACAG TGTCACTAAC
TATGGTTGGA TCAGCGGTAA AGGAGATGTG CATTTCAATG TTCTGAAAGG CACGCTGTAT
AACCGTAATG CCATCGCGGC GGACAACGCG CTGACCATTA ATGCCCTGAA CGGTGTTGAG
AACTTTAAAG ACATTGTGGC GGGTACTGCG CTGACTATTG ATACGCAGAA GTATGTTACC
AACAACAGCA ACAGTAATAT GTTGGGACAA ACCATCGCGA TCAATGCCGT GAATGACATT
AATAACCGTG GAAATATTGT GGGTGATTAT TCTCTGGGTG TTAAAACCAC CGGTAATATT
TATAACTACC TCAATATGCT GAGTTATGGT GTCGCTGGCG TATCGGCAAA TAAGGTTACG
AATAGCGGTA AAGACGCTGT TCTCGGTGGC TTCTACGGTT TAGCGTTAGA AGCAAACGAA
ACTGATAACA CCGGTACTAT TGTCGGCATG TAA
 
Protein sequence
MAMINLSKEA TVGKALTPIA ILMMLSFPVA SQAAGLVIKN GTVYNANGVP VVDINKPNGS 
GLSHNIWDNL NVDKNGVVFN NSANESSTSL AGNIQGNSNL TSGSAKVILN EVTSKNPSTI
NGMMEVAGDK ADLIIANPNG ITVNGGGSIN TGKLTLTTGT PDIQDDKLAG YSVNGGTITL
GKLDNASPTE ILSRNVVVNG KVSADELNVV AGNNYVNAAG QVTGSVSATG SRNGYSVDVA
KLGGMYANKI SLVSTEKGVG VRNLGVIAGG VNGVSIDSKG NLLNSNAQIQ SASTINLTTN
GTLDNTTGTV TSVGTISLNT NKNTIVNTRA GNISTMGDIY VNSGTIDNTN GKLAAAGMLA
VDTNNATLIN SGKGSSVGIE AGLVALKTGT LNNSNGQIRG GYVGLESAAL NNNNGDIQTT
GDIAIISNGN VDNNKGLIRS STGHIVIGAA GSVNNGSTKT ADTGSSDSLG IIADTGVEIG
ANNINNNGGQ IASNGNVSLS SYSTIDDYAG KILSNSKVII KGSSLRNDTG GISGKQGIEV
AVGGSLTNNI GVISSEEGDI SLLANSVDNH GGFMMGQNIT MESMSGVNNN TALIVASKKL
KINARGSIEN RDGNNFGNAY GLYFGMPQQT GGMVGKEGIE LSGQNIYNNN SRLIAEDGPL
TLQAQNTFDN TRALVTSGAD ASIQVGGTYY NNYATTWSAG NLDIDATTLQ NSSSGTMIDN
NATGFIASDK NLSLEVVNSL TNYGWISGKG DVDVTVNNGN LYNRNTIAAE KGLDIAALNG
IENWKDISAG GDLTMNTNRH VTNNSNSNMV GQNIVINAVN DINNRGNIVS DADLNVTTKG
NLYNYLYMVG YGDIALSANS VANNNATIEA TGDLIIDSKG NVGNNRGNLH ALNGVLSVKG
NNLNNDNGEI RGYGDVTLAL TGNYDSYKGS LTSETGDVTL TANIVDNAYG LIAGENVSVD
AKSTIYNNTA LIAANKKLVI NAGGNLENRD GNNFLRNNGA LFGITDNVGG IVGKEGVTLS
AQNVYNNNSS IIAENGPLNL LSRGTLDNTR ALLSSGADAI IRAAGTFYNN YATTYSAGNL
DVYAASLNNA SDGRLEDNTA TGVIASDKNL DLSVDNSVTN YGWISGKGDV HFNVLKGTLY
NRNAIAADNA LTINALNGVE NFKDIVAGTA LTIDTQKYVT NNSNSNMLGQ TIAINAVNDI
NNRGNIVGDY SLGVKTTGNI YNYLNMLSYG VAGVSANKVT NSGKDAVLGG FYGLALEANE
TDNTGTIVGM