Gene Information |
Locus tag | ECH74115_5054 |
Symbol | |
ID | 6970294 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 4699562 |
End bp | 4702366 |
Gene Length | 2805 bp |
Protein Length | 934 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 643388732 |
Product | intimin C-type lectin domain protein |
Protein accession | YP_002273158 |
Protein GI | 209398935 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01451] conserved repeat domain |
|
|
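The coordinate, gene length, and protein length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon (reasonable for an unspliced CDS, but an assumption about how this record was computed). The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 4699562
END_BP = 4702366


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len: int) -> int:
    """Number of codons minus one stop codon (assumes an unspliced CDS)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


gene_len = gene_length_bp(START_BP, END_BP)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)  # 2805 934
```

Both results match the Gene Length (2805 bp) and Protein Length (934 aa) fields.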
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTACTC ATGGTTGTTA TACCCGGACC CGGCACAAGC ATAAGCTAAA AAAAACATTG ATTATGCTTA GTGCTGGTTT AGGATTGTTT TTTTATGTTA ATCAGAATTC ATTTGCAAAT GGTGAAAATT ATTTTAAATT GGGTTCGGAT TCAAAACTGT TAACTCATGA TAGCTATCAG AATCGCCTTT TTTATACGTT GAAAACTGGT GAAACTGTTG CCGATCTTTC TAAATCGCAA GATATTAATT TATCGACGAT TTGGTCGTTG AATAAGCATT TATACAGTTC TGAAAGCGAA ATGATGAAGG CCGCGCCTGG TCAGCAGATC ATTTTGCCAC TCAAAAAACT TCCCTTTGAA TACAGTGCAC TACCACTTTT AGGTTCGGCA CCTCTTGTTG CTGCAGGTGG TGTTGCTGGT CACACGAATA AACTGACTAA AATGTCCCCG GACGTGACCA AAAGCAACAT GACCGATGAC AAGGCATTAA ATTATGCGGC ACAACAGGCG GCGAGTCTCG GTAGCCAGCT TCAGTCGCGA TCTCTGAACG GCGATTACGC GAAAGATACC GCTCTTGGTA TCGCTGGTAA CCAGGCTTCG TCACAGTTGC AGGCCTGGTT ACAACATTAT GGAACGGCAG AGGTTAATCT GCAGAGTGGT AATAACTTTG ACGGTAGTTC ACTGGACTTC TTATTACCGT TCTATGATTC CGAAAAAATG CTGGCATTTG GTCAGGTCGG AGCGCGTTAC ATTGACTCCC GCTTTACGGC AAATTTAGGT GCGGGTCAGC GTTTTTTCCT TCCTGCAAAC ATGTTGGGCT ATAACGTCTT CATTGATCAG GATTTTTCTG GTGATAATAC CCGTTTAGGT ATTGGTGGCG AATACTGGCG AGACTATTTC AAAAGTAGCG TTAACGGCTA TTTCCGCATG AGCGGCTGGC ATGAGTCATA CAATAAGAAA GACTATGATG AGCGCCCAGC AAATGGCTTC GATATCCGTT TTAATGGCTA TCTACCGTCA TATCCGGCAT TAGGCGCCAA GCTGATATAT GAGCAGTATT ATGGTGATAA TGTTGCTTTG TTTAATTCTG ATAAGCTGCA GTCGAATCCT GGTGCGGCGA CCGTTGGTGT AAACTATACT CCGATTCCTC TGGTGACGAT GGGGATCGAT TACCGTCATG GTACGGGTAA TGAAAATGAT CTCCTTTACT CAATGCAGTT CCGTTATCAG TTTGATAAAT CGTGGTCTCA GCAAATTGAA CCACAGTATG TTAACGAGTT AAGAACATTA TCAGGCAGCC GTTACGATCT GGTTCAGCGT AATAACAATA TTATTCTGGA GTACAAGAAG CAGGATATTC TTTCTCTGAA TATTCCGCAT GATATTAATG GTACTGAACA CAGTACGCAG AAGATTCAGT TGATCGTTAA GAGCAAATAC GGTCTGGATC GTATCGTCTG GGATGATAGT GCATTACGCA GTCAGGGCGG TCAGATTCAG CATAGCGGAA GCCAAAGCGC ACAAGACTAC CAGGCTATTT TGCCTGCTTA TGTGCAAGGT GGCAGCAATA TTTATAAAGT GACGGCTCGC GCCTATGACC GTAATGGCAA TAGCTCTAAC AATGTACAGC TTACTATTAC CGTTCTGTCG AATGGTCAAG TTGTCGACCA GGTTGGGGTA ACGGACTTTA CGGCGGATAA GACTTCGGCT AAAGCGGATA ACGCCGATAC CATTACTTAT ACCGCGACGG TGAAAAAGAA TGGGGTAGCT CAGGCTAATG TCCCTGTTTC ATTTAATATT 
GTTTCAGGAA CTGCAACTCT TGGGGCAAAT AGTGCCAAAA CGGATGCTAA CGGTAAGGCA ACCGTAACGT TGAAGTCGAG TACGCCAGGA CAGGTCGTCG TGTCTGCTAA AACCGCGGAG ATGACTTCAG CACTTAATGC CAGTGCGGTT ATATTTTTTG ATCAAACCAA GGCCAGCATT ACTGAGATTA AGGCTGATAA GACAACTGCA GTAGCAAATG GTAAGGATGC TATTAAATAT ACTGTAAAAG TTATGAAAAA CGGTCAGCCA GTTAATAATC AATCCGTTAC ATTCTCAACA AACTTTGGGA TGTTCAACGG TAAGTCTCAA ACGCAAGCAA CCACGGGAAA TGATGGTCGT GCGACGATAA CACTAACTTC CAGTTCCGCC GGTAAAGCGA CTGTTAGTGC GACAGTCAGT GATGGGGCTG AGGTTAAAGC GACTGAGGTC ACTTTTTTTG ATGAACTGAA AATTGACAAC AAGGTTGATA TTATTGGTAA CAATGTCAGA GGCGAGTTGC CTAATATTTG GCTGCAATAT GGTCAGTTTA AACTGAAAGC AAGCGGTGGT GATGGTACAT ATTCATGGTA TTCAGAAAAT ACCAGTATCG CGACTGTCGA TGCATCAGGG AAAGTCACTT TGAATGGTAA AGGCAGTGTC GTAATTAAAG CCACATCTGG TGATAAGCAA ACAGTAAGTT ACACTATAAA AGCACCGTCG TATATGATAA AAGTGGATAA GCAAGCCTAT TATGCTGATG CTATGTCCAT TTGCAAAAAT TTATTACCAT CCACACAGAC GGTATTGTCA GATATTTATG ACTCATGGGG GGCTGCAAAT AAATATAGCC ATTATAGTTC TATGAACTCA ATAACTGCTT GGATTAAACA GACATCTAGT GAGCAGCGTT CTGGAGTATC AAGCACTTAT AACCTAATAA CACAAAACCC TCTTCCTGGG GTTAATGTTA ATACTCCAAA TGTCTATGCG GTTTGTGTAG AATAA
|
Protein sequence | MITHGCYTRT RHKHKLKKTL IMLSAGLGLF FYVNQNSFAN GENYFKLGSD SKLLTHDSYQ NRLFYTLKTG ETVADLSKSQ DINLSTIWSL NKHLYSSESE MMKAAPGQQI ILPLKKLPFE YSALPLLGSA PLVAAGGVAG HTNKLTKMSP DVTKSNMTDD KALNYAAQQA ASLGSQLQSR SLNGDYAKDT ALGIAGNQAS SQLQAWLQHY GTAEVNLQSG NNFDGSSLDF LLPFYDSEKM LAFGQVGARY IDSRFTANLG AGQRFFLPAN MLGYNVFIDQ DFSGDNTRLG IGGEYWRDYF KSSVNGYFRM SGWHESYNKK DYDERPANGF DIRFNGYLPS YPALGAKLIY EQYYGDNVAL FNSDKLQSNP GAATVGVNYT PIPLVTMGID YRHGTGNEND LLYSMQFRYQ FDKSWSQQIE PQYVNELRTL SGSRYDLVQR NNNIILEYKK QDILSLNIPH DINGTEHSTQ KIQLIVKSKY GLDRIVWDDS ALRSQGGQIQ HSGSQSAQDY QAILPAYVQG GSNIYKVTAR AYDRNGNSSN NVQLTITVLS NGQVVDQVGV TDFTADKTSA KADNADTITY TATVKKNGVA QANVPVSFNI VSGTATLGAN SAKTDANGKA TVTLKSSTPG QVVVSAKTAE MTSALNASAV IFFDQTKASI TEIKADKTTA VANGKDAIKY TVKVMKNGQP VNNQSVTFST NFGMFNGKSQ TQATTGNDGR ATITLTSSSA GKATVSATVS DGAEVKATEV TFFDELKIDN KVDIIGNNVR GELPNIWLQY GQFKLKASGG DGTYSWYSEN TSIATVDASG KVTLNGKGSV VIKATSGDKQ TVSYTIKAPS YMIKVDKQAY YADAMSICKN LLPSTQTVLS DIYDSWGAAN KYSHYSSMNS ITAWIKQTSS EQRSGVSSTY NLITQNPLPG VNVNTPNVYA VCVE
|
| |
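The sequence fields are displayed in space-separated blocks of 10 characters, so the spacing has to be removed before any analysis. The following is a minimal sketch of that clean-up plus a simple GC fraction calculation. The helper names are hypothetical, and the sample is only the first three blocks of the gene sequence, so its GC fraction is not the gene-wide value reported in the GC content field.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of 10."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the Gene sequence field, as displayed.
sample = "ATGATTACTC ATGGTTGTTA TACCCGGACC"
cleaned = clean_sequence(sample)
print(len(cleaned), gc_fraction(cleaned))
```

Passing the full Gene sequence field through `clean_sequence` should yield a string whose length equals the Gene Length field, which is a quick way to confirm that nothing was lost when copying the record.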