Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3895 |
Symbol | |
ID | 6967177 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3604560 |
End bp | 3609275 |
Gene Length | 4716 bp |
Protein Length | 1571 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 643387672 |
Product | hypothetical protein |
Protein accession | YP_002272121 |
Protein GI | 209399930 |
COG category | [M] Cell wall/membrane/envelope biogenesis [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3468] Type V secretory pathway, adhesin AidA |
TIGRFAM ID | [TIGR01414] outer membrane autotransporter barrel domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACACAA TACACTTGCG CTGTCTCTTC AGGATGAATC CCCTGGTCTG GTGCCTGTGG GCTGATGTTG CAGCAAAGCT AAGGTCGCTT AAACGCTACT CAGTATTCAC TTTTCAGAGG ATGAAATTTA TGAACAGGAC CAGTCCCTAT TATTGTCGTC GCTCAGTACT TTCCTTATTG ATATCTGCCT TGATATATGC CCCGCCCGGG ATGGCTGCCT TCACTCCTGA TGTTATTGGT GTGGTAAACG ATGAGACTGT AGATGGCAGC CAACGAGTAG ATGAACGAGG TACAACAAAT AACACTCATA TTATCAACCA TGGCCAGCAG AATGTTTATG GCGGGGTATC TAATGGAAGT CTTATTGAAT CTGGTGGATA TCAAGATGTA GGAAGGCATA ACAATTATGT GGGGCAGTCT AATAATACCA CCATTAACGG GGGCAGACAG TCAATTCATG ACGGGGGTAT TTCCACAGGT ACGATAATCG AGAGTGGCAA TCAGGACGTT TATAAAGGGG GTATCAGCAA TGGAACGACA ATTAAGGGCG GTGCTTCACG CGTAGAGGGA GGGAGTGCGA ATGGAACACT CATTGATGGT GGTAGCCAGA TAGTAAAAGT TCAAGGGCAT GCTGATGGTA CAACGATAAA TAAGTCTGGC TCTCAGGACG TAGTACAAGG AAGTCTGGCA ACGAACACAA CCATAAATGG TGGTCGACAG TATGTTGAAC AGAGCACAGT AGAAACAACC ACCATCAAAA ATGGCGGTGA GCAAAGAGTA TATGAGAGCC GTGCGCTGGA CACGACGATT GAAGGCGGAA CTCAGTCTCT GAATAGTAAG TCAACGGCAA AAAATACTCA GATCTATTCT GGTGGTACGC AAATTATTGA TAACACCAGC TCCTCGGATG TTATTGAAGT TTATTCCGGT GGCGTGCTTG ATGTTAGTGG TGGTACGGCA ACAAATGTTA CCCAGCACGA TGGTGCAATT TTAAAAACTA ACACTAACGG TACGACGGTG AGCGGTACGA ATAGTGAAGG TGCATTCTCC ATCCACAATC ACGTGGCAGA CAATGTGTTG CTGGAAAACG GTGGTCATTT AGACATAAAC GCATATGGTT CGGCAAACAA GACGATTATT AAAGATAAAG GAACAATGTC AGTTTTAACC AATGCTAAAG CTGATGCGAC CCGAATAGAT AATGGCGGGG TTATGGATGT TGCAGGAAAC GCGACAAATA CCATAATTAA TGGTGGCACA CAGAATATTA ATAATTATGG CATAGCCACA GGCACCAATA TCAACAGCGG AACGCAAAAT ATCAAAAGCG GCGGGAAAGC TGACACAACA ATTATATCCT CCGGGAGCCG GCAGGTTGTT GAGAAAGATG GTACGGCAAT TGGCAGCAAT ATTAGCGCCG GAGGCTCGCT GATTGTCTAT ACCGGCGGTA TTGCACATGG GGTTAACCAG GAGACGGGCA GTGCTTTAGT TGCCAACACG GGTGCAGGGA CTGATATCGA AGGATACAAC AAGCTCTCTC ACTTCACTAT TACCGGAGGG GAGGCTAATT ATGTTGTGCT GGAAAATACC GGCGAACTGA CGGTAGTGGC TAAAACCTCG GCGAAAAATA CTACCATTGA TGCTGGCGGT AAGCTGATTG TCCAGAAGGA GGCTAAAACA GATAGCACCA GACTTAATAA TGGCGGCGTT CTGGAGGTTC AGGACGGTGG TGAGGCTAAG CATGTTGAGC AACAATCCGG CGGCGCATTA ATTGCTTCCA CGACCTCCGG AACACTTATC GAAGGAACCA ACAGTTATGG TGATGCTTTC TACATCAGGA ATTCAGAAGC TAAAAATGTA GTGCTGGAAA ACGCTGGCTC ATTAACAGTC GTCACTGGTT CCCGGGCAGT TGACACGATT ATTAATGCCA ACGGCAAAAT GGATGTTTAT GGAAAAGATG TTGGCACTGT ACTCAATAGT GCTGGCACCC AAACAATATA TGCCAGTGCC ACTTCTGATA AAGCAAATAT CAAAGGTGGC AAGCAAACGG TATATGGTTT AGCCACTGAA GCAAATATCG AAAGTGGTGA ACAAATTGTT GATGGTGGGT CAACAGAGAA AACACACATC AATGGTGGCA CGCAAACCGT TCAGAATTAT GGTAAGGCGA TCAATACCGA TATCGTCTCT GGCCTACAAC AAATTATGGC AAACGGGACA GCGGAAGGTT CCATTATTAA TGGCGGTTCA CAGATAGTTA ATGAGGGCGG TCTGGCTGAA AACTCGGTGC TTAATGATGG CGGCACACTC GATGTGCGGG AGAAAGGCAG CGCAACGGGG ATACAGCAGA GTAGCCAGGG CGCGTTGGTT GCAACCACCA GGGCGACGCG GGTCACAGGA ACACGCGCGG ATGGCGTCGC GTTCAGCATC GAGCAGGGTG CGGCGAACAA TATCCTGCTG GCAAATGGCG GAGTGTTAAC CGTGGAGTCA GACACCTCTT CTGACAAAAC ACAGGTCAAT ACGGGCGGAC GGGAGATCGT CAAAACAAAA GCCACTGCGA CAGGCACGAC GCTCACCGGC GGTGAACAAA TTGTCGAGGG TGTGGCGAAT GAGACAACAA TTAACGACGG CGGAATACAA ACAGTTTCAG CTAACGGAGA GGCAATAAAA ACAACGATCA ATGAAGGCGG TACGCTGACA GTCAACGATA ATGGCAAAGC GACAGATATC GTCCAGAACA GCGGTGCCGC TCTCCAGACG AGCACGGCTA ACGGTATTGA AATCAGCGGT ACTCACCAGT ACGGCACTTT TTCCATTTCC GGCAATTTAG CGACCAATAT GTTGCTGGAA AATGGCGGTA ATTTATTGGT ATTAGCAGGT ACCGAAGCTC GCGACTCCAC GGTTGGCAAG GGGGGGGCAA TGCAAAACCA GGGTCAGGAC TCCGCCACAA AGGTTAACTC TGGTGGGCAA TATACCCTTG GGCGGTCAAA AGATGAGTTT CAGGCTCTGG CCCGGGCAGA AGATCTCCAG GTTGCTGGCG GGACAGCAAT CGTCTACGCA GGTACGCTGG CGGATGCATC GGTCAGTGGC GCGACAGGAA GCCTGTCGTT AATGACGCCA CGGGATAATG TTACGCCAGT TAAACTCGAA GGGGCGATCC GGATTACCGA TAGCGCGACA TTAACTATCG GCAATGGCGT TGATACGACG CTTGCCGACC TGACGGCTGC CAGCCGGGGC AGTGTCTGGC TTAACAGCAA TAATTCCTGT GCAGGCACCA GCAACTGCGA GTATAGAGTA AACAGTTTGC TACTTAACGA CGGTAATGTT TATTTATCAG CACAAACAGC AGCGCCTGCC ACAACTAACG GTATATACAA TACGCTGACA ACCAATGAAC TTTCCGGTAG CGGTAATTTC TACCTGCATA CCAACGTTGC AGGCTCTCGG GGCGATCAAC TGGTCGTCAA CAACAACGCC ACTGGTAATT TTAAAATCTT TGTTCAGGAT ACCGGCGTCA GTCCTCAGTC TGACGACGCG ATGACGCTGG TGAAAACAGG GGGAGGGGAT GCTTCGTTTT CGCTGGGCAA TACTGGCGGT TTCGTTGATC TTGGGACCTA TGAGTATGTC CTGAAAAGCG ATGGCAACAG CAACTGGAAC CTGACCAATG ATGTCAAACC CAACCCGGAT CCCAACCCAA ATCCCAACCC AAATCCGAAG CCGGATCCAA AACCAGACCC AAAACCGGAT CCGAAACCAG ACCCGACTCC CGAGCCAACG CCGACACCCG TTCCGGAGAA ACGCATCACG CCTTCTACCG CAGCCGTACT CAATATGGCA GCAACATTAC CGTTGGTATT TGATGCTGAG CTAAACAGTA TTCGCGAGCG GTTGAACATA ATGAAAGCGA GTCCACACAA CAATAATGTC TGGGGGGCGA CGTATAACAC CCGTAATAAT GTCACCACCG ATGCGGGGGC CGGGTTTGAG CAGACGCTGA CCGGAATGAC AGTGGGGATC GACAGCCCTA ATGATATTCC TGAGGGGATT GCGACGCTGG GCGCTTTTAT GGGTTATTCC CATTCACATA TCGGTTTTGA TCGCGGAGGA CATGGCAGTG TGGGCAGTTA TTCTCTGGGC GGCTATGCCA GTTGGGAACA TGAAAGTGGT TTCTATCTGG ACGGTGTCGT GAAGCTGAAC CGTTTTGAAA GTAACGTAGC CGGTAAAATG AGCAGCGGTG GAGCCGCCAA TGGCAGTTAC CACAGCAACG GGCTGGGCGG TCACATTGAA ACCGGGATGC GATTTACCGA TGGTAACTGG AACCTGACGC CGTATGCATC GTTAACGGGG TTCACCGCTG ATAACCCCGA ATATCATTTA TCCAATGGCA TGGAATCGAA ATCAGTCGAT ACCCGCAGTA TATATCGTGA ACTGGGCGCA ACGCTGAGTT ACAACATGCG TCTGGGGAAC GGTATGGAAA TTGAGCCGTG GCTGAAGGCG GCTGTGCGCA AAGAATTTGT CGATGATAAC CGGGTGAAGG TGAATAATGA CGGTAATTTC GTCAATGATT TGTCGGGCAG ACGTGGAATA TACCAGGCAG GTATTAAAGC CTCATTCAGC AGTACGTTAA GCGGGCATCT TGGGGTGGGG TATAGCCATG GTGCCGGTGT GGAATCCCCG TGGAACGCGG TAGCTGGTGT GAACTGGTCG TTCTGA
|
Protein sequence | MNTIHLRCLF RMNPLVWCLW ADVAAKLRSL KRYSVFTFQR MKFMNRTSPY YCRRSVLSLL ISALIYAPPG MAAFTPDVIG VVNDETVDGS QRVDERGTTN NTHIINHGQQ NVYGGVSNGS LIESGGYQDV GRHNNYVGQS NNTTINGGRQ SIHDGGISTG TIIESGNQDV YKGGISNGTT IKGGASRVEG GSANGTLIDG GSQIVKVQGH ADGTTINKSG SQDVVQGSLA TNTTINGGRQ YVEQSTVETT TIKNGGEQRV YESRALDTTI EGGTQSLNSK STAKNTQIYS GGTQIIDNTS SSDVIEVYSG GVLDVSGGTA TNVTQHDGAI LKTNTNGTTV SGTNSEGAFS IHNHVADNVL LENGGHLDIN AYGSANKTII KDKGTMSVLT NAKADATRID NGGVMDVAGN ATNTIINGGT QNINNYGIAT GTNINSGTQN IKSGGKADTT IISSGSRQVV EKDGTAIGSN ISAGGSLIVY TGGIAHGVNQ ETGSALVANT GAGTDIEGYN KLSHFTITGG EANYVVLENT GELTVVAKTS AKNTTIDAGG KLIVQKEAKT DSTRLNNGGV LEVQDGGEAK HVEQQSGGAL IASTTSGTLI EGTNSYGDAF YIRNSEAKNV VLENAGSLTV VTGSRAVDTI INANGKMDVY GKDVGTVLNS AGTQTIYASA TSDKANIKGG KQTVYGLATE ANIESGEQIV DGGSTEKTHI NGGTQTVQNY GKAINTDIVS GLQQIMANGT AEGSIINGGS QIVNEGGLAE NSVLNDGGTL DVREKGSATG IQQSSQGALV ATTRATRVTG TRADGVAFSI EQGAANNILL ANGGVLTVES DTSSDKTQVN TGGREIVKTK ATATGTTLTG GEQIVEGVAN ETTINDGGIQ TVSANGEAIK TTINEGGTLT VNDNGKATDI VQNSGAALQT STANGIEISG THQYGTFSIS GNLATNMLLE NGGNLLVLAG TEARDSTVGK GGAMQNQGQD SATKVNSGGQ YTLGRSKDEF QALARAEDLQ VAGGTAIVYA GTLADASVSG ATGSLSLMTP RDNVTPVKLE GAIRITDSAT LTIGNGVDTT LADLTAASRG SVWLNSNNSC AGTSNCEYRV NSLLLNDGNV YLSAQTAAPA TTNGIYNTLT TNELSGSGNF YLHTNVAGSR GDQLVVNNNA TGNFKIFVQD TGVSPQSDDA MTLVKTGGGD ASFSLGNTGG FVDLGTYEYV LKSDGNSNWN LTNDVKPNPD PNPNPNPNPK PDPKPDPKPD PKPDPTPEPT PTPVPEKRIT PSTAAVLNMA ATLPLVFDAE LNSIRERLNI MKASPHNNNV WGATYNTRNN VTTDAGAGFE QTLTGMTVGI DSPNDIPEGI ATLGAFMGYS HSHIGFDRGG HGSVGSYSLG GYASWEHESG FYLDGVVKLN RFESNVAGKM SSGGAANGSY HSNGLGGHIE TGMRFTDGNW NLTPYASLTG FTADNPEYHL SNGMESKSVD TRSIYRELGA TLSYNMRLGN GMEIEPWLKA AVRKEFVDDN RVKVNNDGNF VNDLSGRRGI YQAGIKASFS STLSGHLGVG YSHGAGVESP WNAVAGVNWS F
|
| |