Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3367 |
Symbol | |
ID | 6967999 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3107562 |
End bp | 3111266 |
Gene Length | 3705 bp |
Protein Length | 1234 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643387176 |
Product | adhesin |
Protein accession | YP_002271639 |
Protein GI | 209396650 |
COG category | [M] Cell wall/membrane/envelope biogenesis [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3468] Type V secretory pathway, adhesin AidA |
TIGRFAM ID | [TIGR01376] Chlamydial polymorphic outer membrane protein repeat [TIGR01414] outer membrane autotransporter barrel domain [TIGR02601] autotransporter-associated beta strand repeat |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTGCAT CTCTTTTCCC TGCTAACGGT GTCGCGGCGG CCATTGATTT ATGCCAGGGA TATAATATCA AAGCGAGTTG TCACGCCAGC AGGCAAAGCC TTTCAGGCAT TACGCAGGTC TGGAGTATTG CCGATGGGCA ATGGCTGGTT TTTTCGGATA TGACCAATAA TGCCAGCGGT GGGGCCGTAT TTTTGCAACA AGGAGCGGAA TTTACATTAT CACCAGAAAA TGAAACTGGA ATGACTCTGT TTGCCAATAA CATCGTTTCA GGAGAATATA ATAACGGCGG GGCAATATTT GCTAAAGAAA ACTCAACGCT GAATCTTACG GATGTTATTT TTTCTGGTAA CGTCGCAGGC GGCTATGGTG GCGCAATCTA TTCTTCTGGT ACTAACGATA CCGGTGCCAT CGATTTACGT GTCACTAACG CCGTGTTTCG CAATAACATC GCTAATGACG GCAAAGGTGG TGCAATTTAT ACCATCAATA ATGATATCTA TTTAAGTGAT GATGTTTTTA ACAATAACCA GGCATATACA TCAACAAGTT ACAGTGATGG CGATGGCGGC GCAATCGATG TCACAGATAA TAATAGCGAC AGCAAGCATC CTTCAGGTTA TACGATAATA AATAACACTG CCTTTACAAA TAACACTGCC GAAGGTTATG GCGGGGCGAT ATATACCAAT AGCGCGACGG CTCCCTATCT TATTGATATT TCTGTTGATG ACAGCTACAG CCAGAACGGA GGCGTGTTAG TCGATGAGAA CAATAGCGCA GCAGGCTATG GAGATGGTCC TTCCTCTGCG GCGGGTGGCT TTATGTATCT CGGCTTAAGT GAAGTTACCT TTGATATTGC CGACGGAAAA ACGCTGGTTA TTGGCAATAC AGAGAATGAC GGAGCTGTTG ACTCTATTGC TGGTACCGGG TTAATCACCA AAACAGGTTC CGGCGATCTG GTACTTAATG CAGATAACAA TGACTTTACT GGTGAGATGC AGATTGAAAA CGGTGAAGTT ACCCTGGGCC GCAGCAACTC CCTGATGAAT GTCGGCGATA CGCATTGCCA GGACGATCCG CAAGACTGCT ACGGTCTGAC GATAGGGAGT ATTGATAAGT ACCAGAATCA GGCAGAGCTA AATGTTGGCT CCACCCAACA AACCTTTGCG CACTCATTGA CGGGCTTTCA GAATGGCACT TTAAATATCG ATGCTGGTGG CAATGTTACT GTTAATCAAG GCAGTTTTGC TGGCACCATC GAAGGTGCTG GTCAGCTCAC CATTGCGCAA AACGGCAGCT ATGTGCTGGC GGGGGCGCAG TCGATGGCGC TAACCGGCGA TATAGTGGTG GATGCTGGTG CGGTGCTTTC GCTGGAAGGC GACGCGGCAG ATCTTGCCGC TCTCCAGGAC GATCCGCAGT CGATCGTGTT AAACGGCGGT ATGCTCGATC TCTCTGATTT CTCCACCTGG CAGAGCGGTA CATCATACAA AGATGGCCTT GAAGTCAGTG GCAGCAGCGG AACGGTTATC GGCAGTCAGG ATGTGGTAGA TCTTGCAGGC GGAAACGATA TGCATATCGG CGGCGACGGG AAAGATGGCG TCTACGTGGT GATCGATGCG GGTGACGGGC AGGTCAGCCT GGCAAATGAC AATCAATACC TCGGCACAAC GCAAATCGCT TCCGGTACGC TGATGGTGAG CGACAACTCG CAGCTTGGAT ATACCCATTA TAACCGCCAG GTTATCTTTA CCGATAAGCC ACAAGAAAGC GTGATGGAGA TTACTGCCAA TGTCGATACT CGCTCTACAA CGACTGAGCA TGGGCGTGAT ATTGAAATGC GCGCCGACGG TGAAGTGGCA GTTGATGCGG GGGTAGACAC GCAGTGGGGC GCACTGATGG CTGACAGCAG CGGGCAGCAT CAGGATGAGG GTAGCACATT GACTAAAACG GGGGCGGGTA CACTGGAGCT GACCGCCAGC GGTACAACGC AGTCAGCGGT GAGAGTAGAA GAGGGCACGC TGCAAGGTGA TGTTGCGGAT ATCTTCCCTT ATGCTTCGTC GCTATGGGTC GGTGACGGGG CAACGTTCGT TACTGGCGCG GATCAGGATA TTCAGTCAAT TGATGCTACT TCCAGCGGCA CTATCGACAT CAGCGATGGT ACGGTTTTGC GCCTGACCGG GCAGGATACT TCCGTCGCCC TTAATGCCTC ACTGTTTAAC TGCGATGGGA CGCTGGTGAA TGCCACCGAT GGTGTGACGT TGACAGGTGA GCTTAATACC AACCTTGAAA CTGACAGCCT GACTTATCTT TCCAACGTGA CGGTTAATGG CAATCTGACC AATACGTCCG GTGCGGTTAG CCTGCAAAAT GGCGTCGCTG GCGATACGCT GACGGTAAAC GGTGATTATA CCGGCGGCGG TACGCTACTG CTCGATAGCG AATTAAACGG CGATGACTCG GTAAGCGATC AATTGGTGAT GAACGGTAAT ACTGCTGGCA ACACAACTGT GGTGGTTAAC TCCATTACAG GGATTGGTGA GCCGACATCG ACAGGCATTA AAGTGGTTGA TTTCGCAGCT GATCCCACGC AGTTTCAAAA CAATGCGCAG TTCAGTCTGG CAGGCAGCGG CTACGTCAAT ATGGGAGCGT ATGACTACAC GCTGGTGGAA GATAACAACG ACTGGTATCT GCGATCGCAA GAAGTAACGC CGCCATCGCC ACCTGATCCA GACCCGACTC CCGATCCTGA TCCCACGCAG GATCCTGATC CAACACCCGA CCCGGAACCT ACGCCTGCTT ACCAGCCGGT GTTGAATGCC AAAGTTGGCG GTTATCTCAA TAACCTGCGG GCGGCAAATC AGGCGTTTAT GATGGAGCGA CGCGATCACG CAGGTGGCGA TGGTCAGACG CTGAATTTAC GTGTTATCGG CGGAGATTAT CATTACACAG CAGCGGGGCA ACTGGCTCAA CATGAAGACA CTTCTACGGT GCAACTTAGC GGCGATCTGT TTAGCGGGCG CTGGGGCACG GATGGCGAGT GGATGCTTGG GATTGTTGGT GGCTACAGCG ATAACCAGGG CGACAGCCGC TCGAGTATGA CCGGAACTCG CGCCGATAAC CAGAACCACG GTTATGCGGT TGGGCTGACC TCAAGCTGGT TTCAGCACGG TAAGCAGAAG CAGGGGGCCT GGCTGGATAA CTGGTTGCAG TACGCGTGGT TTAGCAATGA TGTTTCTGAA CATGAAGATG GCGTGGATCA TTACCATTCG TCGGGGATTA TCGCCTCGCT GGAAGCGGGG TATCAGTGGT TACCGGGGCG TGGTGTGGTG ATTGAACCGC AGGCGCAGGT GATTTATCAG GGCGTGCAGC AGGATGATTT TACCGCCGCT AACCGTGCGC GCGTGTCACA ATCGCAGGGT GATGATATTC AGACGCGGCT GGGTTTACAC AGCGAATGGC GTACCGCTGT TCATGTCATA CCAACATTAG ATCTGAATTA TTATCACGAT CCCCATTCGA CGGAAATTGA AGAAGATGCC AGCACTATCA GTGACGATGC GGTGAAGCAA CGGGGTGAAA TAAAAGTGGG AGTCACGGGC AATATCAGTC AGCGAGTTTC GCTGCGTGGT AGCGTGGCGT GGCAGAAAGG GAGTGATGAT TTTGCCCAGA CGGCAGGGTT TTTGTCGATG ACGGTGAAAT GGTAA
|
Protein sequence | MIASLFPANG VAAAIDLCQG YNIKASCHAS RQSLSGITQV WSIADGQWLV FSDMTNNASG GAVFLQQGAE FTLSPENETG MTLFANNIVS GEYNNGGAIF AKENSTLNLT DVIFSGNVAG GYGGAIYSSG TNDTGAIDLR VTNAVFRNNI ANDGKGGAIY TINNDIYLSD DVFNNNQAYT STSYSDGDGG AIDVTDNNSD SKHPSGYTII NNTAFTNNTA EGYGGAIYTN SATAPYLIDI SVDDSYSQNG GVLVDENNSA AGYGDGPSSA AGGFMYLGLS EVTFDIADGK TLVIGNTEND GAVDSIAGTG LITKTGSGDL VLNADNNDFT GEMQIENGEV TLGRSNSLMN VGDTHCQDDP QDCYGLTIGS IDKYQNQAEL NVGSTQQTFA HSLTGFQNGT LNIDAGGNVT VNQGSFAGTI EGAGQLTIAQ NGSYVLAGAQ SMALTGDIVV DAGAVLSLEG DAADLAALQD DPQSIVLNGG MLDLSDFSTW QSGTSYKDGL EVSGSSGTVI GSQDVVDLAG GNDMHIGGDG KDGVYVVIDA GDGQVSLAND NQYLGTTQIA SGTLMVSDNS QLGYTHYNRQ VIFTDKPQES VMEITANVDT RSTTTEHGRD IEMRADGEVA VDAGVDTQWG ALMADSSGQH QDEGSTLTKT GAGTLELTAS GTTQSAVRVE EGTLQGDVAD IFPYASSLWV GDGATFVTGA DQDIQSIDAT SSGTIDISDG TVLRLTGQDT SVALNASLFN CDGTLVNATD GVTLTGELNT NLETDSLTYL SNVTVNGNLT NTSGAVSLQN GVAGDTLTVN GDYTGGGTLL LDSELNGDDS VSDQLVMNGN TAGNTTVVVN SITGIGEPTS TGIKVVDFAA DPTQFQNNAQ FSLAGSGYVN MGAYDYTLVE DNNDWYLRSQ EVTPPSPPDP DPTPDPDPTQ DPDPTPDPEP TPAYQPVLNA KVGGYLNNLR AANQAFMMER RDHAGGDGQT LNLRVIGGDY HYTAAGQLAQ HEDTSTVQLS GDLFSGRWGT DGEWMLGIVG GYSDNQGDSR SSMTGTRADN QNHGYAVGLT SSWFQHGKQK QGAWLDNWLQ YAWFSNDVSE HEDGVDHYHS SGIIASLEAG YQWLPGRGVV IEPQAQVIYQ GVQQDDFTAA NRARVSQSQG DDIQTRLGLH SEWRTAVHVI PTLDLNYYHD PHSTEIEEDA STISDDAVKQ RGEIKVGVTG NISQRVSLRG SVAWQKGSDD FAQTAGFLSM TVKW
|
| |