Gene ECH74115_3367 details

Gene Information

Locus tag: ECH74115_3367
Symbol:
ID: 6967999
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3107562
End bp: 3111266
Gene Length: 3705 bp
Protein Length: 1234 aa
Translation table: 11
GC content: 51%
IMG OID: 643387176
Product: adhesin
Protein accession: YP_002271639
Protein GI: 209396650
COG category: [M] Cell wall/membrane/envelope biogenesis
    [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3468] Type V secretory pathway, adhesin AidA
TIGRFAM ID: [TIGR01376] Chlamydial polymorphic outer membrane protein repeat
    [TIGR01414] outer membrane autotransporter barrel domain
    [TIGR02601] autotransporter-associated beta strand repeat
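
The start, end, and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the record itself; the variable names are illustrative, and it assumes that the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the record above.
start_bp = 3107562
end_bp = 3111266

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 3705 1234
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.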


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 52
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTGCAT CTCTTTTCCC TGCTAACGGT GTCGCGGCGG CCATTGATTT ATGCCAGGGA 
TATAATATCA AAGCGAGTTG TCACGCCAGC AGGCAAAGCC TTTCAGGCAT TACGCAGGTC
TGGAGTATTG CCGATGGGCA ATGGCTGGTT TTTTCGGATA TGACCAATAA TGCCAGCGGT
GGGGCCGTAT TTTTGCAACA AGGAGCGGAA TTTACATTAT CACCAGAAAA TGAAACTGGA
ATGACTCTGT TTGCCAATAA CATCGTTTCA GGAGAATATA ATAACGGCGG GGCAATATTT
GCTAAAGAAA ACTCAACGCT GAATCTTACG GATGTTATTT TTTCTGGTAA CGTCGCAGGC
GGCTATGGTG GCGCAATCTA TTCTTCTGGT ACTAACGATA CCGGTGCCAT CGATTTACGT
GTCACTAACG CCGTGTTTCG CAATAACATC GCTAATGACG GCAAAGGTGG TGCAATTTAT
ACCATCAATA ATGATATCTA TTTAAGTGAT GATGTTTTTA ACAATAACCA GGCATATACA
TCAACAAGTT ACAGTGATGG CGATGGCGGC GCAATCGATG TCACAGATAA TAATAGCGAC
AGCAAGCATC CTTCAGGTTA TACGATAATA AATAACACTG CCTTTACAAA TAACACTGCC
GAAGGTTATG GCGGGGCGAT ATATACCAAT AGCGCGACGG CTCCCTATCT TATTGATATT
TCTGTTGATG ACAGCTACAG CCAGAACGGA GGCGTGTTAG TCGATGAGAA CAATAGCGCA
GCAGGCTATG GAGATGGTCC TTCCTCTGCG GCGGGTGGCT TTATGTATCT CGGCTTAAGT
GAAGTTACCT TTGATATTGC CGACGGAAAA ACGCTGGTTA TTGGCAATAC AGAGAATGAC
GGAGCTGTTG ACTCTATTGC TGGTACCGGG TTAATCACCA AAACAGGTTC CGGCGATCTG
GTACTTAATG CAGATAACAA TGACTTTACT GGTGAGATGC AGATTGAAAA CGGTGAAGTT
ACCCTGGGCC GCAGCAACTC CCTGATGAAT GTCGGCGATA CGCATTGCCA GGACGATCCG
CAAGACTGCT ACGGTCTGAC GATAGGGAGT ATTGATAAGT ACCAGAATCA GGCAGAGCTA
AATGTTGGCT CCACCCAACA AACCTTTGCG CACTCATTGA CGGGCTTTCA GAATGGCACT
TTAAATATCG ATGCTGGTGG CAATGTTACT GTTAATCAAG GCAGTTTTGC TGGCACCATC
GAAGGTGCTG GTCAGCTCAC CATTGCGCAA AACGGCAGCT ATGTGCTGGC GGGGGCGCAG
TCGATGGCGC TAACCGGCGA TATAGTGGTG GATGCTGGTG CGGTGCTTTC GCTGGAAGGC
GACGCGGCAG ATCTTGCCGC TCTCCAGGAC GATCCGCAGT CGATCGTGTT AAACGGCGGT
ATGCTCGATC TCTCTGATTT CTCCACCTGG CAGAGCGGTA CATCATACAA AGATGGCCTT
GAAGTCAGTG GCAGCAGCGG AACGGTTATC GGCAGTCAGG ATGTGGTAGA TCTTGCAGGC
GGAAACGATA TGCATATCGG CGGCGACGGG AAAGATGGCG TCTACGTGGT GATCGATGCG
GGTGACGGGC AGGTCAGCCT GGCAAATGAC AATCAATACC TCGGCACAAC GCAAATCGCT
TCCGGTACGC TGATGGTGAG CGACAACTCG CAGCTTGGAT ATACCCATTA TAACCGCCAG
GTTATCTTTA CCGATAAGCC ACAAGAAAGC GTGATGGAGA TTACTGCCAA TGTCGATACT
CGCTCTACAA CGACTGAGCA TGGGCGTGAT ATTGAAATGC GCGCCGACGG TGAAGTGGCA
GTTGATGCGG GGGTAGACAC GCAGTGGGGC GCACTGATGG CTGACAGCAG CGGGCAGCAT
CAGGATGAGG GTAGCACATT GACTAAAACG GGGGCGGGTA CACTGGAGCT GACCGCCAGC
GGTACAACGC AGTCAGCGGT GAGAGTAGAA GAGGGCACGC TGCAAGGTGA TGTTGCGGAT
ATCTTCCCTT ATGCTTCGTC GCTATGGGTC GGTGACGGGG CAACGTTCGT TACTGGCGCG
GATCAGGATA TTCAGTCAAT TGATGCTACT TCCAGCGGCA CTATCGACAT CAGCGATGGT
ACGGTTTTGC GCCTGACCGG GCAGGATACT TCCGTCGCCC TTAATGCCTC ACTGTTTAAC
TGCGATGGGA CGCTGGTGAA TGCCACCGAT GGTGTGACGT TGACAGGTGA GCTTAATACC
AACCTTGAAA CTGACAGCCT GACTTATCTT TCCAACGTGA CGGTTAATGG CAATCTGACC
AATACGTCCG GTGCGGTTAG CCTGCAAAAT GGCGTCGCTG GCGATACGCT GACGGTAAAC
GGTGATTATA CCGGCGGCGG TACGCTACTG CTCGATAGCG AATTAAACGG CGATGACTCG
GTAAGCGATC AATTGGTGAT GAACGGTAAT ACTGCTGGCA ACACAACTGT GGTGGTTAAC
TCCATTACAG GGATTGGTGA GCCGACATCG ACAGGCATTA AAGTGGTTGA TTTCGCAGCT
GATCCCACGC AGTTTCAAAA CAATGCGCAG TTCAGTCTGG CAGGCAGCGG CTACGTCAAT
ATGGGAGCGT ATGACTACAC GCTGGTGGAA GATAACAACG ACTGGTATCT GCGATCGCAA
GAAGTAACGC CGCCATCGCC ACCTGATCCA GACCCGACTC CCGATCCTGA TCCCACGCAG
GATCCTGATC CAACACCCGA CCCGGAACCT ACGCCTGCTT ACCAGCCGGT GTTGAATGCC
AAAGTTGGCG GTTATCTCAA TAACCTGCGG GCGGCAAATC AGGCGTTTAT GATGGAGCGA
CGCGATCACG CAGGTGGCGA TGGTCAGACG CTGAATTTAC GTGTTATCGG CGGAGATTAT
CATTACACAG CAGCGGGGCA ACTGGCTCAA CATGAAGACA CTTCTACGGT GCAACTTAGC
GGCGATCTGT TTAGCGGGCG CTGGGGCACG GATGGCGAGT GGATGCTTGG GATTGTTGGT
GGCTACAGCG ATAACCAGGG CGACAGCCGC TCGAGTATGA CCGGAACTCG CGCCGATAAC
CAGAACCACG GTTATGCGGT TGGGCTGACC TCAAGCTGGT TTCAGCACGG TAAGCAGAAG
CAGGGGGCCT GGCTGGATAA CTGGTTGCAG TACGCGTGGT TTAGCAATGA TGTTTCTGAA
CATGAAGATG GCGTGGATCA TTACCATTCG TCGGGGATTA TCGCCTCGCT GGAAGCGGGG
TATCAGTGGT TACCGGGGCG TGGTGTGGTG ATTGAACCGC AGGCGCAGGT GATTTATCAG
GGCGTGCAGC AGGATGATTT TACCGCCGCT AACCGTGCGC GCGTGTCACA ATCGCAGGGT
GATGATATTC AGACGCGGCT GGGTTTACAC AGCGAATGGC GTACCGCTGT TCATGTCATA
CCAACATTAG ATCTGAATTA TTATCACGAT CCCCATTCGA CGGAAATTGA AGAAGATGCC
AGCACTATCA GTGACGATGC GGTGAAGCAA CGGGGTGAAA TAAAAGTGGG AGTCACGGGC
AATATCAGTC AGCGAGTTTC GCTGCGTGGT AGCGTGGCGT GGCAGAAAGG GAGTGATGAT
TTTGCCCAGA CGGCAGGGTT TTTGTCGATG ACGGTGAAAT GGTAA
 
Protein sequence
MIASLFPANG VAAAIDLCQG YNIKASCHAS RQSLSGITQV WSIADGQWLV FSDMTNNASG 
GAVFLQQGAE FTLSPENETG MTLFANNIVS GEYNNGGAIF AKENSTLNLT DVIFSGNVAG
GYGGAIYSSG TNDTGAIDLR VTNAVFRNNI ANDGKGGAIY TINNDIYLSD DVFNNNQAYT
STSYSDGDGG AIDVTDNNSD SKHPSGYTII NNTAFTNNTA EGYGGAIYTN SATAPYLIDI
SVDDSYSQNG GVLVDENNSA AGYGDGPSSA AGGFMYLGLS EVTFDIADGK TLVIGNTEND
GAVDSIAGTG LITKTGSGDL VLNADNNDFT GEMQIENGEV TLGRSNSLMN VGDTHCQDDP
QDCYGLTIGS IDKYQNQAEL NVGSTQQTFA HSLTGFQNGT LNIDAGGNVT VNQGSFAGTI
EGAGQLTIAQ NGSYVLAGAQ SMALTGDIVV DAGAVLSLEG DAADLAALQD DPQSIVLNGG
MLDLSDFSTW QSGTSYKDGL EVSGSSGTVI GSQDVVDLAG GNDMHIGGDG KDGVYVVIDA
GDGQVSLAND NQYLGTTQIA SGTLMVSDNS QLGYTHYNRQ VIFTDKPQES VMEITANVDT
RSTTTEHGRD IEMRADGEVA VDAGVDTQWG ALMADSSGQH QDEGSTLTKT GAGTLELTAS
GTTQSAVRVE EGTLQGDVAD IFPYASSLWV GDGATFVTGA DQDIQSIDAT SSGTIDISDG
TVLRLTGQDT SVALNASLFN CDGTLVNATD GVTLTGELNT NLETDSLTYL SNVTVNGNLT
NTSGAVSLQN GVAGDTLTVN GDYTGGGTLL LDSELNGDDS VSDQLVMNGN TAGNTTVVVN
SITGIGEPTS TGIKVVDFAA DPTQFQNNAQ FSLAGSGYVN MGAYDYTLVE DNNDWYLRSQ
EVTPPSPPDP DPTPDPDPTQ DPDPTPDPEP TPAYQPVLNA KVGGYLNNLR AANQAFMMER
RDHAGGDGQT LNLRVIGGDY HYTAAGQLAQ HEDTSTVQLS GDLFSGRWGT DGEWMLGIVG
GYSDNQGDSR SSMTGTRADN QNHGYAVGLT SSWFQHGKQK QGAWLDNWLQ YAWFSNDVSE
HEDGVDHYHS SGIIASLEAG YQWLPGRGVV IEPQAQVIYQ GVQQDDFTAA NRARVSQSQG
DDIQTRLGLH SEWRTAVHVI PTLDLNYYHD PHSTEIEEDA STISDDAVKQ RGEIKVGVTG
NISQRVSLRG SVAWQKGSDD FAQTAGFLSM TVKW
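
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first and last blocks of the gene sequence as displayed above. It is an illustration only: the `translate` helper and the table names are hypothetical, and it relies on the assumption that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in which start codons are permitted).

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; third base varies fastest).
# Assumption: translation table 11 uses the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the display spaces and newlines
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and last two blocks of the gene sequence shown above.
print(translate("ATGATTGCAT CTCTTTTCCC TGCTAACGGT"))  # MIASLFPANG
print(translate("ACGGTGAAAT GGTAA"))                  # TVKW
```

The two outputs match the first ten and last four residues of the protein sequence listed above.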