Gene ECH74115_3895 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3895 
Symbol 
ID6967177 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3604560 
End bp3609275 
Gene Length4716 bp 
Protein Length1571 aa 
Translation table11 
GC content49% 
IMG OID643387672 
Producthypothetical protein 
Protein accessionYP_002272121 
Protein GI209399930 
COG category[M] Cell wall/membrane/envelope biogenesis
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3468] Type V secretory pathway, adhesin AidA 
TIGRFAM ID[TIGR01414] outer membrane autotransporter barrel domain 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACACAA TACACTTGCG CTGTCTCTTC AGGATGAATC CCCTGGTCTG GTGCCTGTGG 
GCTGATGTTG CAGCAAAGCT AAGGTCGCTT AAACGCTACT CAGTATTCAC TTTTCAGAGG
ATGAAATTTA TGAACAGGAC CAGTCCCTAT TATTGTCGTC GCTCAGTACT TTCCTTATTG
ATATCTGCCT TGATATATGC CCCGCCCGGG ATGGCTGCCT TCACTCCTGA TGTTATTGGT
GTGGTAAACG ATGAGACTGT AGATGGCAGC CAACGAGTAG ATGAACGAGG TACAACAAAT
AACACTCATA TTATCAACCA TGGCCAGCAG AATGTTTATG GCGGGGTATC TAATGGAAGT
CTTATTGAAT CTGGTGGATA TCAAGATGTA GGAAGGCATA ACAATTATGT GGGGCAGTCT
AATAATACCA CCATTAACGG GGGCAGACAG TCAATTCATG ACGGGGGTAT TTCCACAGGT
ACGATAATCG AGAGTGGCAA TCAGGACGTT TATAAAGGGG GTATCAGCAA TGGAACGACA
ATTAAGGGCG GTGCTTCACG CGTAGAGGGA GGGAGTGCGA ATGGAACACT CATTGATGGT
GGTAGCCAGA TAGTAAAAGT TCAAGGGCAT GCTGATGGTA CAACGATAAA TAAGTCTGGC
TCTCAGGACG TAGTACAAGG AAGTCTGGCA ACGAACACAA CCATAAATGG TGGTCGACAG
TATGTTGAAC AGAGCACAGT AGAAACAACC ACCATCAAAA ATGGCGGTGA GCAAAGAGTA
TATGAGAGCC GTGCGCTGGA CACGACGATT GAAGGCGGAA CTCAGTCTCT GAATAGTAAG
TCAACGGCAA AAAATACTCA GATCTATTCT GGTGGTACGC AAATTATTGA TAACACCAGC
TCCTCGGATG TTATTGAAGT TTATTCCGGT GGCGTGCTTG ATGTTAGTGG TGGTACGGCA
ACAAATGTTA CCCAGCACGA TGGTGCAATT TTAAAAACTA ACACTAACGG TACGACGGTG
AGCGGTACGA ATAGTGAAGG TGCATTCTCC ATCCACAATC ACGTGGCAGA CAATGTGTTG
CTGGAAAACG GTGGTCATTT AGACATAAAC GCATATGGTT CGGCAAACAA GACGATTATT
AAAGATAAAG GAACAATGTC AGTTTTAACC AATGCTAAAG CTGATGCGAC CCGAATAGAT
AATGGCGGGG TTATGGATGT TGCAGGAAAC GCGACAAATA CCATAATTAA TGGTGGCACA
CAGAATATTA ATAATTATGG CATAGCCACA GGCACCAATA TCAACAGCGG AACGCAAAAT
ATCAAAAGCG GCGGGAAAGC TGACACAACA ATTATATCCT CCGGGAGCCG GCAGGTTGTT
GAGAAAGATG GTACGGCAAT TGGCAGCAAT ATTAGCGCCG GAGGCTCGCT GATTGTCTAT
ACCGGCGGTA TTGCACATGG GGTTAACCAG GAGACGGGCA GTGCTTTAGT TGCCAACACG
GGTGCAGGGA CTGATATCGA AGGATACAAC AAGCTCTCTC ACTTCACTAT TACCGGAGGG
GAGGCTAATT ATGTTGTGCT GGAAAATACC GGCGAACTGA CGGTAGTGGC TAAAACCTCG
GCGAAAAATA CTACCATTGA TGCTGGCGGT AAGCTGATTG TCCAGAAGGA GGCTAAAACA
GATAGCACCA GACTTAATAA TGGCGGCGTT CTGGAGGTTC AGGACGGTGG TGAGGCTAAG
CATGTTGAGC AACAATCCGG CGGCGCATTA ATTGCTTCCA CGACCTCCGG AACACTTATC
GAAGGAACCA ACAGTTATGG TGATGCTTTC TACATCAGGA ATTCAGAAGC TAAAAATGTA
GTGCTGGAAA ACGCTGGCTC ATTAACAGTC GTCACTGGTT CCCGGGCAGT TGACACGATT
ATTAATGCCA ACGGCAAAAT GGATGTTTAT GGAAAAGATG TTGGCACTGT ACTCAATAGT
GCTGGCACCC AAACAATATA TGCCAGTGCC ACTTCTGATA AAGCAAATAT CAAAGGTGGC
AAGCAAACGG TATATGGTTT AGCCACTGAA GCAAATATCG AAAGTGGTGA ACAAATTGTT
GATGGTGGGT CAACAGAGAA AACACACATC AATGGTGGCA CGCAAACCGT TCAGAATTAT
GGTAAGGCGA TCAATACCGA TATCGTCTCT GGCCTACAAC AAATTATGGC AAACGGGACA
GCGGAAGGTT CCATTATTAA TGGCGGTTCA CAGATAGTTA ATGAGGGCGG TCTGGCTGAA
AACTCGGTGC TTAATGATGG CGGCACACTC GATGTGCGGG AGAAAGGCAG CGCAACGGGG
ATACAGCAGA GTAGCCAGGG CGCGTTGGTT GCAACCACCA GGGCGACGCG GGTCACAGGA
ACACGCGCGG ATGGCGTCGC GTTCAGCATC GAGCAGGGTG CGGCGAACAA TATCCTGCTG
GCAAATGGCG GAGTGTTAAC CGTGGAGTCA GACACCTCTT CTGACAAAAC ACAGGTCAAT
ACGGGCGGAC GGGAGATCGT CAAAACAAAA GCCACTGCGA CAGGCACGAC GCTCACCGGC
GGTGAACAAA TTGTCGAGGG TGTGGCGAAT GAGACAACAA TTAACGACGG CGGAATACAA
ACAGTTTCAG CTAACGGAGA GGCAATAAAA ACAACGATCA ATGAAGGCGG TACGCTGACA
GTCAACGATA ATGGCAAAGC GACAGATATC GTCCAGAACA GCGGTGCCGC TCTCCAGACG
AGCACGGCTA ACGGTATTGA AATCAGCGGT ACTCACCAGT ACGGCACTTT TTCCATTTCC
GGCAATTTAG CGACCAATAT GTTGCTGGAA AATGGCGGTA ATTTATTGGT ATTAGCAGGT
ACCGAAGCTC GCGACTCCAC GGTTGGCAAG GGGGGGGCAA TGCAAAACCA GGGTCAGGAC
TCCGCCACAA AGGTTAACTC TGGTGGGCAA TATACCCTTG GGCGGTCAAA AGATGAGTTT
CAGGCTCTGG CCCGGGCAGA AGATCTCCAG GTTGCTGGCG GGACAGCAAT CGTCTACGCA
GGTACGCTGG CGGATGCATC GGTCAGTGGC GCGACAGGAA GCCTGTCGTT AATGACGCCA
CGGGATAATG TTACGCCAGT TAAACTCGAA GGGGCGATCC GGATTACCGA TAGCGCGACA
TTAACTATCG GCAATGGCGT TGATACGACG CTTGCCGACC TGACGGCTGC CAGCCGGGGC
AGTGTCTGGC TTAACAGCAA TAATTCCTGT GCAGGCACCA GCAACTGCGA GTATAGAGTA
AACAGTTTGC TACTTAACGA CGGTAATGTT TATTTATCAG CACAAACAGC AGCGCCTGCC
ACAACTAACG GTATATACAA TACGCTGACA ACCAATGAAC TTTCCGGTAG CGGTAATTTC
TACCTGCATA CCAACGTTGC AGGCTCTCGG GGCGATCAAC TGGTCGTCAA CAACAACGCC
ACTGGTAATT TTAAAATCTT TGTTCAGGAT ACCGGCGTCA GTCCTCAGTC TGACGACGCG
ATGACGCTGG TGAAAACAGG GGGAGGGGAT GCTTCGTTTT CGCTGGGCAA TACTGGCGGT
TTCGTTGATC TTGGGACCTA TGAGTATGTC CTGAAAAGCG ATGGCAACAG CAACTGGAAC
CTGACCAATG ATGTCAAACC CAACCCGGAT CCCAACCCAA ATCCCAACCC AAATCCGAAG
CCGGATCCAA AACCAGACCC AAAACCGGAT CCGAAACCAG ACCCGACTCC CGAGCCAACG
CCGACACCCG TTCCGGAGAA ACGCATCACG CCTTCTACCG CAGCCGTACT CAATATGGCA
GCAACATTAC CGTTGGTATT TGATGCTGAG CTAAACAGTA TTCGCGAGCG GTTGAACATA
ATGAAAGCGA GTCCACACAA CAATAATGTC TGGGGGGCGA CGTATAACAC CCGTAATAAT
GTCACCACCG ATGCGGGGGC CGGGTTTGAG CAGACGCTGA CCGGAATGAC AGTGGGGATC
GACAGCCCTA ATGATATTCC TGAGGGGATT GCGACGCTGG GCGCTTTTAT GGGTTATTCC
CATTCACATA TCGGTTTTGA TCGCGGAGGA CATGGCAGTG TGGGCAGTTA TTCTCTGGGC
GGCTATGCCA GTTGGGAACA TGAAAGTGGT TTCTATCTGG ACGGTGTCGT GAAGCTGAAC
CGTTTTGAAA GTAACGTAGC CGGTAAAATG AGCAGCGGTG GAGCCGCCAA TGGCAGTTAC
CACAGCAACG GGCTGGGCGG TCACATTGAA ACCGGGATGC GATTTACCGA TGGTAACTGG
AACCTGACGC CGTATGCATC GTTAACGGGG TTCACCGCTG ATAACCCCGA ATATCATTTA
TCCAATGGCA TGGAATCGAA ATCAGTCGAT ACCCGCAGTA TATATCGTGA ACTGGGCGCA
ACGCTGAGTT ACAACATGCG TCTGGGGAAC GGTATGGAAA TTGAGCCGTG GCTGAAGGCG
GCTGTGCGCA AAGAATTTGT CGATGATAAC CGGGTGAAGG TGAATAATGA CGGTAATTTC
GTCAATGATT TGTCGGGCAG ACGTGGAATA TACCAGGCAG GTATTAAAGC CTCATTCAGC
AGTACGTTAA GCGGGCATCT TGGGGTGGGG TATAGCCATG GTGCCGGTGT GGAATCCCCG
TGGAACGCGG TAGCTGGTGT GAACTGGTCG TTCTGA
 
Protein sequence
MNTIHLRCLF RMNPLVWCLW ADVAAKLRSL KRYSVFTFQR MKFMNRTSPY YCRRSVLSLL 
ISALIYAPPG MAAFTPDVIG VVNDETVDGS QRVDERGTTN NTHIINHGQQ NVYGGVSNGS
LIESGGYQDV GRHNNYVGQS NNTTINGGRQ SIHDGGISTG TIIESGNQDV YKGGISNGTT
IKGGASRVEG GSANGTLIDG GSQIVKVQGH ADGTTINKSG SQDVVQGSLA TNTTINGGRQ
YVEQSTVETT TIKNGGEQRV YESRALDTTI EGGTQSLNSK STAKNTQIYS GGTQIIDNTS
SSDVIEVYSG GVLDVSGGTA TNVTQHDGAI LKTNTNGTTV SGTNSEGAFS IHNHVADNVL
LENGGHLDIN AYGSANKTII KDKGTMSVLT NAKADATRID NGGVMDVAGN ATNTIINGGT
QNINNYGIAT GTNINSGTQN IKSGGKADTT IISSGSRQVV EKDGTAIGSN ISAGGSLIVY
TGGIAHGVNQ ETGSALVANT GAGTDIEGYN KLSHFTITGG EANYVVLENT GELTVVAKTS
AKNTTIDAGG KLIVQKEAKT DSTRLNNGGV LEVQDGGEAK HVEQQSGGAL IASTTSGTLI
EGTNSYGDAF YIRNSEAKNV VLENAGSLTV VTGSRAVDTI INANGKMDVY GKDVGTVLNS
AGTQTIYASA TSDKANIKGG KQTVYGLATE ANIESGEQIV DGGSTEKTHI NGGTQTVQNY
GKAINTDIVS GLQQIMANGT AEGSIINGGS QIVNEGGLAE NSVLNDGGTL DVREKGSATG
IQQSSQGALV ATTRATRVTG TRADGVAFSI EQGAANNILL ANGGVLTVES DTSSDKTQVN
TGGREIVKTK ATATGTTLTG GEQIVEGVAN ETTINDGGIQ TVSANGEAIK TTINEGGTLT
VNDNGKATDI VQNSGAALQT STANGIEISG THQYGTFSIS GNLATNMLLE NGGNLLVLAG
TEARDSTVGK GGAMQNQGQD SATKVNSGGQ YTLGRSKDEF QALARAEDLQ VAGGTAIVYA
GTLADASVSG ATGSLSLMTP RDNVTPVKLE GAIRITDSAT LTIGNGVDTT LADLTAASRG
SVWLNSNNSC AGTSNCEYRV NSLLLNDGNV YLSAQTAAPA TTNGIYNTLT TNELSGSGNF
YLHTNVAGSR GDQLVVNNNA TGNFKIFVQD TGVSPQSDDA MTLVKTGGGD ASFSLGNTGG
FVDLGTYEYV LKSDGNSNWN LTNDVKPNPD PNPNPNPNPK PDPKPDPKPD PKPDPTPEPT
PTPVPEKRIT PSTAAVLNMA ATLPLVFDAE LNSIRERLNI MKASPHNNNV WGATYNTRNN
VTTDAGAGFE QTLTGMTVGI DSPNDIPEGI ATLGAFMGYS HSHIGFDRGG HGSVGSYSLG
GYASWEHESG FYLDGVVKLN RFESNVAGKM SSGGAANGSY HSNGLGGHIE TGMRFTDGNW
NLTPYASLTG FTADNPEYHL SNGMESKSVD TRSIYRELGA TLSYNMRLGN GMEIEPWLKA
AVRKEFVDDN RVKVNNDGNF VNDLSGRRGI YQAGIKASFS STLSGHLGVG YSHGAGVESP
WNAVAGVNWS F