Gene ECH74115_0808 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0808 
Symbol 
ID6969852 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp829639 
End bp832089 
Gene Length2451 bp 
Protein Length816 aa 
Translation table11 
GC content51% 
IMG OID643384834 
Productfimbrial usher protein 
Protein accessionYP_002269340 
Protein GI209399371 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3188] P pilus assembly protein, porin PapC 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.404851 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAATATTT ATCGACTCTC TGTTCTTTCC TGTCTGGCAA TGGTAACCCC TCCTGCGCTG 
ACAGCTGAAT TTAACCTTAA CGTTCTCGAT AAGTCGATTC GGGATAGCGT TGATATTTCG
TTGCTCAATC AAAAAGGCGT CGTAGCTCCC GGCGATTACT TTGTTAGTGT TACGGTTAAT
AATAATAAAA TCAGTAATGG GCAACAAATT CGCTGGCAAA AATCTGGCGA TAAAATTATT
CCTTGTATCA ATGAATCGCT GATCGAACTC TTCGGACTTA AATCTGACTT TCGTAAAAAA
TTACCGGCAA TAAAAGAATG TGTCGATTTT AGTGTCTTCC CTGAAATCAT CTTTACTTTC
GATCAGGCAA ATCAACAGCT CAATATTACA ATCCCGCAGG CATGGCTGGC GTGGCATTCA
GAAAACTGGA CGCCACCCTC AACGTGGAAT AACGGTATTC CTGGCTTCCT GATGGATTAC
AACCTGTTTG CCAGCACTTA TCGCCCACAA AGCGGCAGCA GCAGCAACAA CCTGAACGCC
TATGGCACCA CGGGCCTCAA CGCCGGGGCA TGGCGTCTGC GCAGCGATTA TCAGCTTAGC
CAGTCCGATA GCGGCGACAA CCGTGAACAA TCTGGGGCGA TTTCACGGAC CTACCTGTTC
CGCCCGTTAC CGCAAATAGG TTCCCGATTA ACCCTGGGCG AAACCGATTT TAGCTCTAAC
ATTTTTGATG GTTTCTCCTA TACCGGTGCG GCGTTAGCCA GCGACGATCG AATGCTCCCC
TGGGAGTTAC GCGGCTACGC GCCGCAAATC AGCGGCATCG CACAGACTAA CGCCACGGTC
ACGATAAGCC ATTCTGGTCG CGTGATTTAC CAGAAAAAAG TTCCACCAGG CCCATTTATT
ATTGATGACC TGAATCAGTC AGTGCAAGGT ACACTGGATG TCAAAGTGAG CGAAGAAGAT
GGACGGGTGA ATAACTTCCA GGTTTCAGCG GCCTCTACCC CGTTCCTGAC GCGTCAGGGC
CAGGTGCGCT ATAAACTGGC TGCCGGTCGC CCGCGTTCCT CTATGTCGCA CCACACCGAG
GATGAAACCT TCATTAGCCA TGAAGTCTCC TGGGGAATGC TCTCCAACAC TTCACTATAT
GGCGGCATGC TTCTCGCAGG CGATGACTAT CGTTCCGGCG CGCTGGGTAT TGGGCAAAAT
ATGCTCTGGA TGGGGGCGCT GTCGTTTGAC GTGACCTGGG CCGACAGCCA TTTTGATACC
CAACAGGATG AGCAGGGTTA CAGCTACCGC TTTAACTACA GCAAACAGGT TGATGCGACC
AACAGCACCA TCTCATTGGC AGCCTATCGC TTCTCCGATC GCCATTTTCA CAGCTACGCT
AACTACATTG ATCACAAATA CAATGACGCC GACGCGCAGG ATGAAAAGCA AACAATCAGC
CTCTCCTTCG GTCAGCCTAT TACTCTGCTG AATCTTAATC TCTACGCCAA CATCCTGCAT
CAGAGCTGGT GGAATGCGGA TACCTCCACC ACGGCCAACA TTACCGTCGG TTTTAACGTC
GACATTGGCG ACTGGAAAGA TATTTCTGTC TCGACCTCGT TCAACACCAC CCATTATGAA
GATAAAGATC GCGACAACCA GATTTACTTT TCGATCTCCT TACCGATTGG GGAAAGTGGT
CGACTGGGTT ATGACATGCA GAACAACAGT AATACTACCA CCCACCGCAT GTCGTGGAAC
GATACCCTGG ATGAACGAAA CAGTTGGGGA ATGTCGGCTG GAATACAATC CGATCGCCCG
GATAACGGTG CGCAGGTCAG CGGCAATTAC CAACACCTCA GTTCGGCGGG GGAATGGGAT
CTGACCGGAA CCTATGCCGC CAATGATTAC ACTTCCGCCA GTGCCAGTTG GAGCGGCTCG
TTTACCGCGA CTCAACATGG CGCGGCATTC CATCGCCGCA GTTCCACCAA TGAACCACGC
CTGATGGTCA GCACCGACGG CGTGGGCGAT ATTCCGATTC AGGGCAATAT TGACTACACC
AACCGCTTTG GTATTGCCGT GGTACCGTTT GTTTCCAGCT ACCAGCCTAC GACGGTGGCG
GTCAATATGA ACGACTTGCC TGACGGCGTT ACCGTTTCCG AAAACGTGGT GAAAGAAACC
TGGACCGAAG GCGCCATCGG CTTTAAATCG CTGGCGTCCC GCGCCGGGAA AGATCTCAAC
GTCATTATCA GCGATGCTAA CGGGCATTTC CCGCCGCTCG GTGCCGATGT TCGCCAGGCT
GAAGGCGGTG TCAGCGTCGG GATGGTTGGC GAAAACGGTC ACGCATGGCT GAGTGGGGTC
GATGAAAATC AACAATTTAC CGTGCACTGG GGCGACCAGA AAACGTGTGC CATTCATCTG
CCGGAACATC TGGAAGATGT GACTAAGCGC CTGATTTTAC CCTGTCATTA A
 
Protein sequence
MNIYRLSVLS CLAMVTPPAL TAEFNLNVLD KSIRDSVDIS LLNQKGVVAP GDYFVSVTVN 
NNKISNGQQI RWQKSGDKII PCINESLIEL FGLKSDFRKK LPAIKECVDF SVFPEIIFTF
DQANQQLNIT IPQAWLAWHS ENWTPPSTWN NGIPGFLMDY NLFASTYRPQ SGSSSNNLNA
YGTTGLNAGA WRLRSDYQLS QSDSGDNREQ SGAISRTYLF RPLPQIGSRL TLGETDFSSN
IFDGFSYTGA ALASDDRMLP WELRGYAPQI SGIAQTNATV TISHSGRVIY QKKVPPGPFI
IDDLNQSVQG TLDVKVSEED GRVNNFQVSA ASTPFLTRQG QVRYKLAAGR PRSSMSHHTE
DETFISHEVS WGMLSNTSLY GGMLLAGDDY RSGALGIGQN MLWMGALSFD VTWADSHFDT
QQDEQGYSYR FNYSKQVDAT NSTISLAAYR FSDRHFHSYA NYIDHKYNDA DAQDEKQTIS
LSFGQPITLL NLNLYANILH QSWWNADTST TANITVGFNV DIGDWKDISV STSFNTTHYE
DKDRDNQIYF SISLPIGESG RLGYDMQNNS NTTTHRMSWN DTLDERNSWG MSAGIQSDRP
DNGAQVSGNY QHLSSAGEWD LTGTYAANDY TSASASWSGS FTATQHGAAF HRRSSTNEPR
LMVSTDGVGD IPIQGNIDYT NRFGIAVVPF VSSYQPTTVA VNMNDLPDGV TVSENVVKET
WTEGAIGFKS LASRAGKDLN VIISDANGHF PPLGADVRQA EGGVSVGMVG ENGHAWLSGV
DENQQFTVHW GDQKTCAIHL PEHLEDVTKR LILPCH