Gene ECH74115_3092 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3092 
Symbol 
ID6967900 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2860965 
End bp2863445 
Gene Length2481 bp 
Protein Length826 aa 
Translation table11 
GC content45% 
IMG OID643386923 
Productfimbrial usher protein 
Protein accessionYP_002271391 
Protein GI209400432 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3188] P pilus assembly protein, porin PapC 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value0.595563 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGAGAA TGACCCCGCT TGCATCAGCA ATAGTCGCGT TATTGCTCGG CATTGAAGCA 
CATGCAGCAG AAGAAACCTT TGACACCCAT TTTATGATGG GAGGAATGAA AGGTGAGCAA
GTAACAAATT TGCGTCTTGA TGATAATCAG CCTTTACCTG GGCAGTACGA CATTGATATT
TATGTCAATA AACAATGGCG TGGAAAATAT GAGATTATCG TCAAAGATAA CCCACATGAA
ACTTGCCTGA CGCGTGAAAT TGTAAAGCGG TTAGGTATAA ATAGCGATAA TTTTGCACGC
GAAAATCAGT GTTTAACATT TGAGCAACTT GTTCAGGGCG GAAGCTACTC CTGGGATATT
GGGATTTTTC GTTTGGATCT CGCTGTTCCG CAAGCCTGGG TTGAAGAGCT GGAAAATGGT
TACGTTCCGC CAGAAAACTG GGAGCGGGGA ATCAATGCGT TTTATACGTC TTATTACGTA
AGTCAGTATT ACAGTGACTA TAAAGCCTCG GGCAATAGTA AGAGCACTTA TGTGCGTTTT
AACAGTGGGC TAAATTTGCT GGGATGGCAG CTGCATTCAG ATGCCAGTTT TAGCAAAACG
GATAATAATC CTGGTGAGTG GAAAAGCAAC ACGCTGTATC TGGAGCATGG ATTTTCTCAA
ATTCTCGGTA CGCTGCGTAT TGGTGATATG TACACCTCAG CCGATATATT TGATTCCGTT
CGCTTCACCG GGGTTCGGTT ATTTCGTGAT ATGCAGATGT TGCCAAACTC GAAGCAAAAT
TTTACCCCAC GGGTACAGGG GATTGCCCAG AGTAACGCGT TGGTAACTAT TGAACAGAAC
GGTTTTGTCG TTTATCAGAA AGAGGTTCCA CCAGGTCCGT TTTCGATTAG TGATTTGCAG
TTAGCGGGCG GAGGGGCGGA TCTTGATGTT AGTGTTAAGG AAGCCGATGG TTCTGTGACT
ACGTATCTGG TGCCTTATGC GGCAGTACCT AATATGCTGC AACCTGGTGT ATCAAAATAT
GATTTTGCGG CAGGCCGTAG TCATATTGAA GGTGCGAGCA AGCAAAGTGA TTTTGTCCAG
GCGGGTTATC AGTATGGTTT TAATAATTTA TTGACGCTGT ATGGCGGCAC GATGGTTGCT
AATAATTACT ATGCCTTTAC CCTCGGAACA GGTTGGAACA CGCGTATTGG CGCAATTTCA
GTCGATGCAA CTAAATCGCA CAGTAAGCAA GACAATGGTG ATGTGTTTGA CGGACAAAGT
TATCAAATTG CCTATAACAA ATTTTTGAGC CAAACATCAA CACGTTTTGG TCTGGCGGCC
TGGCGTTATT CGTCGCGTGA TTACCGCACA TTTAACGATT ATGTGTGGGC AAATAATAAA
GATAATTATC GTCGTGATAA AAACGATGTC TATGATATTG CCGATTATTA CCAGAATGAT
TTTGGTCGCA AAAATAGCTT TTCCGCCAAT ATGAGTCAGT CATTGCCAGA AGGCTGGGGA
TCTGTGTCAT TAAGTACTTT ATGGCGTGAT TACTGGGGGC GAAGCGGCAG TAGTAAAGAT
TATCAACTGA GTTATTCCAA TAACTGGCGG CGGATAAGCT ATACCCTCGC GGCAAGCCAG
GCGTATGACG AGAACCACGC CGAAGAGAAA CGCTTTAATA TTTTTATATC GATTCCCTTT
GACTGGGGCG ATGACGTTAC GACGCCGCGA CGGCAAATAT ATATGTCTAA CTCAACGACC
TTTGATGATC AGGGTTTTGC TTCAAACAAT ACCGGATTAT CGGGAACCGT TGGGAACCGC
GATCAGTTTA ACTATGGTAT CAACTTGAGC CATCAACATC AGGGAAATGA AACGACAGCT
GGGGCCAATT TGACCTGGAC CGCACCGGCC GCAACAGTGA ATGGCAGCTA TAGTCAGTCG
AGTACTTATC GACAGGTCGG AGCCAGTGTT TCAGGGGGAC TGGTTGCCTG GTCTGGTGGC
GTTAATCTGG CGAACCGTCT TTCCGAAACG TTTGCTGTGA TGCATGCGCC GGGAATCAAA
GATGCTTATG TCAATGGGCA AAAATATCGT ACAACAAACT GTAATGGTGT GGTGGTGTAC
GACGGACTGA CACCTTATCG GGAAAATCAC CTGATGATGG ATGTGTCGCA AAGCGATAGC
GAAACAGAAT TACGCGGGAA CCGTAAAATG ACCGCCCCTT ATCGCGGCGC GGTCGTTCTG
GTTGATTTTG ATACCGATCA GCGTAAGCCC TGGTTTATAA AAGCATTAAG ATCCGATGGA
CAGCCATTAA CGTTTGGTTA TGAAGTCAAT GATATGCATG GTCATAACAT TGGTGTTGTC
GGCCAGGGCA GTCAGATATT TATTCGCACC AATGAAATAC CGCCAGCGGT TAATGTGGCA
ATTGATAAGC AACAAGGGCT TTCATGCACA ATCACCTTCG GTAAAGAGAT TGATGAAAGT
AAAAATTATA TCTGTCGGTG A
 
Protein sequence
MLRMTPLASA IVALLLGIEA HAAEETFDTH FMMGGMKGEQ VTNLRLDDNQ PLPGQYDIDI 
YVNKQWRGKY EIIVKDNPHE TCLTREIVKR LGINSDNFAR ENQCLTFEQL VQGGSYSWDI
GIFRLDLAVP QAWVEELENG YVPPENWERG INAFYTSYYV SQYYSDYKAS GNSKSTYVRF
NSGLNLLGWQ LHSDASFSKT DNNPGEWKSN TLYLEHGFSQ ILGTLRIGDM YTSADIFDSV
RFTGVRLFRD MQMLPNSKQN FTPRVQGIAQ SNALVTIEQN GFVVYQKEVP PGPFSISDLQ
LAGGGADLDV SVKEADGSVT TYLVPYAAVP NMLQPGVSKY DFAAGRSHIE GASKQSDFVQ
AGYQYGFNNL LTLYGGTMVA NNYYAFTLGT GWNTRIGAIS VDATKSHSKQ DNGDVFDGQS
YQIAYNKFLS QTSTRFGLAA WRYSSRDYRT FNDYVWANNK DNYRRDKNDV YDIADYYQND
FGRKNSFSAN MSQSLPEGWG SVSLSTLWRD YWGRSGSSKD YQLSYSNNWR RISYTLAASQ
AYDENHAEEK RFNIFISIPF DWGDDVTTPR RQIYMSNSTT FDDQGFASNN TGLSGTVGNR
DQFNYGINLS HQHQGNETTA GANLTWTAPA ATVNGSYSQS STYRQVGASV SGGLVAWSGG
VNLANRLSET FAVMHAPGIK DAYVNGQKYR TTNCNGVVVY DGLTPYRENH LMMDVSQSDS
ETELRGNRKM TAPYRGAVVL VDFDTDQRKP WFIKALRSDG QPLTFGYEVN DMHGHNIGVV
GQGSQIFIRT NEIPPAVNVA IDKQQGLSCT ITFGKEIDES KNYICR