Gene ECH74115_3480 details

Gene Information

Locus tag: ECH74115_3480
Symbol:
ID: 6966571
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3220670
End bp: 3223252
Gene Length: 2583 bp
Protein Length: 860 aa
Translation table: 11
GC content: 55%
IMG OID: 643387286
Product: fimbrial usher protein
Protein accession: YP_002271749
Protein GI: 209396561
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3188] P pilus assembly protein, porin PapC
TIGRFAM ID:
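
The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes the start and end positions are inclusive, and that the protein length excludes the terminal stop codon (the gene sequence in this record ends in TAA). The variable names are illustrative only and are not part of the record.

```python
# Values copied from the record above; variable names are illustrative.
start_bp = 3220670
end_bp = 3223252
gene_length_bp = 2583
protein_length_aa = 860

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# For a non-spliced CDS, the span splits into whole codons.
# Assumption: the last codon is the stop codon and encodes no residue.
codons, remainder = divmod(span, 3)
residues = codons - 1

print("span (bp):", span)
print("leftover bases:", remainder)
print("residues (aa):", residues)
```

Under these assumptions the computed span matches the reported gene length, and the residue count matches the reported protein length.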


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 61
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGGGA GTTATGTCAA TGCCTGGGCT GAAAATGAAA TTCAGTTTGA TTCCCGTTTT 
CTGGAGTTAA AAGGCGACAC AAAAATCGAT CTGAAGCGAT TTTCCAGCCA GGGTTATGTC
GAACCTGGGA AATACAATTT ACAGGTTCAA CTAAATAAAC AGCCGCTGAC GGAAGAATAC
GATATTTACT GGTACGCCTC TGAGAACGAT GCCAGTAAAA CCTATGCCTG CCTGACGCCT
GAACTGGTCG CGCAGTTTGG CTTAAAAGAG GATGTGGCAA AAAACCTGCA ATGGATCCAC
GACGGCAAAT GCCTGAAACC CGGTCAACTG GAAGGCATTG ATATTAAAGC TGACCTGAGT
CAGTCAGCGT TAGTCATTTC ATTACCCCAG GCTTACCTTG AATATACCGA CATCAACTGG
GATCCGCCTT CACGCTGGGA TGACGGTATA TCTGGTTTAA TTGCTGACTA CAGTATTACC
GCCCAGACAC GACATGAAGA AAATGGCGGG GATGACAGCA ATGAAATTAG CGGTAACGGG
ACGGTTGGGG TGAACCTCGG CGCATGGCGT CTTCGTGCCG ACTGGCAGAC TGATTATTTG
CATAGTAAAA GCAATGATGA CGATGTTATC AACGGTGATG ACACGCAAAA AAACTGGGAG
TGGAGCCGCT ACTACGCCTG GCGAGCCTTA CCGTCGCTAA AAGCCAAACT TGGCCTTGGC
GAAGACTACC TGAATTCTGA TATTTTCGAC GGCTTTAACT ACGTGGGTGG CAGTATCAGC
ACCGACGATC AAATGTTGCC GCCGAATCTG CGCGGCTATG CGCCGGATAT CTCCGGCGTG
GCGCACACCA CCGCGAAAGT GACCGTCAGC CAGTTGGGCC GCGTCATCTA CGAAACCCAG
GTCCCGGCGG GGCCGTTCCG CATCCAGGAT CTTGGCGATT CGGTCTCCGG TACGCTGCAT
ATCCGCATTG AAGAACAGAA CGGTCAGGTG CAGGAATATG ACATCAACAC CGCCTCGATG
CCGTTCCTGA CTCGCCCCGG CCAGGTGCGC TATAAACTGA TGATGGGCCG CCCGCAGGAG
TGGGGGCACC ACGTGGAAGG CGGTTTCTTC TCCGGCGGCG AAGCTTCCTG GGGGATTGCC
AACGGCTGGT CGCTATACGG CGGGGCGCTG GCAGATGAAC ACTATCAGTC GGCGGCGCTT
GGCGTCGGTC GCGACCTGTC TGTGTTTGGT GCGGTGGCCT TTGATATCAC CCACTCGCAT
ACCCGTCTGG ATAAAGAGAC CGCCTACGGG AAAGGTTCAC TGGACGGCAA CTCGTTTCGC
CTGAGCTATT CCAAAGACTT CGATGAACTG AACAGCCGCG TCACTTTTGC CGGATACCGC
TTCTCGGAAG AGAACTTCAT GACCATGAGC GAGTATCTCG ATGCCAGCGA CAGCGAAATG
GTGCGCACCG GCAACGACAA AGAGATGTAC ACCGCCACCT ATAACCAGAA CTTCAGGGAT
GCCGGTGTGT CTGTTTATCT CAACTACACC CGCCATACCT ACTGGGATCG CGACGAACAG
ACCAACTACA ACGTCATGCT CTCGCACTAC TTCAACCTGG GCAGTATCCG CAACATGAGC
ATTTCCATGA CCGGATACCG CTACGAGTAT GACAACCAGG CCGATAAAGG TGTGTACATA
TCGCTCAGTA TGCCGTGGGG TGACAGCAGC ACCATCAGCT ATAACGGCAA CTACGGCAGC
GGTTCGGACA GCAGCCAGGT GGGGTATTTC AGCCGTGTCG ATGACGCAAC CCATTACCAG
TTGAACGTAG GCACCAGCGA CAATCACTCC AGCGTTGACG GTTATTACAG CCACGACGGA
TCGCTGGCGC AGGTCGATCT CAGCGCTAAC TACCATGAAG GGCAGTACAC CTCGGCGGGT
ATTTCCTTAC AGGGCGGCGC GACGCTCACC GCACAAGGTG GCGCGCTCCA CCGTACCCAG
AATATGGGCG GTACGCGTCT GCTGATTGAT GCCGACGGTG TGGCTGGTGT TCCGGTGGAA
GGAAATGGCG CGGCGGTTTA CACCAATATG TTCGGTAAGG CAGTGGTGGC AGACGTCAAC
AACTACTACC GCAACCAGGC GTATATCGAC CTAAACAACC TGCCGGAAAA CGCCGAAGCC
ACCCAGTCCG TGGTGCAGGC CACGCTTACC GAAGGGGCCA TTGGCTACCG TAAGTTCTCG
GTGATCAGCG GGCAAAAAGC GATGGCGGTG CTGCGTCTGC AAGATGGCAG TTATCCGCCG
TTTGGCGCGG AAGTGAAAAA CGACAGCGCG CAGAACGTCG GTCTGGTTGA CGATGACGGC
AACGTCTACC TCGCGGGCGT AAAACCTGGC GAGCATATGA TCGTTTCATG GGGCGGTGTG
GCCCACTGCG ATATTCATCT GCCTGACCCG CTGCCAGCCG ATCTGTTCAA TGGCCTGTTA
TTACCATGCC AGCAAACAGG GGCGATATCT CCTTCGATGC CTCATGAAAT TAAGCCGGTG
ATCCAGGAGC AGACCCAGCA GGTGATGCCA ACGGAAGCGC CAGTATCGGT ATCAGCCAAT
TAA
 
Protein sequence
MSGSYVNAWA ENEIQFDSRF LELKGDTKID LKRFSSQGYV EPGKYNLQVQ LNKQPLTEEY 
DIYWYASEND ASKTYACLTP ELVAQFGLKE DVAKNLQWIH DGKCLKPGQL EGIDIKADLS
QSALVISLPQ AYLEYTDINW DPPSRWDDGI SGLIADYSIT AQTRHEENGG DDSNEISGNG
TVGVNLGAWR LRADWQTDYL HSKSNDDDVI NGDDTQKNWE WSRYYAWRAL PSLKAKLGLG
EDYLNSDIFD GFNYVGGSIS TDDQMLPPNL RGYAPDISGV AHTTAKVTVS QLGRVIYETQ
VPAGPFRIQD LGDSVSGTLH IRIEEQNGQV QEYDINTASM PFLTRPGQVR YKLMMGRPQE
WGHHVEGGFF SGGEASWGIA NGWSLYGGAL ADEHYQSAAL GVGRDLSVFG AVAFDITHSH
TRLDKETAYG KGSLDGNSFR LSYSKDFDEL NSRVTFAGYR FSEENFMTMS EYLDASDSEM
VRTGNDKEMY TATYNQNFRD AGVSVYLNYT RHTYWDRDEQ TNYNVMLSHY FNLGSIRNMS
ISMTGYRYEY DNQADKGVYI SLSMPWGDSS TISYNGNYGS GSDSSQVGYF SRVDDATHYQ
LNVGTSDNHS SVDGYYSHDG SLAQVDLSAN YHEGQYTSAG ISLQGGATLT AQGGALHRTQ
NMGGTRLLID ADGVAGVPVE GNGAAVYTNM FGKAVVADVN NYYRNQAYID LNNLPENAEA
TQSVVQATLT EGAIGYRKFS VISGQKAMAV LRLQDGSYPP FGAEVKNDSA QNVGLVDDDG
NVYLAGVKPG EHMIVSWGGV AHCDIHLPDP LPADLFNGLL LPCQQTGAIS PSMPHEIKPV
IQEQTQQVMP TEAPVSVSAN