Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3480 |
Symbol | |
ID | 6966571 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3220670 |
End bp | 3223252 |
Gene Length | 2583 bp |
Protein Length | 860 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643387286 |
Product | fimbrial usher protein |
Protein accession | YP_002271749 |
Protein GI | 209396561 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3188] P pilus assembly protein, porin PapC |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGGGA GTTATGTCAA TGCCTGGGCT GAAAATGAAA TTCAGTTTGA TTCCCGTTTT CTGGAGTTAA AAGGCGACAC AAAAATCGAT CTGAAGCGAT TTTCCAGCCA GGGTTATGTC GAACCTGGGA AATACAATTT ACAGGTTCAA CTAAATAAAC AGCCGCTGAC GGAAGAATAC GATATTTACT GGTACGCCTC TGAGAACGAT GCCAGTAAAA CCTATGCCTG CCTGACGCCT GAACTGGTCG CGCAGTTTGG CTTAAAAGAG GATGTGGCAA AAAACCTGCA ATGGATCCAC GACGGCAAAT GCCTGAAACC CGGTCAACTG GAAGGCATTG ATATTAAAGC TGACCTGAGT CAGTCAGCGT TAGTCATTTC ATTACCCCAG GCTTACCTTG AATATACCGA CATCAACTGG GATCCGCCTT CACGCTGGGA TGACGGTATA TCTGGTTTAA TTGCTGACTA CAGTATTACC GCCCAGACAC GACATGAAGA AAATGGCGGG GATGACAGCA ATGAAATTAG CGGTAACGGG ACGGTTGGGG TGAACCTCGG CGCATGGCGT CTTCGTGCCG ACTGGCAGAC TGATTATTTG CATAGTAAAA GCAATGATGA CGATGTTATC AACGGTGATG ACACGCAAAA AAACTGGGAG TGGAGCCGCT ACTACGCCTG GCGAGCCTTA CCGTCGCTAA AAGCCAAACT TGGCCTTGGC GAAGACTACC TGAATTCTGA TATTTTCGAC GGCTTTAACT ACGTGGGTGG CAGTATCAGC ACCGACGATC AAATGTTGCC GCCGAATCTG CGCGGCTATG CGCCGGATAT CTCCGGCGTG GCGCACACCA CCGCGAAAGT GACCGTCAGC CAGTTGGGCC GCGTCATCTA CGAAACCCAG GTCCCGGCGG GGCCGTTCCG CATCCAGGAT CTTGGCGATT CGGTCTCCGG TACGCTGCAT ATCCGCATTG AAGAACAGAA CGGTCAGGTG CAGGAATATG ACATCAACAC CGCCTCGATG CCGTTCCTGA CTCGCCCCGG CCAGGTGCGC TATAAACTGA TGATGGGCCG CCCGCAGGAG TGGGGGCACC ACGTGGAAGG CGGTTTCTTC TCCGGCGGCG AAGCTTCCTG GGGGATTGCC AACGGCTGGT CGCTATACGG CGGGGCGCTG GCAGATGAAC ACTATCAGTC GGCGGCGCTT GGCGTCGGTC GCGACCTGTC TGTGTTTGGT GCGGTGGCCT TTGATATCAC CCACTCGCAT ACCCGTCTGG ATAAAGAGAC CGCCTACGGG AAAGGTTCAC TGGACGGCAA CTCGTTTCGC CTGAGCTATT CCAAAGACTT CGATGAACTG AACAGCCGCG TCACTTTTGC CGGATACCGC TTCTCGGAAG AGAACTTCAT GACCATGAGC GAGTATCTCG ATGCCAGCGA CAGCGAAATG GTGCGCACCG GCAACGACAA AGAGATGTAC ACCGCCACCT ATAACCAGAA CTTCAGGGAT GCCGGTGTGT CTGTTTATCT CAACTACACC CGCCATACCT ACTGGGATCG CGACGAACAG ACCAACTACA ACGTCATGCT CTCGCACTAC TTCAACCTGG GCAGTATCCG CAACATGAGC ATTTCCATGA CCGGATACCG CTACGAGTAT GACAACCAGG CCGATAAAGG TGTGTACATA TCGCTCAGTA TGCCGTGGGG TGACAGCAGC ACCATCAGCT ATAACGGCAA CTACGGCAGC GGTTCGGACA GCAGCCAGGT GGGGTATTTC AGCCGTGTCG ATGACGCAAC CCATTACCAG TTGAACGTAG GCACCAGCGA CAATCACTCC AGCGTTGACG GTTATTACAG CCACGACGGA TCGCTGGCGC AGGTCGATCT CAGCGCTAAC TACCATGAAG GGCAGTACAC CTCGGCGGGT ATTTCCTTAC AGGGCGGCGC GACGCTCACC GCACAAGGTG GCGCGCTCCA CCGTACCCAG AATATGGGCG GTACGCGTCT GCTGATTGAT GCCGACGGTG TGGCTGGTGT TCCGGTGGAA GGAAATGGCG CGGCGGTTTA CACCAATATG TTCGGTAAGG CAGTGGTGGC AGACGTCAAC AACTACTACC GCAACCAGGC GTATATCGAC CTAAACAACC TGCCGGAAAA CGCCGAAGCC ACCCAGTCCG TGGTGCAGGC CACGCTTACC GAAGGGGCCA TTGGCTACCG TAAGTTCTCG GTGATCAGCG GGCAAAAAGC GATGGCGGTG CTGCGTCTGC AAGATGGCAG TTATCCGCCG TTTGGCGCGG AAGTGAAAAA CGACAGCGCG CAGAACGTCG GTCTGGTTGA CGATGACGGC AACGTCTACC TCGCGGGCGT AAAACCTGGC GAGCATATGA TCGTTTCATG GGGCGGTGTG GCCCACTGCG ATATTCATCT GCCTGACCCG CTGCCAGCCG ATCTGTTCAA TGGCCTGTTA TTACCATGCC AGCAAACAGG GGCGATATCT CCTTCGATGC CTCATGAAAT TAAGCCGGTG ATCCAGGAGC AGACCCAGCA GGTGATGCCA ACGGAAGCGC CAGTATCGGT ATCAGCCAAT TAA
|
Protein sequence | MSGSYVNAWA ENEIQFDSRF LELKGDTKID LKRFSSQGYV EPGKYNLQVQ LNKQPLTEEY DIYWYASEND ASKTYACLTP ELVAQFGLKE DVAKNLQWIH DGKCLKPGQL EGIDIKADLS QSALVISLPQ AYLEYTDINW DPPSRWDDGI SGLIADYSIT AQTRHEENGG DDSNEISGNG TVGVNLGAWR LRADWQTDYL HSKSNDDDVI NGDDTQKNWE WSRYYAWRAL PSLKAKLGLG EDYLNSDIFD GFNYVGGSIS TDDQMLPPNL RGYAPDISGV AHTTAKVTVS QLGRVIYETQ VPAGPFRIQD LGDSVSGTLH IRIEEQNGQV QEYDINTASM PFLTRPGQVR YKLMMGRPQE WGHHVEGGFF SGGEASWGIA NGWSLYGGAL ADEHYQSAAL GVGRDLSVFG AVAFDITHSH TRLDKETAYG KGSLDGNSFR LSYSKDFDEL NSRVTFAGYR FSEENFMTMS EYLDASDSEM VRTGNDKEMY TATYNQNFRD AGVSVYLNYT RHTYWDRDEQ TNYNVMLSHY FNLGSIRNMS ISMTGYRYEY DNQADKGVYI SLSMPWGDSS TISYNGNYGS GSDSSQVGYF SRVDDATHYQ LNVGTSDNHS SVDGYYSHDG SLAQVDLSAN YHEGQYTSAG ISLQGGATLT AQGGALHRTQ NMGGTRLLID ADGVAGVPVE GNGAAVYTNM FGKAVVADVN NYYRNQAYID LNNLPENAEA TQSVVQATLT EGAIGYRKFS VISGQKAMAV LRLQDGSYPP FGAEVKNDSA QNVGLVDDDG NVYLAGVKPG EHMIVSWGGV AHCDIHLPDP LPADLFNGLL LPCQQTGAIS PSMPHEIKPV IQEQTQQVMP TEAPVSVSAN
|
| |