Gene Information |
Locus tag | ECH74115_2118 |
Symbol | |
ID | 6968645 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2024700 |
End bp | 2026922 |
Gene Length | 2223 bp |
Protein Length | 740 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 643386016 |
Product | fimbrial usher family protein |
Protein accession | YP_002270505 |
Protein GI | 209396439 |
COG category | [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3188] P pilus assembly protein, porin PapC |
TIGRFAM ID | |
|
|
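The coordinate fields in this record are related by simple arithmetic, which can serve as a sanity check. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive (an assumption, though it is consistent with the stated gene length) and that the protein length excludes the terminal stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(2024700, 2026922)
print(length)                     # 2223
print(protein_length_aa(length))  # 740
```

Both values agree with the Gene Length and Protein Length fields. Because the strand is "-", the coordinates most likely refer to the forward strand of the replicon, while the gene itself is read from the reverse strand.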
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.167309 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGTCGACG CATCAACTGA GTTTGATGTT GGTCAGCAAC ATCTGTCACT CTCCGTTCCG CAAATTTATG TTGGCAGAAT GGCTCGCGGC TATGTTTCTC CTGATCTGTG GGAAGAAGGA ATAAATGCTG GGCTTCTAAA CTATAGTTTT AATGGTAATT CTATTAATAA TCGTAGTAAC CATAATGCAG GAAAATCCAA CTATGCATAT TTGAATTTAC AGAGTGGCAT CAACATTGGT AGTTGGCGAC TACGCGATAA CTCAACGTGG AGTTATAACA GTGGGAGTAG CAATTCATCT GACAGCAATA AATGGCAGCA TATCAATACG TCGGCTGAAC GTGACATTAT TCCCTTACGC TCACGGTTAA CGGTAGGTGA TAGTTATACC GATGGCGATA TTTTTGATAG TGTGAACTTT CGTGGCCTCA AAATAAATTC AACAGAAGCG ATGTTGCCCG ATAGCCAACA TGGTTTCGCT CCGGTGATTC ATGGTATTGC CCGTGGCACC GCACAAGTGA GTGTAAAACA GAACGGATAC GATGTTTATC AGACTACTGT TCCACCCGGC CCTTTTACTA TTGATGATAT CAACTCTGCG GCCAATGGTG GTGATCTGCA AGTAACCATA AAAGAGGCAG ACGGCAGTAT TCAGACATTA TATGTTCCTT ATTCGTCTGT TCCGGTTCTC CAACGTGCTG GATATACGCG TTATGCGCTT GCCATGGGGG AATATCGTAG TGGAAATAAC CTGCAAAGCT CCCCCAGGTT CATACAAGGT AGCTTGATGC ATGGACTGGA AGGAAACTGG ACACCTTATG GCGGAATGCA AATTGCAGAA GATTATCAGG CCTTCAACCT TGGTATTGGT AAAGATTTAG GACTTTTTGG TGCCTTTTCT TTCGATATCA CGCAGGCCAA TACGACACTT GCAGATGGCA CCCGTCACAG CGGGCAATCG GTTAAATCCG TCTACAGCAA ATCCTTCTAC CAGACAGGAA CCAATATCCA GGTCGCAGGA TATCGCTATT CTACGCAAGG TTTTTATAAC TTATCCGACA GTGCCTACAG TCGAATGAGT GGTTACACCG TCAAGCCTCC TACCGGAGAC AGCAATGAGC AGACACAATT TATTGATTAT TTTAATCTGT TCTACAGTAA GCGTGGTCAG GAACAAATAA GCATCTCTCA GCAGCTTGGA AATTACGGTG CGACATTTTT CAGTGCCAGT CGCCAAAGTT ACTGGAACAC GTCACGCAGC GACCAGCAAA TATCATTTGG ATTAAATGTG CCGTTTGGTG ATATTACGAC TTCGCTGAAT TACAGCTATT CCAATAATAT ATGGCAAAAC GATCGGGATC ATTTACTCGC TTTTACGCTT AATGTTCCCT TCAGTCATTG GATGCGTACA GACAGTCAGT CGGCATTTCG TAATTCAAAC GCCAGTTACA GTATGTCAAA CGATTTGAAA GGCGGCATGA CCAATCTATC GGGGGTTTAT GGCACTCTGC TGCCGGATAA TAACCTGAAT TATAGCGTTC AGGTCGGTAA CACCCACGGA GGTAATACAT CGTCTGGCAC CAGTGGTTAC AGTACTCTTA ATTATCGTGG AGCTTACGGC AATACTAATG TCGGTTACAG TCGGAGTGGT GACAGCAGCC AGATTTATTA CGGAATGAGT GGTGGGATTA TTGCTCATGC TGATGGCATC ACCTTTGGAC AGCCGCTGGG CGACACAATG GTTCTGGTTA AGGCTCCTGG CGCTGATAAT GTCAAAATAG AGAACCAGAC CGGAATTCAT 
ACCGACTGGC GTGGCTATGC CATATTACCA TTTGCGACAG AATATAGAGA AAATCGTGTC GCTCTTAACG CGAATTCCCT TGCAGATAAT GTTGAACTGG ATGAAACCGT GGTCACTGTC ATCCCAACTC ACGGTGCTAT TGCCAGAGCA ACATTTAATG CACAAATCGG CGGGAAAGTA TTAATGACGT TGAAGTACGG TAATAAAAGC GTTCCATTCG GTGCAATTGT CACTCACGGA GAGAATAAAA ATGGCAGCAT TGTCGCGGAA AACGGTCAGG TTTATCTGAC TGGACTTCCA CAGTCAGGGA AATTACAGGT TTCATGGGGC AATGATAAAA ACTCAAACTG TATTGTCGAT TACAAGCTTC CTGAAGTCTC TCCTGGAACC TTGCTGAACC AGCAGACAGC AATCTGTCGC TAA
|
Protein sequence | MVDASTEFDV GQQHLSLSVP QIYVGRMARG YVSPDLWEEG INAGLLNYSF NGNSINNRSN HNAGKSNYAY LNLQSGINIG SWRLRDNSTW SYNSGSSNSS DSNKWQHINT SAERDIIPLR SRLTVGDSYT DGDIFDSVNF RGLKINSTEA MLPDSQHGFA PVIHGIARGT AQVSVKQNGY DVYQTTVPPG PFTIDDINSA ANGGDLQVTI KEADGSIQTL YVPYSSVPVL QRAGYTRYAL AMGEYRSGNN LQSSPRFIQG SLMHGLEGNW TPYGGMQIAE DYQAFNLGIG KDLGLFGAFS FDITQANTTL ADGTRHSGQS VKSVYSKSFY QTGTNIQVAG YRYSTQGFYN LSDSAYSRMS GYTVKPPTGD SNEQTQFIDY FNLFYSKRGQ EQISISQQLG NYGATFFSAS RQSYWNTSRS DQQISFGLNV PFGDITTSLN YSYSNNIWQN DRDHLLAFTL NVPFSHWMRT DSQSAFRNSN ASYSMSNDLK GGMTNLSGVY GTLLPDNNLN YSVQVGNTHG GNTSSGTSGY STLNYRGAYG NTNVGYSRSG DSSQIYYGMS GGIIAHADGI TFGQPLGDTM VLVKAPGADN VKIENQTGIH TDWRGYAILP FATEYRENRV ALNANSLADN VELDETVVTV IPTHGAIARA TFNAQIGGKV LMTLKYGNKS VPFGAIVTHG ENKNGSIVAE NGQVYLTGLP QSGKLQVSWG NDKNSNCIVD YKLPEVSPGT LLNQQTAICR
|
| |
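The protein sequence can be reproduced from the gene sequence using translation table 11 (the bacterial, archaeal and plant plastid code). The sketch below is an illustration only, under these assumptions: the codon-to-residue assignments of table 11 match the standard code; the listed alternative start codons are those commonly given for table 11; and a start codon in the first position is read as methionine, which is why the leading TTG appears as M in the protein. The names `translate_cds` and `START_CODONS_TABLE_11` are hypothetical helpers, not a library API.

```python
BASES = "TCAG"
# Residues in NCBI order (first, second, third base each cycling through T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumed set of initiation codons for translation table 11.
START_CODONS_TABLE_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(dna), 3):
        codon = dna[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":
            break  # stop codon ends translation
        if pos == 0 and codon in START_CODONS_TABLE_11:
            residue = "M"  # an initiating codon is read as methionine
        residues.append(residue)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
print(translate_cds("TTGGTCGACG CATCAACTGA GTTTGATGTT"))  # MVDASTEFDV
```

The output matches the first ten residues of the protein sequence in this record. Passing the complete gene sequence, with its spaces and line break left in place, should yield the full 740-residue protein under the same assumptions.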