Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4462 |
Symbol | |
ID | 6972085 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 4133362 |
End bp | 4135953 |
Gene Length | 2592 bp |
Protein Length | 863 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 643388179 |
Product | fimbrial usher family protein |
Protein accession | YP_002272616 |
Protein GI | 209400215 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3188] P pilus assembly protein, porin PapC |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 65 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTATGG CGGCAGTTCC GGGACAGAAA CTCGTCCACT GCAATAACAA ATATAAAAAC ACAGGTCATC AGGGAATGCC ACAACGACAC CACCAGGGAC ATAAACGCAC ACCGAAACAG TTGGCGCTCA TTATCAAACG CTGTTTGCCG ATGGTGCTCA CTGGCAGCGG CATGCTTTGC ACTACCGCTA ACGCCGAAGA GTATTATTTC GACCCCATTA TGCTGGAAAC CACAAAAAGT GGTATGCAAA CAACCGATCT GTCACGTTTT TCAAAAAAAT ACGCACAACT ACCAGGAACT TATCAGGTTG ATATCTGGCT GAATAAAAAG AAGGTTTCAC AGAAAAAAAT TACATTTACC GCCAATGCAG AGCAACTTCT GCAGCCACAG TTTACGGTAG AACAACTACG TGAGCTGGGT ATTAAGGTGG ATGAAATCCC GGCGCTGGCT GAAAAAGATG ACGATAGCGT GATCAACTCG CTTGAACAAA TCATTCCCGG TACAGCTGCT GAATTTGATT TCAATCATCA GCGACTTAAT TTGAGCATTC CCCAAATTGC ACTGTACCGT GATGCAAGAG GTTACGTCTC CCCTTCTCGT TGGGACGATG GTATACCAAC GCTGTTTACC AACTACTCGT TTACAGGTTC TGATAACCGT TACCGCCAGG GCAATCGTAG CCAACGACAG TACCTAAATA TGCAAAATGG TGCCAATTTT GGCCCCTGGC GATTACGTAA CTATTCTACG TGGACACGCA ACGATCAGGC GTCAAGCTGG AACACTATCA GTAGTTATTT ACAACGTGAT ATCAAGGCGT TGAAGTCTCA GTTGCTTCTG GGAGAAAGCG CCACCAGCGG CAGTATTTTT TCCAGCTACA ACTTTACTGG CGTGCAACTC GCTTCCGACG ATAATATGTT GCCAAACAGC CAGCGCGGAT TTGCCCCAAC GGTACGCGGT ATCGCAAACA GTAGTGCAAT CGTGACTATC AGGCAAAATG GTTATGTGAT CTATCAAAGC AACGTGCCAG CGGGTGCCTT TGAAATTAAC GATCTCTACC CCTCTTCCAA CAGCGGCGAT TTAGAAGTCA CGATTGAAGA AAGTGACGGT ACGCAACGTC GCTTTATCCA GCCTTATTCT TCATTACCCA TGATGCAGCG ACCTGGGCAT CTAAAATATA GCGCGACCGC TGGACGCTAT CGCGCTGATG CAAACAGTGA TAGCAAGGAA CCCGAATTTG CTGAAGCCAC GGCAATATAT GGTTTGAATA ATACTTTTAC GCTGTATGGC GGCCTGCTCG GTTCTGAAGA TTATTATGCG CTGGGGATCG GTATCGGCGG CACACTTGGC GCACTGGGCG CGTTGTCGAT GGATATCAAC AGAGCTGACA CCCAATTCGA TAACCAGCAC TCTTTTCATG GCTATCAATG GCGTACGCAG TACATCAAAG ATATCCCGGA AACCAACACC AATATCGCTG TCAGCTACTA TCGCTATACC AACGATGGCT ATTTTAGTTT TGATGAAGCC AATACCCGCA ATTGGGACTA TAACAGTCGC CAAAAAAGTG AAATTCAATT CAACATCAGC CAGACAATAT TTGATGGGGT AAGTCTGTAT GCCTCCGGTT CACAGCAAGA CTATTGGGGC AATAACGAGA AAAACAGGAA TATCTCTGTT GGGGTTTCCG GCCAGCAATG GGGAATTGGT TACAGCCTGA ATTATCAATA CAGCCGCTAC ACTGATCAAA ATAATGACCG CGCACTCTCT TTGAATCTCA GTATTCCGTT AGAACGCTGG TTACCGCGTA GCCGGGTTTC CTATCAGATG ACCAGCCAGA AAGATCGCCC AACCCAACAT GAAATGCGTC TTGATGGCTC ACTGCTGGAT GATGGTCGCC TGAGCTATAG TCTGGAACAA AGTCTGGATG ACGATAACAA CCATAACAGT AGCGTGAACG CCAGTTACCG TTCACCTTAT GGAACCTTCA GTGCCGGATA CAGTTACGGT AATGACAGTA GCCAATACAA TTACGGCGTT ACCGGCGGCG TGGTTATCCA TCCTCATGGT GTGACACTCT CGCAATATCT GGGCAACGCT TTTGCGCTTA TTGATGCTAA CGGGGCATCT GGCGTGAGGA TACAAAACTA TCCGGGGATT GCTACTGATC CCTTTGGCTA TGCAGTGGTT CCTTATCTCA CAACTTATCA GGAAAACCGT CTCTCGGTAG ATACTACGCA GCTGCCCGAT AACGTCGATC TTGAACAAAC AACACAGTTT GTGGTGCCCA ACAGAGGTGC AATGGTAGCG GCGCGTTTCA ACGCCAATAT CGGTTATCGC GTACTTGTTA CAGTCAGCGA TCGCAACGGT AAACCGTTGC CCTTTGGCGC TCTTGCCAGC AACGATGATA CGGGGCAACA AAGTATCGTC GATGAGGGCG GCATACTATA TCTCTCTGGG ATATCGAGTA AATCACAAAG CTGGACTGTA CGCTGGGGAA ATCAGGCAGA TCAACAATGT CAGTTTGCTT TTAGTACACC GGATTCAGAA CCAACAACCT CTGTATTACA AGGCACAGCG CAGTGCCATT AA
|
Protein sequence | MIMAAVPGQK LVHCNNKYKN TGHQGMPQRH HQGHKRTPKQ LALIIKRCLP MVLTGSGMLC TTANAEEYYF DPIMLETTKS GMQTTDLSRF SKKYAQLPGT YQVDIWLNKK KVSQKKITFT ANAEQLLQPQ FTVEQLRELG IKVDEIPALA EKDDDSVINS LEQIIPGTAA EFDFNHQRLN LSIPQIALYR DARGYVSPSR WDDGIPTLFT NYSFTGSDNR YRQGNRSQRQ YLNMQNGANF GPWRLRNYST WTRNDQASSW NTISSYLQRD IKALKSQLLL GESATSGSIF SSYNFTGVQL ASDDNMLPNS QRGFAPTVRG IANSSAIVTI RQNGYVIYQS NVPAGAFEIN DLYPSSNSGD LEVTIEESDG TQRRFIQPYS SLPMMQRPGH LKYSATAGRY RADANSDSKE PEFAEATAIY GLNNTFTLYG GLLGSEDYYA LGIGIGGTLG ALGALSMDIN RADTQFDNQH SFHGYQWRTQ YIKDIPETNT NIAVSYYRYT NDGYFSFDEA NTRNWDYNSR QKSEIQFNIS QTIFDGVSLY ASGSQQDYWG NNEKNRNISV GVSGQQWGIG YSLNYQYSRY TDQNNDRALS LNLSIPLERW LPRSRVSYQM TSQKDRPTQH EMRLDGSLLD DGRLSYSLEQ SLDDDNNHNS SVNASYRSPY GTFSAGYSYG NDSSQYNYGV TGGVVIHPHG VTLSQYLGNA FALIDANGAS GVRIQNYPGI ATDPFGYAVV PYLTTYQENR LSVDTTQLPD NVDLEQTTQF VVPNRGAMVA ARFNANIGYR VLVTVSDRNG KPLPFGALAS NDDTGQQSIV DEGGILYLSG ISSKSQSWTV RWGNQADQQC QFAFSTPDSE PTTSVLQGTA QCH
|
| |