Gene ECH74115_4462 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4462 
Symbol 
ID6972085 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4133362 
End bp4135953 
Gene Length2592 bp 
Protein Length863 aa 
Translation table11 
GC content47% 
IMG OID643388179 
Productfimbrial usher family protein 
Protein accessionYP_002272616 
Protein GI209400215 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3188] P pilus assembly protein, porin PapC 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTATGG CGGCAGTTCC GGGACAGAAA CTCGTCCACT GCAATAACAA ATATAAAAAC 
ACAGGTCATC AGGGAATGCC ACAACGACAC CACCAGGGAC ATAAACGCAC ACCGAAACAG
TTGGCGCTCA TTATCAAACG CTGTTTGCCG ATGGTGCTCA CTGGCAGCGG CATGCTTTGC
ACTACCGCTA ACGCCGAAGA GTATTATTTC GACCCCATTA TGCTGGAAAC CACAAAAAGT
GGTATGCAAA CAACCGATCT GTCACGTTTT TCAAAAAAAT ACGCACAACT ACCAGGAACT
TATCAGGTTG ATATCTGGCT GAATAAAAAG AAGGTTTCAC AGAAAAAAAT TACATTTACC
GCCAATGCAG AGCAACTTCT GCAGCCACAG TTTACGGTAG AACAACTACG TGAGCTGGGT
ATTAAGGTGG ATGAAATCCC GGCGCTGGCT GAAAAAGATG ACGATAGCGT GATCAACTCG
CTTGAACAAA TCATTCCCGG TACAGCTGCT GAATTTGATT TCAATCATCA GCGACTTAAT
TTGAGCATTC CCCAAATTGC ACTGTACCGT GATGCAAGAG GTTACGTCTC CCCTTCTCGT
TGGGACGATG GTATACCAAC GCTGTTTACC AACTACTCGT TTACAGGTTC TGATAACCGT
TACCGCCAGG GCAATCGTAG CCAACGACAG TACCTAAATA TGCAAAATGG TGCCAATTTT
GGCCCCTGGC GATTACGTAA CTATTCTACG TGGACACGCA ACGATCAGGC GTCAAGCTGG
AACACTATCA GTAGTTATTT ACAACGTGAT ATCAAGGCGT TGAAGTCTCA GTTGCTTCTG
GGAGAAAGCG CCACCAGCGG CAGTATTTTT TCCAGCTACA ACTTTACTGG CGTGCAACTC
GCTTCCGACG ATAATATGTT GCCAAACAGC CAGCGCGGAT TTGCCCCAAC GGTACGCGGT
ATCGCAAACA GTAGTGCAAT CGTGACTATC AGGCAAAATG GTTATGTGAT CTATCAAAGC
AACGTGCCAG CGGGTGCCTT TGAAATTAAC GATCTCTACC CCTCTTCCAA CAGCGGCGAT
TTAGAAGTCA CGATTGAAGA AAGTGACGGT ACGCAACGTC GCTTTATCCA GCCTTATTCT
TCATTACCCA TGATGCAGCG ACCTGGGCAT CTAAAATATA GCGCGACCGC TGGACGCTAT
CGCGCTGATG CAAACAGTGA TAGCAAGGAA CCCGAATTTG CTGAAGCCAC GGCAATATAT
GGTTTGAATA ATACTTTTAC GCTGTATGGC GGCCTGCTCG GTTCTGAAGA TTATTATGCG
CTGGGGATCG GTATCGGCGG CACACTTGGC GCACTGGGCG CGTTGTCGAT GGATATCAAC
AGAGCTGACA CCCAATTCGA TAACCAGCAC TCTTTTCATG GCTATCAATG GCGTACGCAG
TACATCAAAG ATATCCCGGA AACCAACACC AATATCGCTG TCAGCTACTA TCGCTATACC
AACGATGGCT ATTTTAGTTT TGATGAAGCC AATACCCGCA ATTGGGACTA TAACAGTCGC
CAAAAAAGTG AAATTCAATT CAACATCAGC CAGACAATAT TTGATGGGGT AAGTCTGTAT
GCCTCCGGTT CACAGCAAGA CTATTGGGGC AATAACGAGA AAAACAGGAA TATCTCTGTT
GGGGTTTCCG GCCAGCAATG GGGAATTGGT TACAGCCTGA ATTATCAATA CAGCCGCTAC
ACTGATCAAA ATAATGACCG CGCACTCTCT TTGAATCTCA GTATTCCGTT AGAACGCTGG
TTACCGCGTA GCCGGGTTTC CTATCAGATG ACCAGCCAGA AAGATCGCCC AACCCAACAT
GAAATGCGTC TTGATGGCTC ACTGCTGGAT GATGGTCGCC TGAGCTATAG TCTGGAACAA
AGTCTGGATG ACGATAACAA CCATAACAGT AGCGTGAACG CCAGTTACCG TTCACCTTAT
GGAACCTTCA GTGCCGGATA CAGTTACGGT AATGACAGTA GCCAATACAA TTACGGCGTT
ACCGGCGGCG TGGTTATCCA TCCTCATGGT GTGACACTCT CGCAATATCT GGGCAACGCT
TTTGCGCTTA TTGATGCTAA CGGGGCATCT GGCGTGAGGA TACAAAACTA TCCGGGGATT
GCTACTGATC CCTTTGGCTA TGCAGTGGTT CCTTATCTCA CAACTTATCA GGAAAACCGT
CTCTCGGTAG ATACTACGCA GCTGCCCGAT AACGTCGATC TTGAACAAAC AACACAGTTT
GTGGTGCCCA ACAGAGGTGC AATGGTAGCG GCGCGTTTCA ACGCCAATAT CGGTTATCGC
GTACTTGTTA CAGTCAGCGA TCGCAACGGT AAACCGTTGC CCTTTGGCGC TCTTGCCAGC
AACGATGATA CGGGGCAACA AAGTATCGTC GATGAGGGCG GCATACTATA TCTCTCTGGG
ATATCGAGTA AATCACAAAG CTGGACTGTA CGCTGGGGAA ATCAGGCAGA TCAACAATGT
CAGTTTGCTT TTAGTACACC GGATTCAGAA CCAACAACCT CTGTATTACA AGGCACAGCG
CAGTGCCATT AA
 
Protein sequence
MIMAAVPGQK LVHCNNKYKN TGHQGMPQRH HQGHKRTPKQ LALIIKRCLP MVLTGSGMLC 
TTANAEEYYF DPIMLETTKS GMQTTDLSRF SKKYAQLPGT YQVDIWLNKK KVSQKKITFT
ANAEQLLQPQ FTVEQLRELG IKVDEIPALA EKDDDSVINS LEQIIPGTAA EFDFNHQRLN
LSIPQIALYR DARGYVSPSR WDDGIPTLFT NYSFTGSDNR YRQGNRSQRQ YLNMQNGANF
GPWRLRNYST WTRNDQASSW NTISSYLQRD IKALKSQLLL GESATSGSIF SSYNFTGVQL
ASDDNMLPNS QRGFAPTVRG IANSSAIVTI RQNGYVIYQS NVPAGAFEIN DLYPSSNSGD
LEVTIEESDG TQRRFIQPYS SLPMMQRPGH LKYSATAGRY RADANSDSKE PEFAEATAIY
GLNNTFTLYG GLLGSEDYYA LGIGIGGTLG ALGALSMDIN RADTQFDNQH SFHGYQWRTQ
YIKDIPETNT NIAVSYYRYT NDGYFSFDEA NTRNWDYNSR QKSEIQFNIS QTIFDGVSLY
ASGSQQDYWG NNEKNRNISV GVSGQQWGIG YSLNYQYSRY TDQNNDRALS LNLSIPLERW
LPRSRVSYQM TSQKDRPTQH EMRLDGSLLD DGRLSYSLEQ SLDDDNNHNS SVNASYRSPY
GTFSAGYSYG NDSSQYNYGV TGGVVIHPHG VTLSQYLGNA FALIDANGAS GVRIQNYPGI
ATDPFGYAVV PYLTTYQENR LSVDTTQLPD NVDLEQTTQF VVPNRGAMVA ARFNANIGYR
VLVTVSDRNG KPLPFGALAS NDDTGQQSIV DEGGILYLSG ISSKSQSWTV RWGNQADQQC
QFAFSTPDSE PTTSVLQGTA QCH