Gene ECH74115_5161 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5161 
Symbol 
ID6966625 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4803732 
End bp4806266 
Gene Length2535 bp 
Protein Length844 aa 
Translation table11 
GC content48% 
IMG OID643388829 
Productfimbrial usher protein 
Protein accessionYP_002273255 
Protein GI209397496 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3188] P pilus assembly protein, porin PapC 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value0.711297 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCTCTT CGCATCTTTT CATTACGCTT GCATCGGGCA TATGTCTGCT CTGTTCCATA 
TCTGCTTTTG CCCGGGATAG CTTGTTCAAC CCCAGATTAC TGGAACTGGA TCATCCTGCG
GATAATATTG ATATTCACCA GTTCAACCGT TCGAATACCT TACCTGCGGG AACATACAAA
GTTGATGTGA TGATCAACGG CATGCTCTTC GAACGCCAGG AAGTTAAATT CGCCCAGGAT
AACCCTGATG CTGAACTCCA CCCATGCTAC GTGGCGATAA AAAACGTGCT GGCGACCTAT
GGTATAAAAG TTGATGCGAT AAAATCTCTG GCGAATGTTG ATGACAAAAC ATGCGTAAAT
CCAGTTCCGC TGATCGACGG GGCTACCTGG TTACTGGACG CCAGTAAACT TGCATTGAAT
ATTACTATTC CGCAAATTTA TCTCAACAAT GCAGTTAATG GTTATATCAG CCCTTCCCGT
TGGGATCAGG GGATCAATGC CATGATGATG AATTATGATT TTTCGGCATC GCATACCATC
CGGTCAAATT ATGACGACGA CGATGACAGT TATTATCTGA ATTTGCGTAA TGGTATTAAT
TTAGGCGCAT GGCGTTTTCG TAATTACAGC ACCCTGAATT CTTATGACGG TAATGTGGAC
TACCATTCCG TCAGTAATTA CATTCAGCGC GACATCATGG CATTACGTAG CCAGATTATG
ATTGGCGATA CCTGGACGGC AAGCGATGTA TTTGATAGTA CACAGGTGCG TGGCGTGCGG
CTGTATACCG ATGACGATAT GTTGCCCTCC AGCCAGAACG GCTTTGCGCC AGTGGTACAT
GGGATTGCGA AAACTAACGC CACGGTGATC ATCAAACAAA ACGGCTACGT TATTTATCAA
TCAGCCGTAC CACAGGGCGC ATTTGCCCTC ACCGACTTAA ACACGACCAG TAGCGGCGGC
GATCTCGATG TCACTATCAA AGAAGAAGAT GGCAGCGAGC AGCACTTTAT TCAGCCATTT
ACTTCACTGG CCATTCTCAA GCGTGAAGGT CAGACCGATG TAGACCTTAG CATTGGAGAA
GTGCGCGACG AAAGCGGCTT TACGCCTGAG GTCTTGCAGT TACAAGCAAT GCACGGTTTC
CCTTTGGGAA TAACTTTGTA TGGCGGAACA CAATTGGCAA ATGATTACGC TTCTGCCGCG
CTGGGTATTG GTAAAGATAT GGGGGCGCTG GGCGCGATTT CTTTTGACGT GACTCATGCC
CGCTCGCAGT TTGACTACGA CGATAATGAG AGTGGTCAAT CGTATCGTTT TCTCTATTCC
AAACGTTTTG AAGACACCAA TACCACCTTT CGTCTGGTGG GTTATCGCTA CTCTATGGAG
GGGTTCTACA CCCTCAATGA ATGGGTGTCG CGACAGGATA ATGATTCTGA TTTCTGGGTA
ACGGGCAACC GTCGCAGCCG CTTCGAAGGT ACCTGGACGC AATCTTTCAC GCCAGGCTGG
GGCAATATTT ATTTAACATT CAGTCGACAG GAATACTGGC AGACCGATGA GGTCGAACGT
TTATTACAGT TCGGCTATAA CAACAACTGG CGAAACATCT CCTGGAACGT TTCCTGGAAC
TATACGGACT CGATCAAGCG CTCATTGGGC AACCATCATG ATGATAACAA TGATGATTTC
GGCAAAGAAC AGATTTTCAT GTTCTCAATG TCGATACCGC TATCGTGCTG GATGGAAGAC
AGCTACGTCA ACTATTCGTT AACGCAAAAC AACCACCATG AAAGCACGAT GCAGGTCGGT
CTGAACGGAA CGATGCTGGA AGGGCGTAAC CTGTCTTATA ACGTACAGGA ATCGTGGATG
CACTCTCCTG ATGACTCCTA CAGCGGCAAT GCCGGAATGA CCTATGACGG GACTTATGGC
TCGGTCAATG GTAGCTATTC CTGGAGCCGT GACTCCCAAC ATTTTGATTA TGGCGCCAGA
GGCGGCGTGC TGGTGCATAG TGACGGAGTG ACCTTCTCGC AGGAACTGGG CGAAACGGTG
GCATTGGTCA AAGCGCCGGG CGCAGAAGGC CTGTCCATTG AAAACGCCAC CGGGATTTCT
ACCGACTGGC GTGGTTATAC CGTAAAAACG CAGCTTAGCC CGTATGACGA AAACCGCGTG
GCATTGAACA GCGACTATTT CTCCAAAGCC AATATTGAAC TGGAAAACAC CGTCATCAAC
CTGGTACCAA CGCGCGGTGC GGTGGTGAAA GCCGAATTTG TCACCCATGT CGGTTATCGC
GTGCTATTTA ACGTCCGCCA GGTCAACGGT AAACCAATAA TGTTTGGCGC GATGGCAACC
GCCTCTCTCG AAACGGGCAC AGTCACCGGG ATTGTCGGTG ATAACGGCGA ACTGTATCTC
TCCGGGATGC CTGAAAAAGG CGAGTTTTTA TTGAGTTGGG GACAAGCTGC GGATGAAAAA
TGTAAGGCGG CCTATCACAT CACCCATAAA CCTGATGATA CCAGCCTGGT TCAAATGGAT
GCGATTTGTC GCTAA
 
Protein sequence
MASSHLFITL ASGICLLCSI SAFARDSLFN PRLLELDHPA DNIDIHQFNR SNTLPAGTYK 
VDVMINGMLF ERQEVKFAQD NPDAELHPCY VAIKNVLATY GIKVDAIKSL ANVDDKTCVN
PVPLIDGATW LLDASKLALN ITIPQIYLNN AVNGYISPSR WDQGINAMMM NYDFSASHTI
RSNYDDDDDS YYLNLRNGIN LGAWRFRNYS TLNSYDGNVD YHSVSNYIQR DIMALRSQIM
IGDTWTASDV FDSTQVRGVR LYTDDDMLPS SQNGFAPVVH GIAKTNATVI IKQNGYVIYQ
SAVPQGAFAL TDLNTTSSGG DLDVTIKEED GSEQHFIQPF TSLAILKREG QTDVDLSIGE
VRDESGFTPE VLQLQAMHGF PLGITLYGGT QLANDYASAA LGIGKDMGAL GAISFDVTHA
RSQFDYDDNE SGQSYRFLYS KRFEDTNTTF RLVGYRYSME GFYTLNEWVS RQDNDSDFWV
TGNRRSRFEG TWTQSFTPGW GNIYLTFSRQ EYWQTDEVER LLQFGYNNNW RNISWNVSWN
YTDSIKRSLG NHHDDNNDDF GKEQIFMFSM SIPLSCWMED SYVNYSLTQN NHHESTMQVG
LNGTMLEGRN LSYNVQESWM HSPDDSYSGN AGMTYDGTYG SVNGSYSWSR DSQHFDYGAR
GGVLVHSDGV TFSQELGETV ALVKAPGAEG LSIENATGIS TDWRGYTVKT QLSPYDENRV
ALNSDYFSKA NIELENTVIN LVPTRGAVVK AEFVTHVGYR VLFNVRQVNG KPIMFGAMAT
ASLETGTVTG IVGDNGELYL SGMPEKGEFL LSWGQAADEK CKAAYHITHK PDDTSLVQMD
AICR