Gene EcSMS35_1666 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1666 
Symbol 
ID6142997 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1659328 
End bp1661943 
Gene Length2616 bp 
Protein Length871 aa 
Translation table11 
GC content44% 
IMG OID641616542 
Productfimbrial usher family protein 
Protein accessionYP_001743720 
Protein GI170679788 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3188] P pilus assembly protein, porin PapC 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value0.504139 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATCAGG TGTTACTACT GCCTCGCTTT GCACGGCTTA CCATTGCCCT TGGCTTAGCA 
ACTGCGGTAT TTCCCGTTGA CGCAGAGTTT TATTTTAATC CACGGTTTTT AAGCAATGAT
CTTGCTGAAT CTGTCGATTT ATCCGCCTTT ACTAAGGGAC GCGAGGCACC ACCTGGTACA
TATCGGGTCG ACATTTATTT GAACGACGAA TTTATGACTA GTCGGGATAT TACTTTTATT
GCAGATGATA ATAATGCAGA TCTCATCCCA TGCCTGAGTA CAGACCTTCT TGTCAGTCTT
GGCATAAAAA AATCAGCATT ATTAGACAAT AAGGAACATT CGGCAGAAAA ACATGTGTCG
GACAACAGCG CATGCACACC ACTTCGGGAT CGTTTGGCTG ACGCATCATC TGAGTTTAAT
GTTGGTCAGC AACATCTATC ACTCTCCGTT CCGCAAATTT ATGTTGGCAG AATGGCTCGC
GGCTATGTTT CTCCGGATCT GTGGGAAGAA GGAATAAATG CTGGGCTACT AAACTATAGT
TTTAATGGTA ATTCTATTAA TAATCGTGGC AACCATAATG CAGGAAAATC CAACTATGCA
TATTTGAATT TACAGAGTGG CATCAACATT GGTAGTTGGC GACTGCGCGA TAACTCAACG
TGGAGTTATA ACAGTGGGAG TAGCAATTCA TCTGACAGCA ATAAATGGCA GCATATCAAT
ACGTCGGCTG AACGTGACAT TATTCCCTTA CGCTCACGCT TAACGGTAGG TGATAGTTAT
ACCGATGGCG ATATTTTTGA TAGTGTGAAC TTTCGTGGCC TCAAAATAAA TTCAACAGAA
GCGATGTTGC CCGATAGCCA ACATGGTTTC GCTCCGGTGA TTCATGGTAT TGCCCGCAGC
ACCGCACAAG TGAGTGTAAA ACAGAATGGA TACGATGTTT ATCAGACTAC TGTCCCACCC
GGCCCTTTTA CTATTGATGA TATCAACTCT GCGGCCAATG GTGGTGATCT GCAAGTAACC
ATAAAAGAGG CAGACGGCAG TATTCAGACA TTATATGTTC CTTATTCGTC TGTTCCGGTT
CTCCAACGTG CTGGATATAC GCGTTATGCG CTTGCCATGG GGGAATATCG TAGTGGAAAT
AACCTGCAAA GCTCCCCCAA GTTCATACAA GGTAGCTTGA TGCATGGACT GGAAGGAAAC
TGGACACCTT ATGGCGGAAT GCAAATTGCA GAAGATTATC AGGCCTTCAA CCTTGGTATT
GGTAAAGATT TAGGACTTTT TGGTGCCTTT TCTTTCGATA TCACGCAGGC CAATACGACA
CTTGCAGATG GCACCCGTCA CAGCGGGCAA TCGATTAAAT CCGTCTACAG CAAATCCTTC
TACCAGACGG GAACCAATAT CCAGGTCGCA GGATATCGCT ATTCTACGCA AGGTTTTTAT
AACTTATCCG ACAGTGCCTA CAGTCGAATG AGTGGTTACA CCGTCAAGCC TCCTACCGGA
GACAGCAATG AGCAGACACA ATTTATTGAT TATTTTAATC TGTTCTACAG TAAGCGTGGT
CAGGAACAAA TAAGCATCTC TCAGCAGCTT GGAAATTACG GTACGACATT TTTCAGTGCC
AGTCGCCAAA GTTACTGGAA CACGTCACGC AGCGACCAGC AAATATCATT TGGATTAAAT
GTGCCGTTTA GTGATATTAC GACTTCGCTG AATTACAGCT ATTCCAATAA TATATGGCAA
AACGATCGGG ATCATTTACT TGCTTTTACG CTTAATGTTC CCTTCAGTCA TTGGATGCGT
ACAGACAGTC AGTCGGCATT TCGTAACTCA AACGCCAGTT ACAGTATGTC AAACGATTTG
AAAGGCGGCA TGACCAACCT ATCAGGGGTT TATGGCACTC TGCTGCCGGA TAATAACCTG
AATTATAGTG TTCAGGTCGG TAACACCCAC GGGGGTAATA CATCGTCTGG CACCAGTGGT
TACAGTTCTC TTAATTATCG TGGAGCTTAC GGTAATACGA ATATCGGTTA CAGTCGGAGT
GGTGACAGCA GCCAGATTTA TTACGGAATG AGTGGTGGCA TTATTGCTCA TGCTGATGGC
ATCACCTTTG GACAGCCACT GGGCGACACA ATGGTTCTGG TTAAGGCTCC TGGCGCTGAT
AATGTCAAAA TAGAGAACCA GACCGGAATT CATACCGACT GGCGTGGCTA TGCCATATTA
CCATTTGCGA CAGAATATAG AGAAAATCGT GTCGCTCTTA ACGCGAATTC CCTTGCAGAT
AATGTTGAAC TGGATGAAAC CGTAGTCACT GTCATCCCAA CTCACGGTGC TATTGCCAGA
GCAACATTTA ATGCACAAAT CGGCGGGAAA GTATTAATGA CGTTGAAGTA CGGTAATAAA
AGCGTTCCAT TCGGTGCAAT TGTCACTCAC GGAGAGAATA AAAATGGCAG CATTGTCGCG
GAAAATGGTC AGGTTTATCT GACTGGACTT CCACAGTCAG GGAAATTACA GGTTTCATGG
GGCAATGATA AAAACTCAAA CTGTATTGTC GATTACAAGC TTCCTGCAGT CTCTCCTGGA
ACCTTGCTGA ACCAGCAGAC AGCAATCTGT CGCTAA
 
Protein sequence
MHQVLLLPRF ARLTIALGLA TAVFPVDAEF YFNPRFLSND LAESVDLSAF TKGREAPPGT 
YRVDIYLNDE FMTSRDITFI ADDNNADLIP CLSTDLLVSL GIKKSALLDN KEHSAEKHVS
DNSACTPLRD RLADASSEFN VGQQHLSLSV PQIYVGRMAR GYVSPDLWEE GINAGLLNYS
FNGNSINNRG NHNAGKSNYA YLNLQSGINI GSWRLRDNST WSYNSGSSNS SDSNKWQHIN
TSAERDIIPL RSRLTVGDSY TDGDIFDSVN FRGLKINSTE AMLPDSQHGF APVIHGIARS
TAQVSVKQNG YDVYQTTVPP GPFTIDDINS AANGGDLQVT IKEADGSIQT LYVPYSSVPV
LQRAGYTRYA LAMGEYRSGN NLQSSPKFIQ GSLMHGLEGN WTPYGGMQIA EDYQAFNLGI
GKDLGLFGAF SFDITQANTT LADGTRHSGQ SIKSVYSKSF YQTGTNIQVA GYRYSTQGFY
NLSDSAYSRM SGYTVKPPTG DSNEQTQFID YFNLFYSKRG QEQISISQQL GNYGTTFFSA
SRQSYWNTSR SDQQISFGLN VPFSDITTSL NYSYSNNIWQ NDRDHLLAFT LNVPFSHWMR
TDSQSAFRNS NASYSMSNDL KGGMTNLSGV YGTLLPDNNL NYSVQVGNTH GGNTSSGTSG
YSSLNYRGAY GNTNIGYSRS GDSSQIYYGM SGGIIAHADG ITFGQPLGDT MVLVKAPGAD
NVKIENQTGI HTDWRGYAIL PFATEYRENR VALNANSLAD NVELDETVVT VIPTHGAIAR
ATFNAQIGGK VLMTLKYGNK SVPFGAIVTH GENKNGSIVA ENGQVYLTGL PQSGKLQVSW
GNDKNSNCIV DYKLPAVSPG TLLNQQTAIC R