Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1666 |
Symbol | |
ID | 6142997 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1659328 |
End bp | 1661943 |
Gene Length | 2616 bp |
Protein Length | 871 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 641616542 |
Product | fimbrial usher family protein |
Protein accession | YP_001743720 |
Protein GI | 170679788 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3188] P pilus assembly protein, porin PapC |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 0.504139 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATCAGG TGTTACTACT GCCTCGCTTT GCACGGCTTA CCATTGCCCT TGGCTTAGCA ACTGCGGTAT TTCCCGTTGA CGCAGAGTTT TATTTTAATC CACGGTTTTT AAGCAATGAT CTTGCTGAAT CTGTCGATTT ATCCGCCTTT ACTAAGGGAC GCGAGGCACC ACCTGGTACA TATCGGGTCG ACATTTATTT GAACGACGAA TTTATGACTA GTCGGGATAT TACTTTTATT GCAGATGATA ATAATGCAGA TCTCATCCCA TGCCTGAGTA CAGACCTTCT TGTCAGTCTT GGCATAAAAA AATCAGCATT ATTAGACAAT AAGGAACATT CGGCAGAAAA ACATGTGTCG GACAACAGCG CATGCACACC ACTTCGGGAT CGTTTGGCTG ACGCATCATC TGAGTTTAAT GTTGGTCAGC AACATCTATC ACTCTCCGTT CCGCAAATTT ATGTTGGCAG AATGGCTCGC GGCTATGTTT CTCCGGATCT GTGGGAAGAA GGAATAAATG CTGGGCTACT AAACTATAGT TTTAATGGTA ATTCTATTAA TAATCGTGGC AACCATAATG CAGGAAAATC CAACTATGCA TATTTGAATT TACAGAGTGG CATCAACATT GGTAGTTGGC GACTGCGCGA TAACTCAACG TGGAGTTATA ACAGTGGGAG TAGCAATTCA TCTGACAGCA ATAAATGGCA GCATATCAAT ACGTCGGCTG AACGTGACAT TATTCCCTTA CGCTCACGCT TAACGGTAGG TGATAGTTAT ACCGATGGCG ATATTTTTGA TAGTGTGAAC TTTCGTGGCC TCAAAATAAA TTCAACAGAA GCGATGTTGC CCGATAGCCA ACATGGTTTC GCTCCGGTGA TTCATGGTAT TGCCCGCAGC ACCGCACAAG TGAGTGTAAA ACAGAATGGA TACGATGTTT ATCAGACTAC TGTCCCACCC GGCCCTTTTA CTATTGATGA TATCAACTCT GCGGCCAATG GTGGTGATCT GCAAGTAACC ATAAAAGAGG CAGACGGCAG TATTCAGACA TTATATGTTC CTTATTCGTC TGTTCCGGTT CTCCAACGTG CTGGATATAC GCGTTATGCG CTTGCCATGG GGGAATATCG TAGTGGAAAT AACCTGCAAA GCTCCCCCAA GTTCATACAA GGTAGCTTGA TGCATGGACT GGAAGGAAAC TGGACACCTT ATGGCGGAAT GCAAATTGCA GAAGATTATC AGGCCTTCAA CCTTGGTATT GGTAAAGATT TAGGACTTTT TGGTGCCTTT TCTTTCGATA TCACGCAGGC CAATACGACA CTTGCAGATG GCACCCGTCA CAGCGGGCAA TCGATTAAAT CCGTCTACAG CAAATCCTTC TACCAGACGG GAACCAATAT CCAGGTCGCA GGATATCGCT ATTCTACGCA AGGTTTTTAT AACTTATCCG ACAGTGCCTA CAGTCGAATG AGTGGTTACA CCGTCAAGCC TCCTACCGGA GACAGCAATG AGCAGACACA ATTTATTGAT TATTTTAATC TGTTCTACAG TAAGCGTGGT CAGGAACAAA TAAGCATCTC TCAGCAGCTT GGAAATTACG GTACGACATT TTTCAGTGCC AGTCGCCAAA GTTACTGGAA CACGTCACGC AGCGACCAGC AAATATCATT TGGATTAAAT GTGCCGTTTA GTGATATTAC GACTTCGCTG AATTACAGCT ATTCCAATAA TATATGGCAA AACGATCGGG ATCATTTACT TGCTTTTACG CTTAATGTTC CCTTCAGTCA TTGGATGCGT ACAGACAGTC AGTCGGCATT TCGTAACTCA AACGCCAGTT ACAGTATGTC AAACGATTTG AAAGGCGGCA TGACCAACCT ATCAGGGGTT TATGGCACTC TGCTGCCGGA TAATAACCTG AATTATAGTG TTCAGGTCGG TAACACCCAC GGGGGTAATA CATCGTCTGG CACCAGTGGT TACAGTTCTC TTAATTATCG TGGAGCTTAC GGTAATACGA ATATCGGTTA CAGTCGGAGT GGTGACAGCA GCCAGATTTA TTACGGAATG AGTGGTGGCA TTATTGCTCA TGCTGATGGC ATCACCTTTG GACAGCCACT GGGCGACACA ATGGTTCTGG TTAAGGCTCC TGGCGCTGAT AATGTCAAAA TAGAGAACCA GACCGGAATT CATACCGACT GGCGTGGCTA TGCCATATTA CCATTTGCGA CAGAATATAG AGAAAATCGT GTCGCTCTTA ACGCGAATTC CCTTGCAGAT AATGTTGAAC TGGATGAAAC CGTAGTCACT GTCATCCCAA CTCACGGTGC TATTGCCAGA GCAACATTTA ATGCACAAAT CGGCGGGAAA GTATTAATGA CGTTGAAGTA CGGTAATAAA AGCGTTCCAT TCGGTGCAAT TGTCACTCAC GGAGAGAATA AAAATGGCAG CATTGTCGCG GAAAATGGTC AGGTTTATCT GACTGGACTT CCACAGTCAG GGAAATTACA GGTTTCATGG GGCAATGATA AAAACTCAAA CTGTATTGTC GATTACAAGC TTCCTGCAGT CTCTCCTGGA ACCTTGCTGA ACCAGCAGAC AGCAATCTGT CGCTAA
|
Protein sequence | MHQVLLLPRF ARLTIALGLA TAVFPVDAEF YFNPRFLSND LAESVDLSAF TKGREAPPGT YRVDIYLNDE FMTSRDITFI ADDNNADLIP CLSTDLLVSL GIKKSALLDN KEHSAEKHVS DNSACTPLRD RLADASSEFN VGQQHLSLSV PQIYVGRMAR GYVSPDLWEE GINAGLLNYS FNGNSINNRG NHNAGKSNYA YLNLQSGINI GSWRLRDNST WSYNSGSSNS SDSNKWQHIN TSAERDIIPL RSRLTVGDSY TDGDIFDSVN FRGLKINSTE AMLPDSQHGF APVIHGIARS TAQVSVKQNG YDVYQTTVPP GPFTIDDINS AANGGDLQVT IKEADGSIQT LYVPYSSVPV LQRAGYTRYA LAMGEYRSGN NLQSSPKFIQ GSLMHGLEGN WTPYGGMQIA EDYQAFNLGI GKDLGLFGAF SFDITQANTT LADGTRHSGQ SIKSVYSKSF YQTGTNIQVA GYRYSTQGFY NLSDSAYSRM SGYTVKPPTG DSNEQTQFID YFNLFYSKRG QEQISISQQL GNYGTTFFSA SRQSYWNTSR SDQQISFGLN VPFSDITTSL NYSYSNNIWQ NDRDHLLAFT LNVPFSHWMR TDSQSAFRNS NASYSMSNDL KGGMTNLSGV YGTLLPDNNL NYSVQVGNTH GGNTSSGTSG YSSLNYRGAY GNTNIGYSRS GDSSQIYYGM SGGIIAHADG ITFGQPLGDT MVLVKAPGAD NVKIENQTGI HTDWRGYAIL PFATEYRENR VALNANSLAD NVELDETVVT VIPTHGAIAR ATFNAQIGGK VLMTLKYGNK SVPFGAIVTH GENKNGSIVA ENGQVYLTGL PQSGKLQVSW GNDKNSNCIV DYKLPAVSPG TLLNQQTAIC R
|
| |