Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2496 |
Symbol | |
ID | 6144254 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2541162 |
End bp | 2543810 |
Gene Length | 2649 bp |
Protein Length | 882 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641617368 |
Product | fimbrial usher protein |
Protein accession | YP_001744540 |
Protein GI | 170679648 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3188] P pilus assembly protein, porin PapC |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.111704 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTAATC ACTCAAATTT TCGGCTGCGG GGAATCGCCT GCTATATTGC GCTGGCAATC TCAGGTGGAT CAGTCAATGC ATGGGCTGAT GATTCCATTC AATTTGACCC CCGTTTCCTT GAGTTAAAGG GCGACACGAA AATTGATCTC GGTAAGTTTT CAAAAAAAGG GTATGTCGAC GCGGGAAAAT ATAATTTACG TGTATTTATA AATAAACAAC CCCTTTCTGA TGAATACGAC ATTAACTGGT ATGTTTCTGA AAACGATCCA ACAAAAACGT ATGCCTGCCT GACACCTGAG TTAGTGGCGG CGCTGGGGCT GAAAGAAGGG ATAGCAAAAA GCCTGCAGTG GACGCACAAC GATGAATGCC TTAAACCGGG GCAATTAGAT GGGATGGAAG TCGAGAATGA TTTAAGCCAG TCGGCGTTGC TGCTGACAGT GCCACAGGCT TATCTCGAAT ATACCAGCAG CGACTGGGAC CCACCCTCAC GCTGGGACGA CGGTATTCCT GGCCTGATTG CCGACTACAG CCTCAATGCG CAAACCCGCC ACCAGGAGCA GGGTGGCGAG GACTCACATG ATATCAGCGG CAACGGTACC GTTGGGGCGA ACCTGGGGGC ATGGCGTTTC CGCGCAGACT GGCAAAGTGA TTATCAGCAC ACCCGCAGCA ACGATGACGA CGATGACAGC AGTAACAGTA CAACGAGCAA AAACTGGGAC TGGAGCCGTT ATTACGCCTG GCGGGCCTTA CCCTCTTTAA AAGCGAAGCT GTCGCTGGGG GAAGATTATC TCAATTCCGA TATTTTCGAC GGCTTTAACT ATATTGGCAG CAGCGTCAGC ACCGATGACC AGATGCTGCC GCCCAACCTG CGCGGCTATG CGCCGGATGT CTCCGGCGTG GCGCACAGCA GTGCAAAAGT CACCATTAGC CAGATGGGCC GGGTACTTTA CGAAACCCAG GTTCCCGCCG GGCCATTCCG CATTCAGGAT ATCGGCGACT CCGTCTCCGG CACACTGCAC GTCCGCGTTG AAGAACAGAA TGGTCAGGTG CAGGAATATG ACGTTACGAC CGCATCTATG CCATTCCTCA CGCGCCAGGG GCAGGTGCGT TACAAAGTGA TGATGGGGCG TCCGGAAGAC TGGAACCACA AGACCGAAGG CGGCTTTTTC TCCGGCGGAG AAGCGTCATG GGGGGTGGCA GATGGCTGGT CGCTCTACGG TGGCGCGCTG GCAGATGAAC ACTACCAGTC GGCGGCGATG GGGGTGGGGC GCGACCTCGC ACAGTTTGGC GCGCTGGCGT TCGATGTGAC TCACTCGCAC GTCAACCTGG ATCATGACAG CGCATACGGC AAAGGAAAAC TGGACGGCAA CTCCTTTCGC GTGAGCTATG CCAAAGACTT TGACGAACTC AACAGCCGCG TCACCTTTGC AGGCTACCGT TTTTCTGAAA AGAACTTCAT GACCATGAGC GAGTATCTGG ACGCGAACCA GTCGGACATG GCGCGGACCG GTAACGACAA AGAGATGTAT ACGATCACCT ATAACCAGAA CTTTGCCGCT GCGGGTGTCT CGATCTATCT CAACTACTCC CATCGTACTT ACTGGGATCG CCCGGAACAG ACAAACTATA ACCTGATGTT TTCCCACTAT TTTAATATGG GGAGCATTCG CAACGTGAGC ATCTCGGTGA CCGGCTATCG CTACGAATAT GACGATAACG CGGATAAGGG GATGTACCTC AGCATGAGCA TTCCGTGGAG CGACAGCAGC ACCGTGACCT ACAACGGTTC CTACGGCAGC GGGTCGGACA GCAGCCAGGT CGGTTACTTT AAGCGCGTTG ATGACGCAAC GCACTACCAG GTTAACGTCG GTACCAGCGA ACAGCACGGC AGCGTGGATG GTTATCTGAG TCACGACGGC TCGCTGGCGA AGGTTGATCT CAGCGCCAAC TACCATGAGG GGGAATACCG CTCGGCGGGG ATCGCCTTAC AGGGCGGGGC AACGCTGACC GCGCATGGTG GGGCGCTGCA TCGCACTCAA AGCATGGGCG GTACGCGCCT GCTGATTGAC GCCGACGGGA TTGCCAATGT GCCGGTCGAA AGCAACGGCG CGCCGGTGTA CACCAATATG TTTGGCAAGG CGGTGGTCGC CGATATTAAC AACTACTATC GCAATCAGGC GTATATCGAT TTAAACAACC TGCCAGAAGA TGCGGAAGCC ACCCAGTCGG TGGTACAGGC AACCCTGACG GAAGGTGCAA TTGGCTATCG CAAATTCAAA GTGATCAGCG GGCAAAAAGC GATGGCGGTA CTGCGGTTGC GTGACGGCAG CTACCCGCCG TTTGGCGCGG AAGTGAAAAA CGACGAGCAG CAGCAGGTTG GCATTGTGGA TGATGAAGGC AATGTCTATC TGGCGGGAGT CAATGCGGGT GAGCATATGA CGGTGTTCTG GGAAGGCAGC GCACAATGCG AGATCGTATT GCCGAAGCCG CTACCTGCCG ATCTGTTCAG CGGCCTGTTG TTGCCGTGTG AACAAAAGGG AACGGCAGCC CCTGATTCTT CAGCGCCAGA AATTAAGCCT GTTATTCAGG ACCAGACGCG GCAAGTCACA CCAACGGAAG CGCCGACGTC AATTTCAGCT ACTCAATAA
|
Protein sequence | MPNHSNFRLR GIACYIALAI SGGSVNAWAD DSIQFDPRFL ELKGDTKIDL GKFSKKGYVD AGKYNLRVFI NKQPLSDEYD INWYVSENDP TKTYACLTPE LVAALGLKEG IAKSLQWTHN DECLKPGQLD GMEVENDLSQ SALLLTVPQA YLEYTSSDWD PPSRWDDGIP GLIADYSLNA QTRHQEQGGE DSHDISGNGT VGANLGAWRF RADWQSDYQH TRSNDDDDDS SNSTTSKNWD WSRYYAWRAL PSLKAKLSLG EDYLNSDIFD GFNYIGSSVS TDDQMLPPNL RGYAPDVSGV AHSSAKVTIS QMGRVLYETQ VPAGPFRIQD IGDSVSGTLH VRVEEQNGQV QEYDVTTASM PFLTRQGQVR YKVMMGRPED WNHKTEGGFF SGGEASWGVA DGWSLYGGAL ADEHYQSAAM GVGRDLAQFG ALAFDVTHSH VNLDHDSAYG KGKLDGNSFR VSYAKDFDEL NSRVTFAGYR FSEKNFMTMS EYLDANQSDM ARTGNDKEMY TITYNQNFAA AGVSIYLNYS HRTYWDRPEQ TNYNLMFSHY FNMGSIRNVS ISVTGYRYEY DDNADKGMYL SMSIPWSDSS TVTYNGSYGS GSDSSQVGYF KRVDDATHYQ VNVGTSEQHG SVDGYLSHDG SLAKVDLSAN YHEGEYRSAG IALQGGATLT AHGGALHRTQ SMGGTRLLID ADGIANVPVE SNGAPVYTNM FGKAVVADIN NYYRNQAYID LNNLPEDAEA TQSVVQATLT EGAIGYRKFK VISGQKAMAV LRLRDGSYPP FGAEVKNDEQ QQVGIVDDEG NVYLAGVNAG EHMTVFWEGS AQCEIVLPKP LPADLFSGLL LPCEQKGTAA PDSSAPEIKP VIQDQTRQVT PTEAPTSISA TQ
|
| |