Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0577 |
Symbol | sfmD |
ID | 6147467 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 583602 |
End bp | 586211 |
Gene Length | 2610 bp |
Protein Length | 869 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641615469 |
Product | outer membrane usher protein SfmD |
Protein accession | YP_001742676 |
Protein GI | 170682983 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3188] P pilus assembly protein, porin PapC |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.5582 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATAC CCACTACTAC GGATATTCCG CAGAGGTATA CCTGGTGTCT GGCCGGAATT TGTTATTCAT CTCTTGCCAT TTTACCCTCC TTTTTAAGCT ATGCGGAAAG TTATTTCAAC CCGGCATTTT TATTAGAGAA TGGCACATCC GTTGCTGATT TATCGCGCTT TGAGAGAGGT AATCATCAAC CTGCGGGCGT GTATCGGGTG GATCTCTGGC GTAATGATGA GTTCATTGGT TCACAGGATA TCGTATTTGA ATCGACGACA GTAAATACAG GTGATAAATC AGGTGGGTTA ATGCCTTGTT TTAACCAGGC ACTCCTTGAA CGAATTGGCC TTAATAGCAG TGCATTCCCC GAGTTAGCCC AGCAGCAAAA TAATAAATGC ATCAATTTAC TGAAAGCTGT ACCTGATGCC ACAATTAACT TTGATTTTGC AGCGATGCGC CTGAACATCA CTATTCCTCA GATAGCGTTG TTGAGTAGCG CTCACGGTTA CATTCCGCCT GAAGAGTGGG ATGAAGGTAT TCCTGCTTTA CTCCTGAATT ATAATTTCAC CGGTAACAGA GGTAATGGTA ACGATAGCTA TTTTTTTAGT GAACTCAGCG GGATTAATAT TGGCCCGTGG CGTTTACGCA ACAATGGTTC CTGGAACTAT TTTCGCGGAA ATGGATATCA TTCAGAACAG TGGAATAATA TTGGCACCTG GGTACAGCGC GCCATTATTC CGCTGAAAAG TGAACTGGTA ATGGGAGACG GTAATACAGG GAGTGATATT TTCGATGGCG TTGGATTTCG AGGTGTACGG CTTTATTCTT CCGATAATAT GTATCCTGAT AGCCAGCAGG GGTTTGCTCC AACGGTACGT GGGATTGCCC GTACGGCGGC CCAGCTAACG ATTCGGCAAA ATGGTTTTAT TATCTATCAA AGCTATGTTT CCCCCGGTGC TTTTGAAATT ACAGATTTGC ACCCGACATC TTCAAATGGC GATCTGGACG TCACCATCGA CGAGCGCGAT GGTAATCAGC AGAATTACAC TATTCCGTAT TCAACAGTGC CAATTTTACA ACGCGAAGGG CGTTTCAAAT TTGACCTGAC GGCGGGCGAT TTTCGTAGCG GTAATAGTCA GCAATCGTCG CCTTTCTTTT TTCAGGGCAC GGCACTCGGC GGTTTACCAC AGGAATTTAC TGCCTACGGC GGGACGCAAT TATCTGCAAA TTACACCGCC TTTTTGTTAG GGCTGGGGCG CAACCTCGGG AACTGGGGCG CAGTGTCGCT GGATGTAACG CATGCGCGCA GTCAGTTAGC CGACGACAGT CGTCATGAGG GGGATTCCAT TCGCTTCCTC TATGCGAAAT CAATGAACAC TTTCGGCACC AATTTTCAGT TAATGGGTTA CCGCTATTCG ACACAAGGTT TTTATACCCT TGATGATGTT GCGTATCGTC GAATGGAGGG GTACGAATAT GATTACGATT ATGACGGTGA GCATCGCGAT GAACCGATAA TCGTGAATTA CCACAATTTA CGCTTTAGTC GTAAAGACCG TTTGCAGTTA AATATTTCAC AATCACTTAA TGACTTTGGC TCGCTTTATA TTTCTGGTAC CCATCAAAAA TACTGGAATA CTTCGGATTC AGATACCTGG TATCAGGTGG GGTATACCAG CAGCTGGGTT GGCATCAGTT ATTCACTCTC ATTTTCGTGG AATGAATCTG TAGGGATCCC CGATAACGAA CGTATTGTCG GACTTAATGT TTCAGTGCCT TTCAATGTTC TGACCAAACG TCGCTACACC CGGGAAAATG CGCTCGACCG CGCTTATGCC TCCTTTAACG CCAACCGTAA CAGCAACGGG CAAAATAACT GGCTGGCAGG CGTTGGTGGG ACCTTACTGG AAGGCCACAA CCTGAGTTAT CACGTAAGCC AGGGCGATAC CTCGAATAAT GGGTATACGG GCAGCGCCAC GGCAAACTGG CAGGCCGCTT ACGGTACGCT GGGGGTCGGG TATAACTACG ACCGCGATCA ACATGACGTT AACTGGCAGC TGTCTGGCGG TGTGGTCGGG CATGAAAATG GTATAACGCT GAGCCAGCCT TTAGGGGATA CCAATGTTTT GATTAAAGCG CCAGGCGCAG GAGGTGTACG CATTGAAAAT CAAACTGGAA TTTTAACCGA CTGGCGCGGC TATGCGGTGA TGCCGTATGC CACGGTTTAT CGGTATAACC GTATCGCGCT TGATACCAAT ACGATGGGGA ACTCCATCGA TGTTGAAAAA AATATTAGCA GCGTTGTGCC GACGCAAGGC GCGTTGGTTC GTGCCAATTT TGATACCCGC ATAGGCGTGC GGGCGCTCAT TACCGTTACC CAGGGCGGAA AACCGGTGCC GTTTGGATCA CCGGTACGGG AAAACAGTAC CGGAATAACC AGTATGGTGG GTGATGACGG GCAAGTTTAT TTAAGCGGTG CGCCATTGTC TGGTGAATTA CTGGTTCAGT GGGGAGACGG CGCGAACTCA CGCTGCATAG CGCACTATGT ATTGCCGAAG CAAAGCTTAC AGCAAGCCGT CACTGTTATT TCGGCAGTTT GCACACATCC TGGCTCATAA
|
Protein sequence | MKIPTTTDIP QRYTWCLAGI CYSSLAILPS FLSYAESYFN PAFLLENGTS VADLSRFERG NHQPAGVYRV DLWRNDEFIG SQDIVFESTT VNTGDKSGGL MPCFNQALLE RIGLNSSAFP ELAQQQNNKC INLLKAVPDA TINFDFAAMR LNITIPQIAL LSSAHGYIPP EEWDEGIPAL LLNYNFTGNR GNGNDSYFFS ELSGINIGPW RLRNNGSWNY FRGNGYHSEQ WNNIGTWVQR AIIPLKSELV MGDGNTGSDI FDGVGFRGVR LYSSDNMYPD SQQGFAPTVR GIARTAAQLT IRQNGFIIYQ SYVSPGAFEI TDLHPTSSNG DLDVTIDERD GNQQNYTIPY STVPILQREG RFKFDLTAGD FRSGNSQQSS PFFFQGTALG GLPQEFTAYG GTQLSANYTA FLLGLGRNLG NWGAVSLDVT HARSQLADDS RHEGDSIRFL YAKSMNTFGT NFQLMGYRYS TQGFYTLDDV AYRRMEGYEY DYDYDGEHRD EPIIVNYHNL RFSRKDRLQL NISQSLNDFG SLYISGTHQK YWNTSDSDTW YQVGYTSSWV GISYSLSFSW NESVGIPDNE RIVGLNVSVP FNVLTKRRYT RENALDRAYA SFNANRNSNG QNNWLAGVGG TLLEGHNLSY HVSQGDTSNN GYTGSATANW QAAYGTLGVG YNYDRDQHDV NWQLSGGVVG HENGITLSQP LGDTNVLIKA PGAGGVRIEN QTGILTDWRG YAVMPYATVY RYNRIALDTN TMGNSIDVEK NISSVVPTQG ALVRANFDTR IGVRALITVT QGGKPVPFGS PVRENSTGIT SMVGDDGQVY LSGAPLSGEL LVQWGDGANS RCIAHYVLPK QSLQQAVTVI SAVCTHPGS
|
| |