Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0633 |
Symbol | sfmD |
ID | 6968250 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 655425 |
End bp | 658034 |
Gene Length | 2610 bp |
Protein Length | 869 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 643384671 |
Product | outer membrane usher protein SfmD |
Protein accession | YP_002269185 |
Protein GI | 209401002 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3188] P pilus assembly protein, porin PapC |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.770244 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.392496 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATAC CCACTACTAC GGATATTCCG CAGAGGTATA CCTGGTGTCT GGCCGGAATT TGTTATTCAT CTCTTGCCAT TTTACCCTCC TTTTTAAGCT ATGCGGAAAG TTATTTCAAC CCGGCATTTT TATTAGAGAA TGGCACATTC GTTGCTGATT TATCGCGCTT TGAGAGAGGT AATCATCAAC CTGCGGGCGT GTATCGGGTG GATCTCTGGC GTAATGATGA GTTCATTGGT TCACAGGATA TCGTATTTGA ATCGACAACA GAAAATACAG GTGATAAATC AGGTGGGTTA ATGCCCTGTT TTAACCAGGT ACTCCTTGAA CGAATTGGCC TTAATAGCAG TGCATTTCCC GAGTTAGCCC AGCAGCAAAA CAATAAATGC ATCAATTTAC TGAAAGCTGT ACCTGATGCC ACAATTAACT TTGATTTTGC AGCGATGCGC CTGAACATCA CTATTCCTCA GATAGCGTTG TTGAGTAGCG CTCACGGTTA CATTCCGCCT GAAGAGTGGG ATGAAGGTAT TCCTGCTTTA CTCCTGAATT ATAATTTCAC CGGTAACAGA GGTAATGGTA ACGATAGCTA TTTTTTTAGT GAGCTCAGCG GGATTAATAT TGGCCCGTGG CGTTTACGCA ACAATGGTTC CTGGAACTAT TTTCGCGGAA ATGGATATCA TTCAGAACAG TGGAATAATA TTGGCACCTG GGTACAGCGC GCCATTATTC CGCTGAAAAG TGAACTGGTA ATGGGAGACG GCAATACAGG AAGTGATATT TTCGATGGTG TTGGATTTCG TGGTGTACGG CTTTATTCTT CCGATAATAT GTATCCTGAT AGCCAGCAAG GGTTTGCCCC AACGGTACGT GGGATTGCCC GTACGGCGGC CCAGCTAACG ATTCGGCAAA ATGGTTTTAT TATCTATCAA AGCTATGTTT CCCCCGGCGC TTTTGAAATT ACAGATTTGC ACCCGACATC TTCAAATGGC GATCTGGACG TCACCATCGA CGAACGCGAT GGCAATCAGC AGAATTACAC TATTCCGTAT TCAACAGTGC CAATTTTACA ACGCGAAGGG CGTTTCAAAT TTGACCTGAC GGCGGGCGAT TTTCGTAGCG GTAATAGTCA GCAATCATCG CCTTTCTTTT TTCAGGGCAC GGCACTCGGC GGTTTACCAC AGGAATTTAC TGCCTACGGC GGGACGCAAT TATCTGCCAA TTACACCGCC TTTTTATTAG GGCTGGGGCG CAACCTCGGG AACTGGGGCG CAGTGTCGCT GGATGTGACC CATGCGCGCA GTCAGTTAGC CGACGACAGT CGTCATGAGG GGGATTCCAT TCGCTTCCTC TATGCGAAAT CGATGAACAC CTTCGGCACC AATTTTCAGT TAATGGGTTA CCGCTATTCG ACACAAGGTT TTTATACCCT TGATGATGTT GCGTATCGTC GAATGGAGGG GTACGAATAT GATTACGATT ATGACGGTGA GCATCGGGAT GAACCGATAA TCGTGAATTA CCACAATTTA CGCTTTAGCC GTAAAGACCG TTTGCAGTTA AATATTTCAC AATCACTTAA TGACTTTGGC TCGCTTTATA TCTCTGGTAC CCATCAAAAA TACTGGAATA CTTCGGATTC AGATACCTGG TATCAGGTGG GGTATACCAG CAGCTGGGTT GGCATCAGTT ATTCACTCTC ATTTTCGTGG AATGAATCTG TAGGGATCCC CGATAACGAA CGTATTGTCG GACTTAATGT TTCAGTGCCT TTCAATGTTC TGACCAAACG TCGCTACACC CGGGAAAATG CGCTCGACCG CGCTTATGCC TCCTTTAACG CCAACCGTAA CAGCAACGGG CAAAATAGCT GGCTGGCAGG TGTAGGTGGG ACCTTACTGG AAGGCCACAA CCTGAGTTAT CACGTAAGCC AGGGTGATAC CTCGAATAAT GGGTATACGG GCAGCGCCAC GGCAAACTGG CAGGCCGCTT ACGGTACGCT GGGGGTCGGG TATAACTACG ACCGCGATCA ACATGACGTT AACTGGCAGC TGTCTGGCGG TGTGGTCGGA CATGAAAATG GCATAACGCT GAGCCAGCCT TTAGGGGATA CCAATGTTTT GATTAAAGCG CCTGGCGCAG GCGGTGTACG CATTGAAAAT CAAACTGGCA TTTTAACCGA CTGGCGCGGC TATGCGGTGA TGCCGTATGC CACGGTTTAT CGGTATAACC GTATCGCGCT TGATACCAAT ACGATGGGGA ATTCCATCGA TGTTGAAAAA AATATTAGCA GCGTTGTGCC GACGCAAGGC GCGTTGGTTC GTGCCAATTT TGATACCCGC ATAGGCGTGC GGGCGCTCAT TACCGTTACC CAGGGCGGAA AACCGGTGCC GTTTGGATCA CTGGTACGGG AAAACAGTAC CGGAATAACC AGTATGGTGG GTGATGACGG GCAAGTTTAT TTAAGTGGTG CGCCATTGTC TGGTGAATTA CTGGTTCAGT GGGGAGACGG CGCGAACTCA CGCTGCATTG CGCACTATGT ATTGCCGAAG CAAAGCTTAC AGCAAGCCGT CACTGTTATT TCGGCAGTTT GCACACATCC TGGCTCATAA
|
Protein sequence | MKIPTTTDIP QRYTWCLAGI CYSSLAILPS FLSYAESYFN PAFLLENGTF VADLSRFERG NHQPAGVYRV DLWRNDEFIG SQDIVFESTT ENTGDKSGGL MPCFNQVLLE RIGLNSSAFP ELAQQQNNKC INLLKAVPDA TINFDFAAMR LNITIPQIAL LSSAHGYIPP EEWDEGIPAL LLNYNFTGNR GNGNDSYFFS ELSGINIGPW RLRNNGSWNY FRGNGYHSEQ WNNIGTWVQR AIIPLKSELV MGDGNTGSDI FDGVGFRGVR LYSSDNMYPD SQQGFAPTVR GIARTAAQLT IRQNGFIIYQ SYVSPGAFEI TDLHPTSSNG DLDVTIDERD GNQQNYTIPY STVPILQREG RFKFDLTAGD FRSGNSQQSS PFFFQGTALG GLPQEFTAYG GTQLSANYTA FLLGLGRNLG NWGAVSLDVT HARSQLADDS RHEGDSIRFL YAKSMNTFGT NFQLMGYRYS TQGFYTLDDV AYRRMEGYEY DYDYDGEHRD EPIIVNYHNL RFSRKDRLQL NISQSLNDFG SLYISGTHQK YWNTSDSDTW YQVGYTSSWV GISYSLSFSW NESVGIPDNE RIVGLNVSVP FNVLTKRRYT RENALDRAYA SFNANRNSNG QNSWLAGVGG TLLEGHNLSY HVSQGDTSNN GYTGSATANW QAAYGTLGVG YNYDRDQHDV NWQLSGGVVG HENGITLSQP LGDTNVLIKA PGAGGVRIEN QTGILTDWRG YAVMPYATVY RYNRIALDTN TMGNSIDVEK NISSVVPTQG ALVRANFDTR IGVRALITVT QGGKPVPFGS LVRENSTGIT SMVGDDGQVY LSGAPLSGEL LVQWGDGANS RCIAHYVLPK QSLQQAVTVI SAVCTHPGS
|
| |