Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3387 |
Symbol | |
ID | 6143432 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3472403 |
End bp | 3475087 |
Gene Length | 2685 bp |
Protein Length | 894 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641618216 |
Product | hypothetical protein |
Protein accession | YP_001745365 |
Protein GI | 170682266 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3188] P pilus assembly protein, porin PapC |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACAAAA AATTACTGGC TCTTTTGATC CTGGCGAGTC TCAGCCCGGC AAAGGCAACA TTAACAAAAA TTCCCGCTGG TTTTGAGGTT ATTGCTCAGG GACAGCAGGA GTATATCGAG GTTTATTTTG CAGGGAAAAG TCTCGGTAAA TATTATGCGA TGGTTAATCT TGATACCGTA ACCTTTCTTG ATTCATCAAG TTTATACAGC AAACTTGAAT TGAGCACTGA CGATCAAAAA ATCGCTCATA CAGTGAAAGA AAAACTATCG CAACCCTTAG CTCGCCACGG TGAACTGGCT TGTGGTTTTG TACGTACTGA TTCAGGATGC GGTTTTCTTA ATACTGAGAC AGTAGAAATA ATCTACAATG ATGAAGAAAG TTCAGCAACG CTATTTCTTA ATCCGCAATG GAGTTCTGCG TTTAACTCGA AGTCATTGTA TTTAAATCCA GACAAAAATA CGGTTAACGC TTTTATACAT CAGCAAGATA TTAATGTGCT GGTACAGGAT GATTACCAAT CGTTGTCAAT TCAGGGTAAT GGCGCACTGG GAATAACAGA AAATAGCTAT ATTGGTGCGC ACTGGAATTT CGACGGTTAT GATGCAGATG ATGTCAGCGA CAACAATGTC GATGTCAGCG ATCTCTATTA CCGCTATGAT TTTTTACGCC GTTATTATGT GCAGGCGGGG CGGATGGACA ACCGCACATT ATTTAATGCG CAAGGTGGAA ACTTTACTTT TAACTTTCTG CCACTCGGTG CCATCGACGG AATGCGTATC GGGTCGACTC TCAGTTATTT AAACCAGACG CAAAGCCAGC AGGGAACGCC GGTAATGATC CTGCTTTCGC GCAATTCTCG TGTTGATGCT TATCGTAATG AACAACTGTT GGGATCGTTT TACCTCAATA GTGGTTCGCA ATTTATTGAT ACCAGTTCCT TTCCGCCAGG CAGCTACAGC GTGGCGTTAA AAGTTTATGA AAATAACCAA CTCACCCGCA CCGAGCTTGT ACCGTTTACC AAAACCGGCG GTCTGACAGA CGGAAATGCC CAGTGGTTCT TGCAGGCAGG TAAAACCACA TCACAGGTTT CTGATGATGA AAGTTCTGCT TATCAGCTAG GCGTACGCCT GCCATTACAT CCGCAATATG AGCTCTACGC AGGGCTGGCG AATGCCGATG ATGTAAGTGC TTTCGAGTTA GGCAATAACT GGACGGCAGA TTTAGGCGGG GCGGGGAATC TGGCTATCAG TGCCAGCGTG TTCCGTAACG ATGACGGCGG CAAAGGCGAT ATGCAACAGG CCAACTGGAG TCATCCGGGA TGGCCGACGT TGGGCTTTTA TCGGACCAAC TCTGACGGTG ATGCTTGTAC AACCGACAAC AGAGAGAGCT ACAACGCCTT AAGCTGTTAT GAAAGTATTT CCGCGACGGT TTCACAGAAT TTTGTCGGCT GGAATATGAT GCTGGGTTAT ACCCGCACAC AAAATAACAC TGATGATAGT TTGCGCTGGG ACAAACAGCA GAGCTTTGAA AATAACTATC TTCGCCAGAC TTCGGCTCAA AGCATTTCCG AGACCGTACA ACTTAGCGCT TCTCGCGCTT TTGTGATGCG TGACTGGATT TTGAGTACTT CCCTTGGGGT TTTCCATCGT AATGACAACG GCGGTGGTAG CGATGACAAC GGTTTGTATC TGTCGTTTTC GTTATCTGAT ACGCCGACGA TGGACAGCAA TAACAACAGC CATTCAACCA ATGTTTCTAC TGATTATCGT TATAGCGATC AGGATGGCGA TCAAACGTCA TGGCAGTTAT CGCATACCTT TTATAACGAC TCATTCAGCC ATAAAGAGCT TGGCGTGACC GTCGGCGGTT TGAACACCGA TACCATAAAC AGCGCGGTTA ACGGGCGTTG GGATGGCCAA TATGGAAATG TCTACGCTAC TGTCTCAGAC AGTTATGATC GCCAGAACCA TGATCATCTC TCGGCCTTCA CCGGGACATA CAGCTCCACG CTGGCGGTGA GTCGCTATGG CATCAATTTG GGGGCCAGCG GCTCAGATGA TTTGCTGGGA GCGGTGTTGG TGGATGTGAA AGGTTTCTCT GAACAGGATG AACAGAGTCA GGGTCTGCAA CTCGAAGCGC GGGTGGCTGG CAGCAGAACA TTGCAGCTTG GTCAAAGCGA CAGCGTGTTA TTCCCCTATC CTGGATTTCA GTCTGGCTTT GTTGAAGTTA ACGACAGTAA TCAGGGTAAC CAGCAAGGAA CAACCAACAT CATTAACGGT GCGGGAAATC GTGAGTTAAT GCTGTTGCCG GGCAAATTAC GTTATCGCGA AGTGTCTGCC AGTTTTAATT ACAACTATAT TGGTCGTTTG TTATTGCCCG CATCGGTAGA GAAATTCCCG CTGGTGGGTC TGAATAGCGC CATGTTACTG GTGGCTGAAG ATGGCGGATT CACACTTGAG ATCAGCAGTG GTGAAAAAGA GTTGTATCTG CTTTCCGGGC AGCAGTTCCT GAAATGTCCG CTGAATGTTT TGAAGAAACG CGCCAGCATT CGCTATAGCG GGGATGTGAA TTGTAGCGTG GTGAGTTATT CACAATTGCC GGAATCTATT CAGGTTCAGG CACAGTTGAA ACAGCCTAAA TTACGTGGAA ACGTCCAGAC GGCGCAAAGG GAGGTTGCAC CATGA
|
Protein sequence | MDKKLLALLI LASLSPAKAT LTKIPAGFEV IAQGQQEYIE VYFAGKSLGK YYAMVNLDTV TFLDSSSLYS KLELSTDDQK IAHTVKEKLS QPLARHGELA CGFVRTDSGC GFLNTETVEI IYNDEESSAT LFLNPQWSSA FNSKSLYLNP DKNTVNAFIH QQDINVLVQD DYQSLSIQGN GALGITENSY IGAHWNFDGY DADDVSDNNV DVSDLYYRYD FLRRYYVQAG RMDNRTLFNA QGGNFTFNFL PLGAIDGMRI GSTLSYLNQT QSQQGTPVMI LLSRNSRVDA YRNEQLLGSF YLNSGSQFID TSSFPPGSYS VALKVYENNQ LTRTELVPFT KTGGLTDGNA QWFLQAGKTT SQVSDDESSA YQLGVRLPLH PQYELYAGLA NADDVSAFEL GNNWTADLGG AGNLAISASV FRNDDGGKGD MQQANWSHPG WPTLGFYRTN SDGDACTTDN RESYNALSCY ESISATVSQN FVGWNMMLGY TRTQNNTDDS LRWDKQQSFE NNYLRQTSAQ SISETVQLSA SRAFVMRDWI LSTSLGVFHR NDNGGGSDDN GLYLSFSLSD TPTMDSNNNS HSTNVSTDYR YSDQDGDQTS WQLSHTFYND SFSHKELGVT VGGLNTDTIN SAVNGRWDGQ YGNVYATVSD SYDRQNHDHL SAFTGTYSST LAVSRYGINL GASGSDDLLG AVLVDVKGFS EQDEQSQGLQ LEARVAGSRT LQLGQSDSVL FPYPGFQSGF VEVNDSNQGN QQGTTNIING AGNRELMLLP GKLRYREVSA SFNYNYIGRL LLPASVEKFP LVGLNSAMLL VAEDGGFTLE ISSGEKELYL LSGQQFLKCP LNVLKKRASI RYSGDVNCSV VSYSQLPESI QVQAQLKQPK LRGNVQTAQR EVAP
|
| |