Gene EcSMS35_3387 details


Gene Information

Locus tag: EcSMS35_3387
Symbol:
ID: 6143432
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3472403
End bp: 3475087
Gene Length: 2685 bp
Protein Length: 894 aa
Translation table: 11
GC content: 47%
IMG OID: 641618216
Product: hypothetical protein
Protein accession: YP_001745365
Protein GI: 170682266
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3188] P pilus assembly protein, porin PapC
TIGRFAM ID:
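
The start and end coordinates, gene length, and protein length in this record are mutually consistent, and the relationship can be checked with a short sketch. The function names below are hypothetical, and the sketch assumes inclusive 1-based coordinates and a terminal stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for EcSMS35_3387
length_bp = gene_length(3472403, 3475087)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values from this record, the sketch gives 2685 bp and 894 aa, matching the Gene Length and Protein Length fields.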


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 61
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACAAAA AATTACTGGC TCTTTTGATC CTGGCGAGTC TCAGCCCGGC AAAGGCAACA 
TTAACAAAAA TTCCCGCTGG TTTTGAGGTT ATTGCTCAGG GACAGCAGGA GTATATCGAG
GTTTATTTTG CAGGGAAAAG TCTCGGTAAA TATTATGCGA TGGTTAATCT TGATACCGTA
ACCTTTCTTG ATTCATCAAG TTTATACAGC AAACTTGAAT TGAGCACTGA CGATCAAAAA
ATCGCTCATA CAGTGAAAGA AAAACTATCG CAACCCTTAG CTCGCCACGG TGAACTGGCT
TGTGGTTTTG TACGTACTGA TTCAGGATGC GGTTTTCTTA ATACTGAGAC AGTAGAAATA
ATCTACAATG ATGAAGAAAG TTCAGCAACG CTATTTCTTA ATCCGCAATG GAGTTCTGCG
TTTAACTCGA AGTCATTGTA TTTAAATCCA GACAAAAATA CGGTTAACGC TTTTATACAT
CAGCAAGATA TTAATGTGCT GGTACAGGAT GATTACCAAT CGTTGTCAAT TCAGGGTAAT
GGCGCACTGG GAATAACAGA AAATAGCTAT ATTGGTGCGC ACTGGAATTT CGACGGTTAT
GATGCAGATG ATGTCAGCGA CAACAATGTC GATGTCAGCG ATCTCTATTA CCGCTATGAT
TTTTTACGCC GTTATTATGT GCAGGCGGGG CGGATGGACA ACCGCACATT ATTTAATGCG
CAAGGTGGAA ACTTTACTTT TAACTTTCTG CCACTCGGTG CCATCGACGG AATGCGTATC
GGGTCGACTC TCAGTTATTT AAACCAGACG CAAAGCCAGC AGGGAACGCC GGTAATGATC
CTGCTTTCGC GCAATTCTCG TGTTGATGCT TATCGTAATG AACAACTGTT GGGATCGTTT
TACCTCAATA GTGGTTCGCA ATTTATTGAT ACCAGTTCCT TTCCGCCAGG CAGCTACAGC
GTGGCGTTAA AAGTTTATGA AAATAACCAA CTCACCCGCA CCGAGCTTGT ACCGTTTACC
AAAACCGGCG GTCTGACAGA CGGAAATGCC CAGTGGTTCT TGCAGGCAGG TAAAACCACA
TCACAGGTTT CTGATGATGA AAGTTCTGCT TATCAGCTAG GCGTACGCCT GCCATTACAT
CCGCAATATG AGCTCTACGC AGGGCTGGCG AATGCCGATG ATGTAAGTGC TTTCGAGTTA
GGCAATAACT GGACGGCAGA TTTAGGCGGG GCGGGGAATC TGGCTATCAG TGCCAGCGTG
TTCCGTAACG ATGACGGCGG CAAAGGCGAT ATGCAACAGG CCAACTGGAG TCATCCGGGA
TGGCCGACGT TGGGCTTTTA TCGGACCAAC TCTGACGGTG ATGCTTGTAC AACCGACAAC
AGAGAGAGCT ACAACGCCTT AAGCTGTTAT GAAAGTATTT CCGCGACGGT TTCACAGAAT
TTTGTCGGCT GGAATATGAT GCTGGGTTAT ACCCGCACAC AAAATAACAC TGATGATAGT
TTGCGCTGGG ACAAACAGCA GAGCTTTGAA AATAACTATC TTCGCCAGAC TTCGGCTCAA
AGCATTTCCG AGACCGTACA ACTTAGCGCT TCTCGCGCTT TTGTGATGCG TGACTGGATT
TTGAGTACTT CCCTTGGGGT TTTCCATCGT AATGACAACG GCGGTGGTAG CGATGACAAC
GGTTTGTATC TGTCGTTTTC GTTATCTGAT ACGCCGACGA TGGACAGCAA TAACAACAGC
CATTCAACCA ATGTTTCTAC TGATTATCGT TATAGCGATC AGGATGGCGA TCAAACGTCA
TGGCAGTTAT CGCATACCTT TTATAACGAC TCATTCAGCC ATAAAGAGCT TGGCGTGACC
GTCGGCGGTT TGAACACCGA TACCATAAAC AGCGCGGTTA ACGGGCGTTG GGATGGCCAA
TATGGAAATG TCTACGCTAC TGTCTCAGAC AGTTATGATC GCCAGAACCA TGATCATCTC
TCGGCCTTCA CCGGGACATA CAGCTCCACG CTGGCGGTGA GTCGCTATGG CATCAATTTG
GGGGCCAGCG GCTCAGATGA TTTGCTGGGA GCGGTGTTGG TGGATGTGAA AGGTTTCTCT
GAACAGGATG AACAGAGTCA GGGTCTGCAA CTCGAAGCGC GGGTGGCTGG CAGCAGAACA
TTGCAGCTTG GTCAAAGCGA CAGCGTGTTA TTCCCCTATC CTGGATTTCA GTCTGGCTTT
GTTGAAGTTA ACGACAGTAA TCAGGGTAAC CAGCAAGGAA CAACCAACAT CATTAACGGT
GCGGGAAATC GTGAGTTAAT GCTGTTGCCG GGCAAATTAC GTTATCGCGA AGTGTCTGCC
AGTTTTAATT ACAACTATAT TGGTCGTTTG TTATTGCCCG CATCGGTAGA GAAATTCCCG
CTGGTGGGTC TGAATAGCGC CATGTTACTG GTGGCTGAAG ATGGCGGATT CACACTTGAG
ATCAGCAGTG GTGAAAAAGA GTTGTATCTG CTTTCCGGGC AGCAGTTCCT GAAATGTCCG
CTGAATGTTT TGAAGAAACG CGCCAGCATT CGCTATAGCG GGGATGTGAA TTGTAGCGTG
GTGAGTTATT CACAATTGCC GGAATCTATT CAGGTTCAGG CACAGTTGAA ACAGCCTAAA
TTACGTGGAA ACGTCCAGAC GGCGCAAAGG GAGGTTGCAC CATGA
 
Protein sequence
MDKKLLALLI LASLSPAKAT LTKIPAGFEV IAQGQQEYIE VYFAGKSLGK YYAMVNLDTV 
TFLDSSSLYS KLELSTDDQK IAHTVKEKLS QPLARHGELA CGFVRTDSGC GFLNTETVEI
IYNDEESSAT LFLNPQWSSA FNSKSLYLNP DKNTVNAFIH QQDINVLVQD DYQSLSIQGN
GALGITENSY IGAHWNFDGY DADDVSDNNV DVSDLYYRYD FLRRYYVQAG RMDNRTLFNA
QGGNFTFNFL PLGAIDGMRI GSTLSYLNQT QSQQGTPVMI LLSRNSRVDA YRNEQLLGSF
YLNSGSQFID TSSFPPGSYS VALKVYENNQ LTRTELVPFT KTGGLTDGNA QWFLQAGKTT
SQVSDDESSA YQLGVRLPLH PQYELYAGLA NADDVSAFEL GNNWTADLGG AGNLAISASV
FRNDDGGKGD MQQANWSHPG WPTLGFYRTN SDGDACTTDN RESYNALSCY ESISATVSQN
FVGWNMMLGY TRTQNNTDDS LRWDKQQSFE NNYLRQTSAQ SISETVQLSA SRAFVMRDWI
LSTSLGVFHR NDNGGGSDDN GLYLSFSLSD TPTMDSNNNS HSTNVSTDYR YSDQDGDQTS
WQLSHTFYND SFSHKELGVT VGGLNTDTIN SAVNGRWDGQ YGNVYATVSD SYDRQNHDHL
SAFTGTYSST LAVSRYGINL GASGSDDLLG AVLVDVKGFS EQDEQSQGLQ LEARVAGSRT
LQLGQSDSVL FPYPGFQSGF VEVNDSNQGN QQGTTNIING AGNRELMLLP GKLRYREVSA
SFNYNYIGRL LLPASVEKFP LVGLNSAMLL VAEDGGFTLE ISSGEKELYL LSGQQFLKCP
LNVLKKRASI RYSGDVNCSV VSYSQLPESI QVQAQLKQPK LRGNVQTAQR EVAP