Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A3172 |
Symbol | |
ID | 6484803 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 3080808 |
End bp | 3083453 |
Gene Length | 2646 bp |
Protein Length | 881 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642738477 |
Product | fimbrial usher protein |
Protein accession | YP_002042201 |
Protein GI | 194445583 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3188] P pilus assembly protein, porin PapC |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.287175 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 0.39987 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTCTAA GCGTCTCCCC TTATAGCGCG TCAGGCAAAG ACATCGAATT TAATACCGAT TTCCTCGATG TAAAAAATCG CGATAACGTT AACATTGCAC AGTTTTCTCG TAAGGGTTTT ATTCTGCCAG GCGTCTACCT TTTACAAATT AAAATTAACG GACAGACTCT GCCGCAGGAA TTTCCTGTTA ACTGGGTTAT TCCAGAACAT GATCCACAAG GAAGTGAGGT TTGCGCAGAA CCAGAATTAG TTACGCAATT GGGTATAAAG CCGGAACTCG CGGAAAAACT CGTCTGGATA ACGCACGGCG AACGACAATG TCTGGCGCCA GATTCACTGA AAGGCATGGA TTTTCAGGCC GACCTGGGGC ACTCCACGCT GTTGGTGAAT TTACCCCAGG CGTATATGGA ATACAGCGAT GTCGACTGGG ACCCACCCGC CCGCTGGGAT AATGGTATTC CCGGCATCAT TCTGGATTAC AACATTAATA ATCAGCTCCG CCACGATCAA GAAAGCGGCA GCGAAGAGCA AAGCATCAGC GGCAACGGGA CGTTAGGCGC GAACCTGGGC GCATGGCGAC TGCGGGCCGA CTGGCAGGCC AGCTACGACC ATCGTGACGA TGACGAGAAC ACTTCCACTC TCCACGATCA GAGCTGGAGC CGCTACTACG CCTATCGCGC ACTACCGACG CTCGGGGCCA AACTTACGCT GGGCGAAAGC TATCTCCAGT CCGATGTTTT CGACAGCTTT AACTATATCG GTGCCAGCGT CGTTTCTGAC GATCAGATGC TGCCGCCGAA ACTGCGCGGC TATGCGCCGG AGATCGTGGG TATTGCGCGC TCTAATGCAA AAGTCAAAGT CTCCTGGCAG GGGCGCGTAC TGTATGAAAC GCAGGTGCCC GCAGGACCGT TCCGTATTCA GGATCTCAAC CAGTCCGTTT CCGGTACGTT GCACGTCACC GTGGAAGAGC AGAACGGTCA GACCCAGGAG TTTGACGTTA ACACCGCATC GGTTCCCTTC CTGACGCGCC CCGGCATGGT GCGCTACAAG ATGGCGCTGG GCCGCCCGCA GGACTGGGAT CATCACCCTA TTACCGGCAC ATTCGCCTCG GCGGAAGCTT CGTGGGGGGT CACCAACGGC TGGTCGCTAT ATGGCGGCGC AATTGGAGAA AGCAGCTATC AGGCCGTGGC GTTGGGAAGC GGTAAGGATC TTGGCGTGGT GGGCGCGGTG GCGGTTGACA TTACGCACTC CATCGCCCAC ATGCCGCAAG ACGACGGGTT TGACGGCGAA ACGCTGCAGG GTAACTCATA TCGCATCAGC TACTCCCGTG ACTTTGATGA AATCGACAGC CGACTAACCT TTGCCGGATA CCGCTTCTCA GAAAAGAACT TTATGAGCAT GAGCGACTAT CTGGATGCGA AAACCTATCA TCATCTCAAT GCCGGTCACG AAAAAGAACG CTATACGGTC ACCTATAACC AGAACTTCCG TGAACAGGGC ATGAGCGCCT ATTTCAGCTA CTCACGCAGT ACCTTCTGGG ACAGCCCGGA TCAGAGTAAC TATAACCTGT CTCTTTCCTG GTACTTCGAC TTAGGGTCGA TAAAAAATCT CAGTGCGTCG CTGAACGGCT ATCGCAGCGA ATATAACGGT GATAAAGATG ATGGCGTCTA TATCTCGCTG TCTGTTCCCT GGGGCAATGA TTCCATCAGC TACAACGGTA CGTTTAACGG TAGTCAACAC CGTAATCAGC TCGGCTATTC CGGCCACAGC CAGAACGGCG ATAACTGGCA GCTTCACGTC GGGCAGGATG AACAAGGCGC ACAGGCAGAC GGTTATTACA GCCATCAGGG CGCGCTGACG GACATCGATC TGAGCGCGGA TTATGAAGAA GGATCGTACC GTTCGCTGGG CATGTCGCTG CGCGGCGGCA TGACGCTGAC CACCCAGGGC GGCGCGCTAC ACCGGGGAAG TTTAGCGGGC AGCACACGTT TGCTGGTTGA TACCGACGGC ATTGCGGACG TCCCCGTTAG CGGTAACGGC TCGCCAACCT CAACCAACAT TTTCGGCAAG GCCGTGATTG CGGATGTCGG AAGCTATTCG CGCAGCCTGG CGCGTATCGA TCTGAACAAA TTGCCGGAGA AGGCGGAAGC TACTAAGTCG GTTGTGCAGA TCACGCTCAC CGAAGGCGCC ATCGGCTACC GTCACTTTGA CGTGGTCAGC GGCGAGAAAA TGATGGCGGT TTTCCGGCTG GCAGACGGCG ACTTCCCACC GTTCGGCGCC GAAGTGAAAA ACGAGCGCCA GCAGCAGTTG GGCCTGGTGG CCGATGACGG CAACGCGTGG CTGGCGGGCG TAAAAGCCGG GGAAACATTG AAAGTATTCT GGGACGGCGC GGCGCAGTGT GAAGCATCAC TCCCGCCCAC GTTTACACCG GAGCTATTGG CTAACGCGCT ATTGCTGCCG TGCAAAATTC TGGAAGGTCA GCCCCCCACC GCACCGCAGA AAAGTTCTCC GCTGCCTGCG CAACCGCTAA TCCAGGAACA TACGCAAACC GATGGCCAAC CGGCCGCGCC GGTGGCGACA ACCACTCAAA CCCCGCCCAT ACCGCTGGCT GACAACCATG CGGTGAATCG CAAGGATATG GAATAA
|
Protein sequence | MLLSVSPYSA SGKDIEFNTD FLDVKNRDNV NIAQFSRKGF ILPGVYLLQI KINGQTLPQE FPVNWVIPEH DPQGSEVCAE PELVTQLGIK PELAEKLVWI THGERQCLAP DSLKGMDFQA DLGHSTLLVN LPQAYMEYSD VDWDPPARWD NGIPGIILDY NINNQLRHDQ ESGSEEQSIS GNGTLGANLG AWRLRADWQA SYDHRDDDEN TSTLHDQSWS RYYAYRALPT LGAKLTLGES YLQSDVFDSF NYIGASVVSD DQMLPPKLRG YAPEIVGIAR SNAKVKVSWQ GRVLYETQVP AGPFRIQDLN QSVSGTLHVT VEEQNGQTQE FDVNTASVPF LTRPGMVRYK MALGRPQDWD HHPITGTFAS AEASWGVTNG WSLYGGAIGE SSYQAVALGS GKDLGVVGAV AVDITHSIAH MPQDDGFDGE TLQGNSYRIS YSRDFDEIDS RLTFAGYRFS EKNFMSMSDY LDAKTYHHLN AGHEKERYTV TYNQNFREQG MSAYFSYSRS TFWDSPDQSN YNLSLSWYFD LGSIKNLSAS LNGYRSEYNG DKDDGVYISL SVPWGNDSIS YNGTFNGSQH RNQLGYSGHS QNGDNWQLHV GQDEQGAQAD GYYSHQGALT DIDLSADYEE GSYRSLGMSL RGGMTLTTQG GALHRGSLAG STRLLVDTDG IADVPVSGNG SPTSTNIFGK AVIADVGSYS RSLARIDLNK LPEKAEATKS VVQITLTEGA IGYRHFDVVS GEKMMAVFRL ADGDFPPFGA EVKNERQQQL GLVADDGNAW LAGVKAGETL KVFWDGAAQC EASLPPTFTP ELLANALLLP CKILEGQPPT APQKSSPLPA QPLIQEHTQT DGQPAAPVAT TTQTPPIPLA DNHAVNRKDM E
|
| |