Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A3916 |
Symbol | |
ID | 6482341 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 3798822 |
End bp | 3801350 |
Gene Length | 2529 bp |
Protein Length | 842 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642739176 |
Product | fimbrial usher protein |
Protein accession | YP_002042886 |
Protein GI | 194444158 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3188] P pilus assembly protein, porin PapC |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 63 |
Fosmid unclonability p-value | 0.834184 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACATGGA CGCATCTTCC TCTGGGCAAT AAGACCTCGC GTTTCACGCA GTCTGCGCTT GCGCTGATGA TAGCGGGTAC GCTCCCCGCG TATGCGGGAA CATTTAACCC GCGCTTTCTG GAGGATGTGC CGGGTATTGA TCAGCACGTT GACCTTTCAA TGTATGAATC CAATAAAGCT GAACACCTGC CAGGTAAATA CCGCGTCTCG GTGGTGGTCA ACGAAAAAAA AATGGAGTCT CGCACCCTGG AGTTTAAGGC AGCGACAGAG GCGCAGCGCG CAAAAATGGG TGAATCCCTG GTGCCGTGCT TAAGCCGCGT GCAGCTTGAA GATATGGGCG TGCGTATTGA TAGCTTCCCG GCTCTGAAAA TGGCCCCGCC TGAAGCCTGT GTTGCTTTTG ACGACATTAT TCCCCAGGCC GCCAGCCATT TCGACTTTGC AGACCAGACC CTGATCATGA GCTTCCCGCA GGCTGCGATG AAGCAGACAG CGCGCGGTAC GGTGCCAGAA TCGCAGTGGG ACGAAGGGGT GAATGCCCTG CTGGTGGATT ATAACTTTTC CGGCAGCAAC GCCAGCTATG ACGCACACGA CAGTGAAACC AGCTACAACA GCGACAGCTA CTATCTGAAT CTGCGCAGCG GTATGAACCT GGGGGCATGG CGGTTACGTA ACTATAGCAC CTGGACGCGA AACGACGGTA ACAACACATG GGATAACATT GGCACATCTC TAAGCCGTGC CATTGTGCCG CTGAAATCAC AGCTGACGTT GGGGGATACC TCCACCGCCG GTGATATTTT TGACAGCGTT CAGATGCGCG GTGTGCAGTT AACTTCCGAC GAAGAGATGC TGCCTGACAG CCAGCGCGGG TTTGCGCCCG TCATCCGGGG TATTGCCAAA AGTAACGCCG AAGTTACCGT TGAGCAGAAC AACTACGTTA TTTACCGTAC GTTTGTTCAG CCGGGTGCGT TTGAAATTAA CGACCTGTAT CCAACCTCAA ACAGCGGCGA CCTGACGGTC ACCATTAAAG AATCGGACGG CAGTGAGCAG AAGTTCGTTC AGCCGTTCTC CTCGGTGGCG CTCCTCCAGC GTGAAGGCCA TCTCAAATAC AGCCTTTCCG CCGGGGAATA CCGTGCCGGG AACTATAACA GCGCCGAGCC GAAATTCGGG CAGCTTGATG CCATGTACGG CCTGCCGTAT GGCTTTACCG TTTACGGTGG TGCGATCTTC TCTGACGACT ATTACTCGCT GGCGGGAGGA TTAGGTAAAA ACTTCGGTTA TATCGGCGCG ATCTCCATCG ATGTAACCCA GGCAAAAAGC AAGCTGGCAA ATGAGGAGAA TTCGGAAGGT CAGTCTTATC GTTTCCTCTA CTCCAAGAGC TTTAACAGCG GTACAGATTT CCGTCTGCTG GGTTACAAGT ATTCGACCAG CGGCTATTAC ACCTTCCAGG AAGCGACGGA TGTGCGCAGC GATGCGGACA GCTCTTATAG CCAGTACCAC AAACGTAGTC AGATTCAGGG CAACGTGACG CAGCAACTGG GCGCCTGGGG CTCGGTCTAT TTTAACGTCA CGCAGCAGGA CTACTGGAAC GATGAAGGTA AACAGCGTTC GCTGAATGCC GGTTATAACG GCCGTATTGG CCGCGTGAAC TACAGCGTTG CTTACACCTG GACGAAAAGC CCGGAGTGGG ATGAGAGCGA TCGTTTACTG TCATTCTCCA TGTCGATTCC ACTGGGACGC GTGTGGAGTA ACTACCACCT CACGACCGAT CAGCATGGCC GAACCAACCA GCAGTTAGGG GTGAGCGGCA CCGCGCTGGA AGACCACAAC CTGAACTATA GTGTGCAGGA AGGCTACGGC AGCAACGGCG TGGGTAACAG CGGCAGCGTG AACCTGGATT ACCAGGGCGG CGTGGGTAGC GCCAGCCTGG GTTACAACTA CAACCGTGAC GGCCAGCAGG TGAACTACGG TTTGCGCGGC GGTGTGATAG CCCATAGCGA AGGTATCACT CTTTCTCAAC CGCTGGGTGA ATCCATGGCC ATTATCTCCG CGCCGGGCGC GCGCGGCGCG CACGTGATCA ACAACGGTGG TGTGGAAGTG GACTGGATGG GTAATGCGGT CGTGCCTTAC CTTACTCCGT ACCGTGAAAC GGAAGTCTCA CTGCGAAGCG ACAGCCTGAA CAACCAGGTT GACCTGGATA CCGCCTCCGT CAACGTAGTG CCGACACGCG GCGCGATTGT TCGTGCCCGC TTCGATACCC GAGTGGGCTA TCGCGTGCTG ATGAATCTGA CGCAGGCCAA TGGCAAAGCG GTGCCGTTTG GTGCTACCGC CACGCTGCTG GATACCACAA AAGAGTTCAG CAGCATTGTG GGTGAAGACG GTCAGCTTTA TATCAGCGGG ATGCCGGAGA AAGGTGCCCT TCAGGTGAAC TGGGGTAAAG ACCAGGCACA GCAATGCCGC GTGGCGTTTA CGCTGCCGGA ACAACAGGAT AATACCGGCG TGGTGATGGC GAATGCCGTC TGCCGGTAA
|
Protein sequence | MTWTHLPLGN KTSRFTQSAL ALMIAGTLPA YAGTFNPRFL EDVPGIDQHV DLSMYESNKA EHLPGKYRVS VVVNEKKMES RTLEFKAATE AQRAKMGESL VPCLSRVQLE DMGVRIDSFP ALKMAPPEAC VAFDDIIPQA ASHFDFADQT LIMSFPQAAM KQTARGTVPE SQWDEGVNAL LVDYNFSGSN ASYDAHDSET SYNSDSYYLN LRSGMNLGAW RLRNYSTWTR NDGNNTWDNI GTSLSRAIVP LKSQLTLGDT STAGDIFDSV QMRGVQLTSD EEMLPDSQRG FAPVIRGIAK SNAEVTVEQN NYVIYRTFVQ PGAFEINDLY PTSNSGDLTV TIKESDGSEQ KFVQPFSSVA LLQREGHLKY SLSAGEYRAG NYNSAEPKFG QLDAMYGLPY GFTVYGGAIF SDDYYSLAGG LGKNFGYIGA ISIDVTQAKS KLANEENSEG QSYRFLYSKS FNSGTDFRLL GYKYSTSGYY TFQEATDVRS DADSSYSQYH KRSQIQGNVT QQLGAWGSVY FNVTQQDYWN DEGKQRSLNA GYNGRIGRVN YSVAYTWTKS PEWDESDRLL SFSMSIPLGR VWSNYHLTTD QHGRTNQQLG VSGTALEDHN LNYSVQEGYG SNGVGNSGSV NLDYQGGVGS ASLGYNYNRD GQQVNYGLRG GVIAHSEGIT LSQPLGESMA IISAPGARGA HVINNGGVEV DWMGNAVVPY LTPYRETEVS LRSDSLNNQV DLDTASVNVV PTRGAIVRAR FDTRVGYRVL MNLTQANGKA VPFGATATLL DTTKEFSSIV GEDGQLYISG MPEKGALQVN WGKDQAQQCR VAFTLPEQQD NTGVVMANAV CR
|
| |