Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A0155 |
Symbol | |
ID | 6483238 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 166908 |
End bp | 168110 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642735592 |
Product | type IV pilin biogenesis protein |
Protein accession | YP_002039374 |
Protein GI | 194444325 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1459] Type II secretory pathway, component PulF |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000866014 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 90 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGTTA AACAGCTCTG GCGCTGGCAA GGCGTTAACG ATAAAGGTCA ACTGGAACAA GACGTTGTAT GGGCGGACAA TCGTCTGGCG CTGATCATCA CCCTGCAACA TCAGCGCATT ATGCCGCTTC GCATCAAGCG CATGGGCGTT AACGCCGCAC TGTGGAAAGA AGAGCAAAGC GCTGAAATTA TTCATCAGTT GGCCACGCTC ATTCATGCCG GGCTGACGCT TTCTGAAGGG CTGGAACTCC TTGCGAAACA GCATCCACAC CGACAATGGC AAGCGCTGTT GCGCACGCTG GCTCACGAGC TTGAACAGGG CGTCCCTTTT TCCAGCGCAT TAGTCTCCTG GCCGCAGGTA TTTCCGCCGC TCTACCAGAC GATGATCCGC ACCGGAGAAC TGACCGGCAA ACTGGCCGAA TGCTGCTTTG AACTGGCCCG TCAGCAAAAA GCGCAACGGC AGATTACGGT TAGCGTGAAA AAGGCGCTGC GCTATCCCGC CATTATTCTG ACAATGGCCG CCCTGGTCGT TTTCGCCATG CTGCACTTTG TTCTGCCGGA ATTTGCCGCC ATTTACCGTA GCTTCAATAC CCCGCTCCCT CTTCTGACGC GCGGTATTAT CGCGATAGCG CAATGGGGGG CGGCATGGGG TTGGCTCATC TTGTTCCTGA CGATGCTCAT TGCTATCGCT CACCGCAGGG TAAAACAAAA GCCGTCCTGG CAAGCGCAGC GGCAGCGTCT TCTGCTACGG CTTCCCGTTA TGGGTCGCCT GATAAGAGGC CAGAAACTAG CGCAAATATT CACCGTACTG GCATTAACCC AAAGCGCAGG CATTCCTTTT CTTCAGGGAC TGGAAAGCGC TATCGAGAGT CTCGGCTGCC CTTACTGGTC ACAGCGTTTA ACGCAGGTAC ATCAGGAGAT CGCCGCGGGC AATCCGGTCT GGTTGGCGCT AAAAAATACC CAGGAATTTA GTCCGCTATG CCTGCAACTG GTCAGAACGG GCGAAGCGTC CGGCTCACTC GATATCATGC TGCATAACCT TGCCCGTCAC CACAGTGAAA CTACGCTGGC GCTGGCCGAT AATCTGGCGT CGCTGTTGGA ACCGGCGTTA TTGATCATCA CCGGCTTAAT TATCGGTACG CTGGTGGTGG CGATGTATTT GCCGATTTTT CATCTGGGAG ACGCGATGAG CGGGATGGGA TAA
|
Protein sequence | MSVKQLWRWQ GVNDKGQLEQ DVVWADNRLA LIITLQHQRI MPLRIKRMGV NAALWKEEQS AEIIHQLATL IHAGLTLSEG LELLAKQHPH RQWQALLRTL AHELEQGVPF SSALVSWPQV FPPLYQTMIR TGELTGKLAE CCFELARQQK AQRQITVSVK KALRYPAIIL TMAALVVFAM LHFVLPEFAA IYRSFNTPLP LLTRGIIAIA QWGAAWGWLI LFLTMLIAIA HRRVKQKPSW QAQRQRLLLR LPVMGRLIRG QKLAQIFTVL ALTQSAGIPF LQGLESAIES LGCPYWSQRL TQVHQEIAAG NPVWLALKNT QEFSPLCLQL VRTGEASGSL DIMLHNLARH HSETTLALAD NLASLLEPAL LIITGLIIGT LVVAMYLPIF HLGDAMSGMG
|
| |