Gene Information |
Locus tag | SNSL254_A4949 |
Symbol | |
ID | 6482168 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 4818020 |
End bp | 4819105 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642740159 |
Product | putative major fimbrial subunit |
Protein accession | YP_002043833 |
Protein GI | 194444015 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3539] P pilus assembly protein, pilin FimA |
TIGRFAM ID | |
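The coordinate and length fields above agree with one another, and a short sketch can show the arithmetic. It assumes 1-based, inclusive coordinates (which the values in this record imply) and a CDS that ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(4818020, 4819105)
print(length, protein_length(length))  # 1086 361
```

The strand field ("-") does not affect these lengths. It only indicates that the coding sequence is read from the reverse complement of the replicon.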
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 105 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAAGGT TATATCTGGC GCTTATTCTG CTGTTCGCGT ATTCCGGTCA TGGTTACGCC TCCTGTAAAC GTTCCGGAAA TGAAGGCGCG ATCACTATCA CCCCGCCGTC GCAGCTTGTG GTGGATAGCC ACGCCTATAC CGCAGGCGAA GTGTTGTGGC AATCGGGCTG GGTTTCCACC TCCGAAGTCA CAATGGATGG CTGTAGTCGC GATTATAAGG TCGGTTTTTT ATATGAACCC GGTAGCGCGC AGTCAAATAC ATCAGCGACA ATCAATGCGA ATGACGGGAA CAACACACCG GTATTTAGCA CCGGGATTTC CGGCGTGGGC ATTGCGATTA AAACCCAAAC GAATGCTGGC CCTTACGATA ATGTCATGCC AATCGATAAT ACCTACCATA ATGGCGATGG CAATAAAACA CACCATGCGA TGGCTCCCGC CTACAATGTT GAACTGGTCG CCTTAGGCGG TCCGATCACC TCCGGTACCG CGACATTCCA AAGCCCACTG GCGCGCGTAT CCTTTCGCGA TAGCGCAACG GAAGACTCCG GCGGCGATAT CCTGACCCAT CTGTATTTGG GGAATACGCA ATTGATTATG AAAGCGATGG GATGTCGGGT AGAAACACCT GCCATCACCG TGGATTTAGG CAGCGTCAAT TTAGGCAGTT TCGCTAACAG TCAAACTGCG GGCACAGGCG AGCAGGATAT CTTATTGACC TGCGAACAAG GCACCGCTAT CTCCGCATCG TTAAGCGCCC AACCGGCCAG CGGAAATAAC CCTGATAATT CAGTCATCCA GTTGAGCAAT GCAAGCGCGC CAACCAGCGC AACCGGCGTT GGCGTACAGT TGGGTATTCA GGCGCCGGAC GCCGGTTTCT TTACCGACAG TTTGCCAATT AATCAAAAAA TTGATCTCTT TACTCACACG ATTACCACCA ATGCCGATGG CAGCCAGACG GTTAACGGCG GAACCATGAA TATGTCGACC ACCCTGAAAA TTAGCGCGCG CTACTATAAA ACCGCCGCCA CGGTAACGGC CGGGCAAGCG AATGCCACGG CAACATTAAA CCTGACCTAT AACTAA
|
Protein sequence | MRRLYLALIL LFAYSGHGYA SCKRSGNEGA ITITPPSQLV VDSHAYTAGE VLWQSGWVST SEVTMDGCSR DYKVGFLYEP GSAQSNTSAT INANDGNNTP VFSTGISGVG IAIKTQTNAG PYDNVMPIDN TYHNGDGNKT HHAMAPAYNV ELVALGGPIT SGTATFQSPL ARVSFRDSAT EDSGGDILTH LYLGNTQLIM KAMGCRVETP AITVDLGSVN LGSFANSQTA GTGEQDILLT CEQGTAISAS LSAQPASGNN PDNSVIQLSN ASAPTSATGV GVQLGIQAPD AGFFTDSLPI NQKIDLFTHT ITTNADGSQT VNGGTMNMST TLKISARYYK TAATVTAGQA NATATLNLTY N
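The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched as follows. This is a minimal illustration, not the pipeline that produced the record. It relies on the fact that translation table 11 assigns the same amino acids to sense codons as the standard code, and it does not handle alternative start codons, which is not an issue here because the gene begins with ATG. All names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code; table 11
# uses the same sense-codon assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-letter groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base groups of the gene sequence in this record.
first_groups = "ATGAGAAGGT TATATCTGGC GCTTATTCTG"
print(translate(first_groups))  # MRRLYLALIL
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way allows a comparison against the protein sequence and GC content fields of the record.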