Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A0026 |
Symbol | |
ID | 6486157 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 28418 |
End bp | 29425 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642735471 |
Product | mannose binding protein FimH |
Protein accession | YP_002039253 |
Protein GI | 194446333 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3539] P pilus assembly protein, pilin FimA |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.135295 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 0.119008 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATAC CTCTTTTATT TGCTCTGCTG GCGGGAAGTG TCGTATCGCA GTACGCCTTC GCAGACGTGT GTAAAAACGT TAACGGTGTA CCTTCCAGTA TTAATTACGA TTTAACGACC ACTCTGACGG CAGAACAAAA CCAGGTGGGA AAGACGGTTC AACTGGAAAA AAGCCAGGAA GTTAATGTAC AGGCGGTGTG TCCCGCCGGC GCGTCGACCT ATAGCCAGAC TTATCGCTCC TATGTGTCGC CATATCCGGT CGTAGAAACG AGCGGTAACT GGAAATATTT AAAGCTGGAC CCGGACTACC TTGAAGGCGG AATGCGAATT GAGGATTCTT CGGCGGGCGA TATCTATCCG CCAATGAACA ATGTCCTGAT GGGATATGAT GAAAATGTGA AAGCGGGTCA ACCGTTTTAC GTTCGTGACT CAAATCTGGA GTTTCAGCTC AAAATTGTTA AACCGTTCGT CGGCACGGTG AATATTAGTC CTAAGACTAT GTTCAATGTT TATGTCATGA CCGCCGCAGG CGATCCGCTG ACAGATGTCG TGTACAGCAT TCTTTATAGT GGAACGGTGA CCGTTCCGCA AAGCTGCGAA ATCAACGCCG GACAAACGAT TCTGGTGAAT TTCGGCGCAT TATACAGCGG CAATTTCAAC CATGCAGGCC AAAAGCCGGA GGGGGTACGA GCGAAAAAAT TCAGCGTACC GGTAAAGTGC AGCGGTCTGG ATTCGCAGGT CAATTTAACA ATGCGTCTTA TCGCTACGCC GGATAGCCAC GTTCCCCAGG CTATCGCTTC GGATAATGCC GATGTCGGCG TAGTGGTCGA AACCGATGAA GGAAACGCGC TTATCCCCAA TGATGTACAG AGCGTCGCGC CTTTTATCAC CGATAGCGCC GGACGCGCTA ACATCACATT GCAAGCCTAC CCGGTGAGTA CAACAGGCGA AACGCCAGCG GAAGGGGCGT TTACCGCGCT GGCCAGTCTG CGAGTGGACT TTGACTAA
|
Protein sequence | MKIPLLFALL AGSVVSQYAF ADVCKNVNGV PSSINYDLTT TLTAEQNQVG KTVQLEKSQE VNVQAVCPAG ASTYSQTYRS YVSPYPVVET SGNWKYLKLD PDYLEGGMRI EDSSAGDIYP PMNNVLMGYD ENVKAGQPFY VRDSNLEFQL KIVKPFVGTV NISPKTMFNV YVMTAAGDPL TDVVYSILYS GTVTVPQSCE INAGQTILVN FGALYSGNFN HAGQKPEGVR AKKFSVPVKC SGLDSQVNLT MRLIATPDSH VPQAIASDNA DVGVVVETDE GNALIPNDVQ SVAPFITDSA GRANITLQAY PVSTTGETPA EGAFTALASL RVDFD
|
| |