Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A0600 |
Symbol | |
ID | 6483140 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 613055 |
End bp | 614062 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642736017 |
Product | mannose binding protein FimH |
Protein accession | YP_002039791 |
Protein GI | 194442424 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3539] P pilus assembly protein, pilin FimA |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.282698 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 0.11786 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATAT ACTCAGCGCT ATTGCTGGCG GGGACCGCAC TCTTTTTCAC CCATCCCGCG CTGGCGACGG TTTGCCGTAA TTCAAACGGG ACGGCGACCG ATATCTTTTA TGACCTGTCA GATGTTTTCA CCAGCGGCAA TAATCAGCCG GGACAGGTGG TGACGCTGCC GGAAAAATCA GGTTGGGTCG GCGTAAACGC GACGTGCCCG GCGGGGACAA CGGTAAATTA TACCTACCGA AGCTATGTAT CAGAATTACC GGTACGAAGC ACCGAAGGAA ATTTTAAATA CCTCAAGCTG AATGACTACC TTCTGGGCGC GATGAGCATC ACCGATAGTG TCGCTGGCGT ATTTTATCCG CCCCGTAACT ATATTCGCAT GGGCGTCGAC TCTAACGTGT CGCAGCAAAT GCCGTTTGGC GTGCAGGACT CAAAGCTGGT TTTTAAATTA AAAGTGATAC GGCCTTTTAT TAATATGGTG ACGATCCCTC GCCAGACAAT GTTTACTGTC TATGTGACGA CCTCTACCGG CGACGCGTTG AGCACGCCGG TATATACCAT TAGCTACAGC GGCAAAGTGG AAGTGCCGCA AAACTGCGAA GTGAATGCCG GACAGGTCGT GGAGTTTGAT TTCGGCGATA TCGGCGCGTC GTTATTTAGT CAGGCGGGGG CGGGTAATCG TCCGCAAGGC GTCACGCCGC AAACGAAAAC TATCGCTATT AAATGTACCA ACGTCGCGGC GCAAGCCTAT TTATCGATGC GGCTTGAAGC CGAAAAGGCC TCAGGGCAGG CGATGGTGTC CGATAATCCG GATTTAGGCT TTGTGGTTGC TAATAGCAAC GGTACGCCGC TCACACCCAA TAATTTGTCG AGTAAAATTC CGTTTCATCT TGATGATAAC GCCGCCGCTC GCGTAGGTAT TCGCGCCTGG CCGATCAGCG TGACGGGGAA TAAACCGGTG GAAGGGCCGT TTACTGCGCG CGGCTATCTA CGAGTCGATT ATGATTAA
|
Protein sequence | MKIYSALLLA GTALFFTHPA LATVCRNSNG TATDIFYDLS DVFTSGNNQP GQVVTLPEKS GWVGVNATCP AGTTVNYTYR SYVSELPVRS TEGNFKYLKL NDYLLGAMSI TDSVAGVFYP PRNYIRMGVD SNVSQQMPFG VQDSKLVFKL KVIRPFINMV TIPRQTMFTV YVTTSTGDAL STPVYTISYS GKVEVPQNCE VNAGQVVEFD FGDIGASLFS QAGAGNRPQG VTPQTKTIAI KCTNVAAQAY LSMRLEAEKA SGQAMVSDNP DLGFVVANSN GTPLTPNNLS SKIPFHLDDN AAARVGIRAW PISVTGNKPV EGPFTARGYL RVDYD
|
| |