Gene Information |
Locus tag | SeD_A0596 |
Symbol | |
ID | 6875425 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 612091 |
End bp | 613098 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642783814 |
Product | mannose binding protein FimH |
Protein accession | YP_002214501 |
Protein GI | 198244251 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3539] P pilus assembly protein, pilin FimA |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 0.244 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATAT ACTCAGCGCT ATTGCTGGCG GGGACCGCGC TCTTTTTCAC CCATCCCGCG CTGGCGACGG TTTGCCGTAA TTCAAACGGG ACGGCGACCG ATATCTTTTA CGACCTGTCA GATGTTTTCA CCAGTGGCAA TAATCAGCCG GGACAGGTGG TTACGCTGCC GGAAAAATCT GGTTGGGTCG GCGTAAACGC GACGTGCCCG GCGGGGACAA CGGTAAATTA TACCTACCGA AGCTATGTAT CAGAATTACC GGTACAAAGC ACCGAAGGAA ATTTTAAATA CCTCAAGCTG AGTGACTACC TTCTGGGCGC GATGAGCATC ACCGATAGTG TCGCTGGCGT ATTTTATCCG CCCCGTAACT ATATTCGCAT GGGCGTCGAC TCTAACGTGT CGCAGCAAAT GCCGTTTGGC GTGCAGGACT CAAAGCTGGT TTTTAAATTA AAAGTGATAC GGCCTTTTAT TAATATGGTG ACGATCCCTC GCCAGACAAT GTTTACTGTC TATGTGACGA CCTCTACCGG CGACGCGTTG AGCACGCCGG TATATACCAT TAGCTACAGC GGCAAAGTGG AAGTGCCGCA AAACTGCGAA GTGAATGCCG GACAGGTCGT GGAGTTTGAT TTCGGCGATA TCGGCGCGTC GTTATTTAGT CAGGCGGGGG CGGGTAATCG TCCGCAAGGC GTCACGCCGC AAACGAAAAC TATCGCTATT AAATGTACCA ACGTCGCGGC GCAGGCCTAT TTATCTATGC GGCTTGAAGC CGAAAAGGCC TCAGGGCAGG CGATGGTGTC CGATAATCCG GATTTAGGCT TTGTGGTTGC TAATAGCAAC GGTACGCCGC TCATACCCAA TAATTTGTCG AGTAAAATTC CGTTTCATCT TGATGATAAC GCCGCCGCTC GCGTAGGTAT TCGCGCCTGG CCGATCAGCG TGACGGGGAA TAAACCGGCG GAAGGGCCGT TTACTGCGCG CGGCTATCTA CGAGTCGATT ATGATTAA
|
Protein sequence | MKIYSALLLA GTALFFTHPA LATVCRNSNG TATDIFYDLS DVFTSGNNQP GQVVTLPEKS GWVGVNATCP AGTTVNYTYR SYVSELPVQS TEGNFKYLKL SDYLLGAMSI TDSVAGVFYP PRNYIRMGVD SNVSQQMPFG VQDSKLVFKL KVIRPFINMV TIPRQTMFTV YVTTSTGDAL STPVYTISYS GKVEVPQNCE VNAGQVVEFD FGDIGASLFS QAGAGNRPQG VTPQTKTIAI KCTNVAAQAY LSMRLEAEKA SGQAMVSDNP DLGFVVANSN GTPLIPNNLS SKIPFHLDDN AAARVGIRAW PISVTGNKPA EGPFTARGYL RVDYD
|
| |
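
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the source database. It assumes that Start bp and End bp are 1-based inclusive coordinates, and that the gene length includes the terminal stop codon (the displayed gene sequence ends in TAA). All function names are hypothetical helpers introduced for this example.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming its last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def strip_blocks(seq: str) -> str:
    """Remove the whitespace that splits a displayed sequence into 10-character blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a sequence (block spacing is ignored)."""
    s = strip_blocks(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


if __name__ == "__main__":
    # Values copied from the record above.
    print(span_length(612091, 613098))    # 1008, matches "Gene Length"
    print(expected_protein_length(1008))  # 335, matches "Protein Length"
```

Passing the full Gene sequence value to `gc_percent` gives a figure that can be compared against the GC content field, which the record reports rounded to a whole percent.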