Gene Information |
Locus tag | SeD_A0025 |
Symbol | |
ID | 6874880 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 28427 |
End bp | 29434 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642783285 |
Product | mannose binding protein FimH |
Protein accession | YP_002213979 |
Protein GI | 198246163 |
COG category | [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3539] P pilus assembly protein, pilin FimA |
TIGRFAM ID | |
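The coordinate and length fields in this record are mutually consistent. A minimal sketch of that arithmetic follows, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp taken from the record above.
length = gene_length_bp(28427, 29434)
print(length, protein_length_aa(length))  # prints "1008 335"
```

The result matches the Gene Length (1008 bp) and Protein Length (335 aa) fields.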
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.159882 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 0.122652 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAAAATAC CTCTTTTATT TGCTCTGCTG GCGGGAAGTG TCGTATCGCA GTACGCCTTC GCAGACGTGT GTAAAAACGT TAACGGCGTG CCTTCCAGTA TTAATTACGA TTTAACGACC ACTCTGATGG CAGAACAAAA CCAGGTGGGA AAGACGGTTC AACTGGAAAA AAGCCAGGAA GTTAATGTAC AGGCGGTGTG TCCCGCCGGC GCGTCGACCT ATAGCCAGAC TTATCGCTCC TATGTGTCGC CATATCCGGT CGTAGAAACG AGCGGTAACT GGAAATATTT AAAGCTGGAC CCGGACTACC TTGAAGGCGG AATGCGAATT GAGGATTCTT CGGCGGGCGA TATCTATCCG CCAATGAACA ACGTTCTGAT GGGATATGAT GAAAATGTGA AAGCGGGTCA ACCGTTTTAC GTTCGTGACT CAAATCTGGA GTTTCAGCTC AAAATTGTTA AACCGTTCGT CGGCACGGTG AATATTAGTC CTAAGACTAT GTTCAATGTT TATGTCATGA CCGCCGCAGG CGATCCGCTG ACAGATGTCG TATACAGCAT TCTTTATAGT GGAACGGTGA CCGTACCGCA AAGCTGTGAA ATCAACGCCG GACAAACGAT TCTGGTAAAT TTCGGCGCAT TATACAGTGG CAATTTCAAC CATGCAGGCC AAAAGCCGGA GGGGGTACGA GCGAAAAAAT TCAGCGTACC GGTAAAGTGC AGCGGTCTGG ATTCGCAGGT CAATTTAACA ATGCGTCTTA TCGCTACGCC GGATAGCCAC GTTCCCCAGG CTATCGCTTC GGATAATGCC GATGTCGGCG TAGTGGTCGA AACCGATGAA GGAAACGCGC TTATCCCCAA TGATGCACAG AGCGTCGCGC CTTTTATCAC CGATAGCGCC GGACGCGCTA ACATCACATT GCAAGCCTAC CCGGTAAGTA CAACAGGCGA AACGCCTGCG GAAGGGGCGT TTACCGCACT GGCCAGCCTG CGAGTGGACT TTGACTAA
Protein sequence | MKIPLLFALL AGSVVSQYAF ADVCKNVNGV PSSINYDLTT TLMAEQNQVG KTVQLEKSQE VNVQAVCPAG ASTYSQTYRS YVSPYPVVET SGNWKYLKLD PDYLEGGMRI EDSSAGDIYP PMNNVLMGYD ENVKAGQPFY VRDSNLEFQL KIVKPFVGTV NISPKTMFNV YVMTAAGDPL TDVVYSILYS GTVTVPQSCE INAGQTILVN FGALYSGNFN HAGQKPEGVR AKKFSVPVKC SGLDSQVNLT MRLIATPDSH VPQAIASDNA DVGVVVETDE GNALIPNDAQ SVAPFITDSA GRANITLQAY PVSTTGETPA EGAFTALASL RVDFD
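The protein sequence can be checked against the gene sequence by translating it. The sketch below assumes the 10-base blocks are separated by whitespace only, and uses the codon assignments of translation table 11, which are the same as the standard code (table 11 differs in which start codons it permits). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate a whitespace-blocked DNA string, stopping at the first stop codon."""
    dna = "".join(blocked_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGAAAATAC CTCTTTTATT TGCTCTGCTG"))  # prints "MKIPLLFALL"
```

The output matches the first block of the protein sequence. Passing the full gene sequence string to `translate` should allow the same comparison over the whole record.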