Gene Information |
Locus tag | SeHA_C0657 |
Symbol | |
ID | 6492134 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | + |
Start bp | 657106 |
End bp | 658113 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642740915 |
Product | mannose binding protein FimH |
Protein accession | YP_002044582 |
Protein GI | 194449122 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3539] P pilus assembly protein, pilin FimA |
TIGRFAM ID | |
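The length fields in this table can be cross-checked against the coordinates. The sketch below is a minimal illustration, assuming that Start bp and End bp are 1-based and inclusive and that the CDS ends in a single stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 657106
END_BP = 658113


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints: 1008 335
```

Both results agree with the Gene Length (1008 bp) and Protein Length (335 aa) rows.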
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.301836 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 85 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATAT ACTCAGCGCT ATTGCTGGCG GGGACCGCGC TCTTTTTCAC CCATCCCGCG CTGGCGACGG TTTGCCGTAA TTCAAACGGG ACGGCGACCG ATATCTTTTA CGACCTGTCA GATGTTTTCA CCAGTGGCAA TAATCAGCCG GGACAGGTGG TTACGCTGCC GGAAAAATCT GGTTGGGTCG GCGTAAACGC GACGTGCCCG GCGGGGACAA CGGTAAATTA TACCTACCGA AGCTATGTAT CAGAATTACC GGTACAAAGC ACCGAAGGAA ATTTTAAATA CCTCAAGCTG AATGACTACC TTCTGGGCGC GATGAGCATC ACCGATAGTG TCGCTGGCGT ATTTTATCCG CCCCGTAACT ATATTCTCAT GGGCGTCGAC TCTAACGTGT CGCAGCAAAG ACCGTTTGGC GTGCAGGACT CAAAGCTGGT TTTTAAATTA AAAGTGATAC GGCCTTTTAT TAATATGGTG ACGATCCCTC GCCAGACAAT GTTTACTGTC TATGTGACGA CCTCTACCGG CGACGCGTTG AGCACGCCGG TGTATACCAT TAGCTACAGC GGCAAAGTGG AAGTACCGCA AAACTGCGAA GTGAATGCCG GACAGGTCGT GGAGTTTGAT TTCGGCGATA TCGGCGCGTC GTTATTTAGT CAGGCGGGGG CGGGTAATCG TCCGCAAGGC GTCACGCCGC AAACGAAAAC TATCGCTATT AAATGTACCA ACGTCGCGGC GCAGGCCTAT TTATCTATGC GGCTTGAAGC CGAAAAGGCC TCAGGGCAGG CGATGGTGTC CGATAATCCG GATTTAGGCT TTGTGGTTGC TAATAGCAAC GGTACGCCGC TCACACCCAA TAATTTGTCG AGTAAAATTC CGTTTCATCT TGATGATAAC GCCGCCGCTC GCGTAGGTAT TCGCGCCTGG CCGATCAGCG TGACGGGGAA TAAACCGGTG GAAGGGCCGT TTACTGCGCG CGGCTATCTA CGAGTCGATT ATGATTAA
|
Protein sequence | MKIYSALLLA GTALFFTHPA LATVCRNSNG TATDIFYDLS DVFTSGNNQP GQVVTLPEKS GWVGVNATCP AGTTVNYTYR SYVSELPVQS TEGNFKYLKL NDYLLGAMSI TDSVAGVFYP PRNYILMGVD SNVSQQRPFG VQDSKLVFKL KVIRPFINMV TIPRQTMFTV YVTTSTGDAL STPVYTISYS GKVEVPQNCE VNAGQVVEFD FGDIGASLFS QAGAGNRPQG VTPQTKTIAI KCTNVAAQAY LSMRLEAEKA SGQAMVSDNP DLGFVVANSN GTPLTPNNLS SKIPFHLDDN AAARVGIRAW PISVTGNKPV EGPFTARGYL RVDYD
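The sequences above are printed in space-separated blocks of ten. The following sketch shows one way to strip that grouping, compute a GC fraction, and translate the opening of the gene. It is illustrative only: `clean_sequence`, `gc_content`, and `translate` are hypothetical helper names, and the codon assignments used are those of the standard code, which translation table 11 shares (table 11 differs in its permitted start codons).

```python
from itertools import product

# Standard codon assignments in NCBI "TCAG" order; table 11 uses the same ones.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed in this record.
prefix = clean_sequence("ATGAAAATAT ACTCAGCGCT ATTGCTGGCG")
print(translate(prefix))  # prints: MKIYSALLLA
```

The translated prefix matches the first block of the protein sequence. Applying `gc_content` to the full cleaned gene sequence would give the value to compare against the GC content row.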