Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0634 |
Symbol | |
ID | 6972244 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 658047 |
End bp | 659054 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 643384672 |
Product | mannose binding protein FimH homolog |
Protein accession | YP_002269186 |
Protein GI | 209395712 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3539] P pilus assembly protein, pilin FimA |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.732186 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 0.432125 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATAA TCTGCAGATT ATTATTGGCG ATGGCATGTT TGTGTCTGGC AAACATATCC TGGGCTACTG TTTGTGCAAA TAGTACTGGC GTAGCAGAAG ATGAACACTA TGATCTCTCA AATGTCTTTA ATAGCACCAA TAACCAGCCA GGGCAGATTG TTGTTTTACC GGAAAAATCC GGCTGGGTAG GTGTCTCAGC AATTTGTCCA CCCGGCACGC TGGTGAATTA TACATACCGT AGTTATGTCA CTAACTTTAT TGTTCAGGAA ACTATCGATA ATTATAAATA TATGCAATTA AATGATTATC TATTAGGTGC GATGAGTCTG GTTGATAGTG TGATGGATAT TCAATTTCCC CCGCAAAATT ATATTCGTAT GGGAACAGAT CCTAACGTTT CGCAAAACCT TCCGTTCGGG GTGATGGATT CTCGTTTGAT ATTTCGTTTA AAGGTTATTC GGCCCTTTAT TAACATGGTG GAGATCCCCA GACAGGTGAT GTTTACCGTG TATGTGACAT CAACGCCTTA CGATCCGTTG GTTACACCTG TTTATACCAT TAGTTTTGGT GGCCGGGTTG AAGTACCGCA AAACTGCGAA TTAAATGCCG GGCAGATTGT TGAATTTGAT TTTGGTGATA TCGGCGCATC GTTATTTAGT GCGGCAGGGC CGGGTAATCG GCCTGCTGGT GTCATGCCGC AAACCAAGAG CATTGCGGTC AAATGTACGA ATGTTGCTGC GCAGGCTTAT TTAACAATGC GTCTGGAAGC CAGTGCCGTT TCTGGTCAGG CGATGGTGTC GGACAATCAG GATTTAGGTT TTATTGTCGC CGATCAGAAC GATACGCCGA TCACGCCTAA CGATCTCAAT AGCGTTATTC CTTTCCGTCT GGATGCAGCT GCGGCAGCCA ATGTCACACT TCGCGCCTGG CCTATCAGTA TTACCGGTCA AAAACCGACC GAAGGGCCGT TTAGCGCGCT GGGGTATTTA CGCGTCGATT ATCAATGA
|
Protein sequence | MKIICRLLLA MACLCLANIS WATVCANSTG VAEDEHYDLS NVFNSTNNQP GQIVVLPEKS GWVGVSAICP PGTLVNYTYR SYVTNFIVQE TIDNYKYMQL NDYLLGAMSL VDSVMDIQFP PQNYIRMGTD PNVSQNLPFG VMDSRLIFRL KVIRPFINMV EIPRQVMFTV YVTSTPYDPL VTPVYTISFG GRVEVPQNCE LNAGQIVEFD FGDIGASLFS AAGPGNRPAG VMPQTKSIAV KCTNVAAQAY LTMRLEASAV SGQAMVSDNQ DLGFIVADQN DTPITPNDLN SVIPFRLDAA AAANVTLRAW PISITGQKPT EGPFSALGYL RVDYQ
|
| |