Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A0608 |
Symbol | |
ID | 5591970 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 621850 |
End bp | 622857 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640919792 |
Product | mannose binding protein FimH-like protein |
Protein accession | YP_001457375 |
Protein GI | 157160057 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3539] P pilus assembly protein, pilin FimA |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 43 |
Plasmid unclonability p-value | 0.565914 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGATAA TCTGCAGATT ATTATTGGCG ATGGCATGTT TGTGGTTAAC AAACATATCC TGGGCTACTG TTTGTGCAAA TAGTACTGGC GTAGCAGAAG ATGAACACTA TGATCTCTCA AATGTCTTTA ATAGCACCAA TAACCAGCCA GGGCAGATTG TTGTTTTACC GGAAAAATCC GGCTGGGTAG GTGTGTCAGC AATTTGTCCA CCCGGCACGC TGGTGAATTA TACATACCGT AGTTATGTCA CCAACTTTAT TGTTCAGGAA ACTATCGATA ATTATAAATA TATGCAATTA TATGATTATC TATTAGGTGC GATGAGTCTG GTTGATAGTG TTATGGATAT TCAGTTCCCC CCGCAAAATT ATATTCGGAT GGGAACAGAT CCTAACGTTT CGCAAAACCT TCCATTCGGG GTGATGGATT CTCGTTTAAT ATTTCGTTTA AAGGTTATTC GTCCCTTTAT TAACATAGTG GAGATCCCCA GACAGGTGAT GTTTACCGTG TATGTGACAT CAACGCCTTA CGATCCGTTG GTTACACCTG TTTATACCAT TAGTTTTGGT GGCCGGGTTG AAGTACCGCA AAACTGCGAA TTAAATGCCG GGCAGATTGT TGAATTTGAT TTTGGTGATA TCGGCGCATC GTTATTTAGT GCGGCAGGGC CGGGTAATCG ACCTGCTGGT GTCATGCCGC AAACCAAGAG CATTGCGGTC AAATGTACGA ATGTTGCTGC GCAGGCTTAT TTAACAATGC GTCTGGAAGC CAGTGCCGTT TCTGGTCAGG CGATGGTGTC GGACAATCAG GATTTAGGTT TTATTGTCGC CGATCAGAAC GATACGCCGA TTACGCCTAA CGATCTCAAT AGCGTTATTC CTTTCCGTCT GGATGCAGCT GCGGCAGCCA ATGTCACACT TCGCGCCTGG CCTATCAGTA TTACCGGTCA AAAACCGACC GAGGGGCCGT TTAGCGCGCT GGGGTATTTA CGCGTCGATT ATCAATGA
|
Protein sequence | MKIICRLLLA MACLWLTNIS WATVCANSTG VAEDEHYDLS NVFNSTNNQP GQIVVLPEKS GWVGVSAICP PGTLVNYTYR SYVTNFIVQE TIDNYKYMQL YDYLLGAMSL VDSVMDIQFP PQNYIRMGTD PNVSQNLPFG VMDSRLIFRL KVIRPFINIV EIPRQVMFTV YVTSTPYDPL VTPVYTISFG GRVEVPQNCE LNAGQIVEFD FGDIGASLFS AAGPGNRPAG VMPQTKSIAV KCTNVAAQAY LTMRLEASAV SGQAMVSDNQ DLGFIVADQN DTPITPNDLN SVIPFRLDAA AAANVTLRAW PISITGQKPT EGPFSALGYL RVDYQ
|
| |