Gene Information |
Locus tag | ECH74115_0113 |
Symbol | hofB |
ID | 6968055 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 120184 |
End bp | 121569 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643384190 |
Product | hypothetical protein |
Protein accession | YP_002268713 |
Protein GI | 209396061 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB |
TIGRFAM ID | |
| |
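The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 120184
END_BP = 121569
GENE_LENGTH_BP = 1386
PROTEIN_LENGTH_AA = 461


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon (assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should print True if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the listed values agree with each other: 121569 - 120184 + 1 = 1386 bp, and 1386 / 3 - 1 = 461 aa.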
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00764351 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 68 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATATTC CACAGCTCAC GGCCCTGTGC CTGCGTTATC AGGGAGTCTT GCTGGATGCC AGCGAAGAAG TGGTTCATGT TGCGGTGGTC GATGCCCCCT CACATGAGTT GCTGGACGCA TTGCATTTCG CTACCACCAA ACGTATTGAG ATCACCTGCT GGACGCGCCA ACAAATGGAA GGTCACGCCA GTCGCACACA ACAGACATTG CCCGTAGCTG TTCAGGAGAA GCATCAGCCC AAAGCAGAGT TGCTGACTCG AACGTTACAA TCTGCGCTGG AACAACGCGC GTCTGATATT CATATCGAAC CAGCGGACAA TGCCTACCGC ATCCGCTTGC GTATCGACGG CGTATTGCAT CCTTTACCGG ATGTTTCACC GGATGCCGGA GTCGCATTAA CCGCCAGATT AAAAGTGCTG GGAAACCTGG ATATTGCGGA ACATCGCCTG CCGCAGGACG GGCAATTCAC TGTCGAACTG GCAGGAAACG CCGTCTCATT TCGTATTGCG ACCTTACCAT GTCGGGGTGG TGAAAAGGTG GTATTAAGGT TGTTACAGCA GGTGAGCCAG GCACTGGATG TTAACACGCT GGGAATGCAG CCGTTACAAC TGGCGGGCTT TGCTCATGCC TTGCAACAAC CACAGGGACT GGTGCTGGTA ACTGGCCCTA CCGGCAGCGG CAAAACGGTC ACGCTTTATA GTGCCCTGCA AAAGCTGAAT ACCGCTGACA TTAATATTTG TAGCGTCGAA GATCCGGTTG AGATCCCCAT AGCCGGACTA AACCAGACGC AAATCCATCC GCGTGCCGGA CTCACCTTTC AGGGCGTTTT GCGTGCGTTA TTGCGCCAGG ATCCTGACGT CATCATGATC GGAGAGATCC GCGATGGCGA AACGGCAGAA ATTGCCATTA AAGCCGCGCA AACCGGTCAC CTGGTGTTGT CTACCCTACA CACTAATTCC ACCTGCGAAA CGCTGGTACG TTTACAGCAA ATGGGGGTCG CCCGCTGGAT GCTATCATCG GCGCTTACGC TGGTAATAGC CCAGCGTCTG GTACGCAAAC TTTGCCCACA TTGTCGCCGG CAGCAAGGGG AGCCCATCCA CATTCCAGTC AATGTATGGC CGTCGCCGCT GCCCCACTGG CAAGCACCCG GTTGTGTACA TTGCTACCAC GGTTTTTATG GTCGCACTGC CTTATTTGAA GTTCTGCCCA TAACACCGGT CATACGTCAG CTTATTTCCG CTAATACCGA CGTTGAATCG CTGGAAACGC ACGCCCGACA GGCGGGTATG CGAACGCTTT TTGAAAACGG CTGCCTGGCC GTGGAGCAAG GCTTAACCAC CTTTGAAGAG TTAATCCGCG TATTGGGGAT GCCGCATGGC GAGTAA
Protein sequence | MNIPQLTALC LRYQGVLLDA SEEVVHVAVV DAPSHELLDA LHFATTKRIE ITCWTRQQME GHASRTQQTL PVAVQEKHQP KAELLTRTLQ SALEQRASDI HIEPADNAYR IRLRIDGVLH PLPDVSPDAG VALTARLKVL GNLDIAEHRL PQDGQFTVEL AGNAVSFRIA TLPCRGGEKV VLRLLQQVSQ ALDVNTLGMQ PLQLAGFAHA LQQPQGLVLV TGPTGSGKTV TLYSALQKLN TADINICSVE DPVEIPIAGL NQTQIHPRAG LTFQGVLRAL LRQDPDVIMI GEIRDGETAE IAIKAAQTGH LVLSTLHTNS TCETLVRLQQ MGVARWMLSS ALTLVIAQRL VRKLCPHCRR QQGEPIHIPV NVWPSPLPHW QAPGCVHCYH GFYGRTALFE VLPITPVIRQ LISANTDVES LETHARQAGM RTLFENGCLA VEQGLTTFEE LIRVLGMPHG E
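The gene sequence as listed begins with ATG and ends with TAA, so it appears to be given in coding orientation even though the gene lies on the minus strand. The sketch below shows one way to strip the 10-character grouping, estimate GC content, and translate the sequence for comparison with the protein sequence. It assumes the amino acid assignments of translation table 11 match the standard code (only the set of permitted start codons differs), and all function names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Assumption: table 11 shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group residues and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate a coding-orientation DNA sequence, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    prefix = "ATGAATATTC CACAGCTCAC GGCCCTGTGC"
    print(translate(prefix))
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence gives a quick consistency check, and `gc_fraction` on the full sequence can be compared with the GC content field.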