Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0110 |
Symbol | hofB |
ID | 6145896 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 122230 |
End bp | 123615 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641615011 |
Product | hypothetical protein |
Protein accession | YP_001742227 |
Protein GI | 170681105 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.157109 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.596737 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATATTC CACAGCTCAC CGCCCTGTGT CTGCGTTATC AGGGAGTCTT GCTGGATGCC AGCGAAGAGG TGGTTCATGT TGCGGTAGTC GATGCACCTT CGCATGAGCT ACTGGACGCA TTGCATTTCG CTACCACCAA ACGTATTGAG ATCACCTGCT GGACGCGCCA ACAAATGGAA GGTCACGCCA GTCGCACACA ACAGACATTG CCCGTAGCTG TTCAGGAGAA GCATCAGCCC AAAGCAGAGT TGCTGACTAG AACGTTACAG TCTGCGCTGG AACAACGCGC GTCTGATATT CATATCGAAC CAGCGGACAA TGCCTACCGC ATCCGCTTGC GTATCGACGG CGTATTGCAT CCTTTACCGG TCGTTTCACC GGATGCCGGA GTCGCATTAA CCGCCAGATT AAAAGTGCTG GGAAATCTGG ATATTGCGGA ACATCGCCTG CCGCAGGACG GGCAATTCAC TGTCGAACTG GCAGGAAACG CCGTCTCATT TCGTATTGCG ACCTTACCAT GTCGGGGTGG TGAAAAGGTG GTATTAAGGT TGTTACAGCA GGTGAGCCAG GCACTGGATG TTAACACGCT GGGAATGCAG CCGTTACAAC TGGCGGACTT TGCTCATGCA TTGCAACAAC CACAGGGATT GGTGCTGGTA ACTGGCCCTA CAGGCAGCGG CAAAACGGTC ACGCTTTATA GTGCCCTGCA AACGCTGAAT ACCGCTGACA TTAATATTTG TAGCGTCGAA GATCCGGTTG AGATCCCCAT AGCCGGACTA AACCAGACGC AAATCCATTC GCGTGCCGGA CTCACCTTTC AGGGCGTTTT GCGTGCGTTA TTGCGCCAGG ATCCTGACGT CATCATGATC GGAGAGATCC GCGATGGCGA AACGGCAGAA ATTGCCATTA AAGCCGCGCA AACCGGTCAC CTGGTGTTGT CTACCCTACA CACTAATTCC ACCTGCGAAA CGCTGGTACG TTTACAGCAA ATGGGAGTCG CCCGCTGGAT GCTATCATCG GCGCTTACGC TGGTAATAGC CCAGCGTCTG GTACGCAAAC TTTGCCCGCA TTGTCGCCGG CAGCAAGGGG AGCCCATCCA TATTCCAGAC AATGTATGGC CGTCGCCGCT GCCCCACTGG CAGGCACCCG GTTGTGTACA TTGCTACCAC GGTTTTTATG GTCGTACGGC CTTATTTGAA GTTCTACCCA TAACACCGAT CATACGCCAG CTTATTTCCG CTAATACCGA CGTTGAATCG CTGGAAACAC ACGCCCGACA GGCGGGTATG CGTACGCTTT TTGAAAACGG CTGCCTGGCC GTGGAGCAAG GCTTAACCAC CTTTGAAGAG TTAATCCGCG TATTGGGGAT GCCGCATGGC GAGTAA
|
Protein sequence | MNIPQLTALC LRYQGVLLDA SEEVVHVAVV DAPSHELLDA LHFATTKRIE ITCWTRQQME GHASRTQQTL PVAVQEKHQP KAELLTRTLQ SALEQRASDI HIEPADNAYR IRLRIDGVLH PLPVVSPDAG VALTARLKVL GNLDIAEHRL PQDGQFTVEL AGNAVSFRIA TLPCRGGEKV VLRLLQQVSQ ALDVNTLGMQ PLQLADFAHA LQQPQGLVLV TGPTGSGKTV TLYSALQTLN TADINICSVE DPVEIPIAGL NQTQIHSRAG LTFQGVLRAL LRQDPDVIMI GEIRDGETAE IAIKAAQTGH LVLSTLHTNS TCETLVRLQQ MGVARWMLSS ALTLVIAQRL VRKLCPHCRR QQGEPIHIPD NVWPSPLPHW QAPGCVHCYH GFYGRTALFE VLPITPIIRQ LISANTDVES LETHARQAGM RTLFENGCLA VEQGLTTFEE LIRVLGMPHG E
|
| |