Gene EcHS_A0111 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0111 
SymbolhofB 
ID5593055 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp117353 
End bp118738 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content54% 
IMG OID640919299 
Producthypothetical protein 
Protein accessionYP_001456894 
Protein GI157159576 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.0000253922 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATATTC CACAGCTCAC TGCCCTGTGT CTGCGTTATC ATGGAGTCTT GCTGGATGCC 
AGCGAAGAGG TGGTTCATGT TGCGGTAGTC GATGCACCTT CGCATGAGCT ACTGGACGCA
TTGCATTTCG CTACCACCAA ACGTATTGAG ATCACCTGCT GGACGCGCCA ACAAATGGAA
GGTCACGCCA GTCGCACACA ACAGACATTG CCCGTAGCTG TTCAGGAGAA GCATCAGCCC
AAAGCAGAGT TGCTAACTCG AACGTTACAA TCTGCGCTGG AACAACGCGC GTCTGATATT
CATATCGAAC CAGCGGACAA TGCCTACCGC ATCCGCTTGC GTATCGACGG CGTATTGCAT
CCTTTACCGG ATGTTTCACC GGATGCCGGA GTCGCATTAA CCGCCAGATT AAAAGTGCTG
GGAAACCTGG ATATTGCGGA ACATCGCCTG CCGCAGGACG GGCAATTCAC TGTCGAACTG
GCAGGAAACG CCGTCTCATT TCGTATTGCG ACCTTACCAT GTCGGGGTGG TGAAAAGGTG
GTATTAAGGT TGTTACAGCA GGTGAGCCAG GCACTGGATG TCAACACGCT TGGAATGCAG
CCGTTACAAC TGGCGGACTT TGCTCATGCC TTGCAACAAC CACAGGGACT GGTGCTGGTA
ACTGGCCCTA CAGGCAGCGG CAAAACGGTC ACGCTTTATA GTGCCCTGCA AACGCTGAAT
ACCGCTGACA TTAATATTTG TAGCGTCGAA GATCCGGTTG AGATCCCCAT AGCCGGACTA
AACCAGACGC AAATCCATCC GCGTGCCGGA CTCACCTTTC AGGGCGTGTT GCGTGCGTTA
TTGCGCCAGG ATCCTGACGT CATCATGATC GGAGAGATCC GCGATGGCGA AACAGCAGAG
ATCGCTATTA AAGCGGCGCA AACTGGTCAC CTGGTGTTGT CTACCCTACA CACTAATTCC
ACCTGCGAAA CGCTGGTACG TTTACAGCAA ATGGGGGTCG CCCGCTGGAT GCTATCATCG
GCGCTTACGC TGGTAATAGC CCAGCGTCTG GTACGCAAAC TTTGCCCACA TTGTCGCCGG
CAGCAAGGGG AGCCCATCCA CATTCCAGAC AATGTATGGC CATCGCCGCT GCCCCACTGG
CAGGCACCCG GTTGTGTACA TTGCTACCAC GGTTTTTATG GTCGTACGGC CTTATTTGAA
GTTCTGCCCA TAACGCCGGT CATTCGTCAG CTTATTTCCG CTAATACCGA CGTTGAATCG
CTGGAAACGC ACGCACGACA GGCGGGTATG CGTACGCTTT TTGAAAACGG CTGCCTGGCC
GTGGAGCAAG GCTTAACCAC CTTTGAAGAG TTAATCCGCG TACTGGGGAT GCCGCATGGC
GAGTAA
 
Protein sequence
MNIPQLTALC LRYHGVLLDA SEEVVHVAVV DAPSHELLDA LHFATTKRIE ITCWTRQQME 
GHASRTQQTL PVAVQEKHQP KAELLTRTLQ SALEQRASDI HIEPADNAYR IRLRIDGVLH
PLPDVSPDAG VALTARLKVL GNLDIAEHRL PQDGQFTVEL AGNAVSFRIA TLPCRGGEKV
VLRLLQQVSQ ALDVNTLGMQ PLQLADFAHA LQQPQGLVLV TGPTGSGKTV TLYSALQTLN
TADINICSVE DPVEIPIAGL NQTQIHPRAG LTFQGVLRAL LRQDPDVIMI GEIRDGETAE
IAIKAAQTGH LVLSTLHTNS TCETLVRLQQ MGVARWMLSS ALTLVIAQRL VRKLCPHCRR
QQGEPIHIPD NVWPSPLPHW QAPGCVHCYH GFYGRTALFE VLPITPVIRQ LISANTDVES
LETHARQAGM RTLFENGCLA VEQGLTTFEE LIRVLGMPHG E