Gene ECH74115_2955 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2955 
SymbolhisB 
ID6967220 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2730368 
End bp2731435 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content53% 
IMG OID643386795 
Productimidazole glycerol-phosphate dehydratase/histidinol phosphatase 
Protein accessionYP_002271263 
Protein GI209400725 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0131] Imidazoleglycerol-phosphate dehydratase
[COG0241] Histidinol phosphatase and related phosphatases 
TIGRFAM ID[TIGR01261] histidinol-phosphatase
[TIGR01656] histidinol-phosphate phosphatase family domain
[TIGR01662] HAD-superfamily hydrolase, subfamily IIIA 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.0000000167497 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTCAGA AGTATCTTTT TATCGATCGC GATGGAACCC TGATTAGCGA ACCGCCGAGT 
GATTTTCAGG TGGATCGTTT TGACAAACTC GCCTTTGAAC CGGGCGTGAT CCCGCAACTG
CTGAAGCTGC AAAAAGCGGG CTACAAACTG GTGATGATCA CTAATCAGGA TGGTCTGGGA
ACACAAAGTT TCCCGCAGGC GGATTTCGAT GGCCCGCACA ACCTGATGAT GCAGATATTC
ACCTCGCAAG GCGTACAGTT TGATGAAGTG CTGATTTGTC CGCACCTGCC CGCCGATGAG
TGCGACTGCC GTAAACCGAA AGTAAAACTG GTGGAGCGTT ATCTCGCTGA GCAAGCGATG
GATCGCGCCA ACAGTTATGT GATTGGCGAT CGCGCGACCG ATATTCAACT CGCTGAAAAC
ATGGGCATTA ATGGTTTACG CTACGACCGC GAAACCCTGA ACTGGCCAAT GATTGGCGAG
CAACTCACCA GACGTGACCG TTACGCTCAC GTAGTGCGTA ATACCAAAGA GACGCAGATT
GACGTTCAGG TGTGGCTGGA TCGTGAAGGT GGCAGCAAGA TTAATACCGG CGTTGGCTTC
TTTGATCACA TGCTGGATCA GATCGCCACC CACGGCGGTT TCCGTATGGA AATCAACGTC
AAAGGCGACC TCTATATCGA CGATCACCAC ACCGTCGAAG ATACCGGCCT GGCGCTGGGC
GAAGCGTTAA AAATTGCCCT CGGCGATAAA CGCGGTATTT GCCGCTTTGG TTTTGTGCTG
CCGATGGACG AATGCCTTGC CCGCTGCGCG CTGGATATCT CTGGTCGCCC GCACCTGGAA
TATAAAGCCG AGTTTACCTA CCAACGCGTG GGCGATCTCA GCACCGAGAT GATCGAGCAC
TTCTTCCGTT CGCTCTCTTA CACCATGGGC GTGACGCTCC ACCTGAAAAC CAAAGGTAAA
AACGATCATC ACCGTGTAGA GAGCCTGTTC AAAGCCTTTG GTCGGACCCT GCGCCAGGCC
ATCCGCGTGG AAGGCGATAC CCTGCCCTCG TCGAAAGGAG TGCTGTAA
 
Protein sequence
MSQKYLFIDR DGTLISEPPS DFQVDRFDKL AFEPGVIPQL LKLQKAGYKL VMITNQDGLG 
TQSFPQADFD GPHNLMMQIF TSQGVQFDEV LICPHLPADE CDCRKPKVKL VERYLAEQAM
DRANSYVIGD RATDIQLAEN MGINGLRYDR ETLNWPMIGE QLTRRDRYAH VVRNTKETQI
DVQVWLDREG GSKINTGVGF FDHMLDQIAT HGGFRMEINV KGDLYIDDHH TVEDTGLALG
EALKIALGDK RGICRFGFVL PMDECLARCA LDISGRPHLE YKAEFTYQRV GDLSTEMIEH
FFRSLSYTMG VTLHLKTKGK NDHHRVESLF KAFGRTLRQA IRVEGDTLPS SKGVL