Gene EcE24377A_2313 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_2313 
SymbolhisB 
ID5586479 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp2276131 
End bp2277198 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content54% 
IMG OID640925978 
Productimidazole glycerol-phosphate dehydratase/histidinol phosphatase 
Protein accessionYP_001463373 
Protein GI157155752 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0131] Imidazoleglycerol-phosphate dehydratase
[COG0241] Histidinol phosphatase and related phosphatases 
TIGRFAM ID[TIGR01261] histidinol-phosphatase
[TIGR01656] histidinol-phosphate phosphatase family domain
[TIGR01662] HAD-superfamily hydrolase, subfamily IIIA 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCAGA AGTATCTTTT TATCGATCGC GATGGAACCC TGATTAGCGA ACCGCCGAGT 
GATTTTCAGG TGGACCGTTT TGACAAACTC GCCTTTGAAC CGGGCGTGAT CCCGGAACTG
CTGAAGCTGC AAAAAGCGGG CTACAAGCTG GTGATGATCA CTAATCAGGA TGGTCTGGGA
ACACAAAGTT TCCCGCAGGC GGATTTTGAT GGCCCGCACA ACCTGATGAT GCAGATCTTC
ACCTCGCAAG GCGTGCAGTT TGATGAAGTG CTGATTTGTC CGCACCTGCC CGCCGATGAG
TGCGACTGCC GTAAGCCGAA AGTAAAACTG GTAGAGCGTT ATCTGGCTGA GCAAGCGATG
GATCGTGCCA ACAGTTATGT GATTGGCGAT CGCGCGACCG ACATTCAACT GGCGGAAAAC
ATGGGTATTA ATGGTTTACG CTACGACCGC GAAATCCTGA GCTGGCCGAT GATTGGCGAG
CAACTCACTA AACGAGACCG TTACGCCCAT GTAGTGCGCA ACACCAAAGA GACGCAAATT
GACGTCCAGG TGTGGCTGGA TCGCGAAGGT GGCAGCAAGA TTAATACCGG CGTTGGCTTC
TTTGATCACA TGCTGGATCA GATCGCCACC CACGGCGGTT TCCGTATGGA AATCAACGTC
AAAGGCGACC TCTATATCGA CGATCACCAC ACCGTCGAAG ATACCGGCCT GGCGCTGGGC
GAAGCGCTAA AAATCGCCCT TGGCGACAAA CGCGGTATTT GCCGCTTTGG TTTTGTGCTA
CCGATGGACG AATGCCTTGC CCGCTGCGCG CTGGATATCT CTGGTCGCCC GCACCTGGAA
TATAAAGCCG AGTTTACCTA CCAGCGCGTG GGCGATCTCA GCACCGAAAT GATCGAGCAC
TTCTTCCGTT CGCTCTCATA CACCATGGGC GTGACGCTAC ACCTGAAAAC CAAAGGTAAA
AACGATCACC ACCGTGTAGA GAGCCTGTTC AAAGCCTTTG GTCGCACCCT GCGCCAGGCC
ATCCGCGTGG AAGGCGACAC CCTGCCCTCG TCGAAAGGAG TGCTGTAA
 
Protein sequence
MSQKYLFIDR DGTLISEPPS DFQVDRFDKL AFEPGVIPEL LKLQKAGYKL VMITNQDGLG 
TQSFPQADFD GPHNLMMQIF TSQGVQFDEV LICPHLPADE CDCRKPKVKL VERYLAEQAM
DRANSYVIGD RATDIQLAEN MGINGLRYDR EILSWPMIGE QLTKRDRYAH VVRNTKETQI
DVQVWLDREG GSKINTGVGF FDHMLDQIAT HGGFRMEINV KGDLYIDDHH TVEDTGLALG
EALKIALGDK RGICRFGFVL PMDECLARCA LDISGRPHLE YKAEFTYQRV GDLSTEMIEH
FFRSLSYTMG VTLHLKTKGK NDHHRVESLF KAFGRTLRQA IRVEGDTLPS SKGVL