Gene EcHS_A2161 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2161 
SymbolhisB 
ID5594694 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2139323 
End bp2140390 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content54% 
IMG OID640921294 
Productimidazole glycerol-phosphate dehydratase/histidinol phosphatase 
Protein accessionYP_001458833 
Protein GI157161515 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0131] Imidazoleglycerol-phosphate dehydratase
[COG0241] Histidinol phosphatase and related phosphatases 
TIGRFAM ID[TIGR01261] histidinol-phosphatase
[TIGR01656] histidinol-phosphate phosphatase family domain
[TIGR01662] HAD-superfamily hydrolase, subfamily IIIA 


Plasmid Coverage information

Num covering plasmid clones67 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCAGA AGTATCTTTT TATCGATCGC GATGGAACCC TGATTAGCGA ACCGCCGAGT 
GATTTTCAGG TGGACCGTTT TGACAAACTC GCCTTTGAAC CGGGCGTGAT CCCGGAGCTG
CTGAAGCTGC AAAAAGCGGG CTACAAGCTG GTGATGATCA CTAATCAGGA TGGTCTGGGA
ACACAAAGTT TCCCGCAGGC GGATTTTGAT GGCCCGCACA ACCTGATGAT GCAGATCTTC
ACCTCGCAAG GCGTGCAGTT TGATGAAGTG CTGATTTGTC CGCACCTGCC CGCCGATGAG
TGCGACTGCC GTAAGCCGAA AGTAAAACTG GTGGAGCGTT ATCTCGCTGA GCAAGCGATG
GATCGTGCCA ATAGTTATGT GATTGGCGAT CGCGCGACCG ATATTCAGCT GGCGGAAAAC
ATGGGCATTA ATGGTTTACG CTACGACCGT GAAACCCTGA ACTGGCCGAT GATTGGCGAG
CAACTCACTA AACGAGACCG TTACGCCCAT GTAGTGCGCA ACACCAAAGA GACGCAGATT
GACGTTCAGG TGTGGCTGGA TCGCGAAGGT GGCAGCAAGA TTAACACCGG CGTTGGCTTC
TTTGATCATA TGCTGGATCA GATCGCCACT CACGGCGGTT TCCGCATGGA AATCAACGTC
AAAGGCGACC TCTATATCGA CGATCACCAC ACCGTCGAAG ATACCGGCCT GACGCTGGGC
GAAGCGTTAA AAATTGCCCT CGGCGATAAA CGCGGTATTT GTCGCTTTGG TTTTGTGCTA
CCGATGGACG AATGCCTTGC CCGCTGCGCG CTGGATATCT CTGGTCGCCC GCACCTGGAA
TATAAAGCCG AGTTTACCTA CCAGCGCGTG GGCGATCTCA GCACCGAGAT GATCGAGCAC
TTCTTCCGTT CGCTCTCTTA CACCATGGGC GTGACGCTCC ACCTGAAAAC CAAAGGTAAA
AACGATCACC ACCGTGTAGA GAGCCTGTTC AAAGCCTTTG GTCGCACCCT GCGCCAGGCC
ATCCGCGTGG AAGGCGACAC CCTGCCCTCG TCGAAAGGAG TGCTGTAA
 
Protein sequence
MSQKYLFIDR DGTLISEPPS DFQVDRFDKL AFEPGVIPEL LKLQKAGYKL VMITNQDGLG 
TQSFPQADFD GPHNLMMQIF TSQGVQFDEV LICPHLPADE CDCRKPKVKL VERYLAEQAM
DRANSYVIGD RATDIQLAEN MGINGLRYDR ETLNWPMIGE QLTKRDRYAH VVRNTKETQI
DVQVWLDREG GSKINTGVGF FDHMLDQIAT HGGFRMEINV KGDLYIDDHH TVEDTGLTLG
EALKIALGDK RGICRFGFVL PMDECLARCA LDISGRPHLE YKAEFTYQRV GDLSTEMIEH
FFRSLSYTMG VTLHLKTKGK NDHHRVESLF KAFGRTLRQA IRVEGDTLPS SKGVL