Gene Information |
Locus tag | ECH74115_2955 |
Symbol | hisB |
ID | 6967220 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 2730368 |
End bp | 2731435 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643386795 |
Product | imidazole glycerol-phosphate dehydratase/histidinol phosphatase |
Protein accession | YP_002271263 |
Protein GI | 209400725 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0131] Imidazoleglycerol-phosphate dehydratase [COG0241] Histidinol phosphatase and related phosphatases |
TIGRFAM ID | [TIGR01261] histidinol-phosphatase [TIGR01656] histidinol-phosphate phosphatase family domain [TIGR01662] HAD-superfamily hydrolase, subfamily IIIA |
|
|
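The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the Gene Information table for ECH74115_2955 (hisB).
length = gene_length(2730368, 2731435)
print(length)                  # 1068
print(protein_length(length))  # 355
```

Both results agree with the Gene Length (1068 bp) and Protein Length (355 aa) fields listed for this locus.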
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.0000000167497 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGTCAGA AGTATCTTTT TATCGATCGC GATGGAACCC TGATTAGCGA ACCGCCGAGT GATTTTCAGG TGGATCGTTT TGACAAACTC GCCTTTGAAC CGGGCGTGAT CCCGCAACTG CTGAAGCTGC AAAAAGCGGG CTACAAACTG GTGATGATCA CTAATCAGGA TGGTCTGGGA ACACAAAGTT TCCCGCAGGC GGATTTCGAT GGCCCGCACA ACCTGATGAT GCAGATATTC ACCTCGCAAG GCGTACAGTT TGATGAAGTG CTGATTTGTC CGCACCTGCC CGCCGATGAG TGCGACTGCC GTAAACCGAA AGTAAAACTG GTGGAGCGTT ATCTCGCTGA GCAAGCGATG GATCGCGCCA ACAGTTATGT GATTGGCGAT CGCGCGACCG ATATTCAACT CGCTGAAAAC ATGGGCATTA ATGGTTTACG CTACGACCGC GAAACCCTGA ACTGGCCAAT GATTGGCGAG CAACTCACCA GACGTGACCG TTACGCTCAC GTAGTGCGTA ATACCAAAGA GACGCAGATT GACGTTCAGG TGTGGCTGGA TCGTGAAGGT GGCAGCAAGA TTAATACCGG CGTTGGCTTC TTTGATCACA TGCTGGATCA GATCGCCACC CACGGCGGTT TCCGTATGGA AATCAACGTC AAAGGCGACC TCTATATCGA CGATCACCAC ACCGTCGAAG ATACCGGCCT GGCGCTGGGC GAAGCGTTAA AAATTGCCCT CGGCGATAAA CGCGGTATTT GCCGCTTTGG TTTTGTGCTG CCGATGGACG AATGCCTTGC CCGCTGCGCG CTGGATATCT CTGGTCGCCC GCACCTGGAA TATAAAGCCG AGTTTACCTA CCAACGCGTG GGCGATCTCA GCACCGAGAT GATCGAGCAC TTCTTCCGTT CGCTCTCTTA CACCATGGGC GTGACGCTCC ACCTGAAAAC CAAAGGTAAA AACGATCATC ACCGTGTAGA GAGCCTGTTC AAAGCCTTTG GTCGGACCCT GCGCCAGGCC ATCCGCGTGG AAGGCGATAC CCTGCCCTCG TCGAAAGGAG TGCTGTAA
Protein sequence | MSQKYLFIDR DGTLISEPPS DFQVDRFDKL AFEPGVIPQL LKLQKAGYKL VMITNQDGLG TQSFPQADFD GPHNLMMQIF TSQGVQFDEV LICPHLPADE CDCRKPKVKL VERYLAEQAM DRANSYVIGD RATDIQLAEN MGINGLRYDR ETLNWPMIGE QLTRRDRYAH VVRNTKETQI DVQVWLDREG GSKINTGVGF FDHMLDQIAT HGGFRMEINV KGDLYIDDHH TVEDTGLALG EALKIALGDK RGICRFGFVL PMDECLARCA LDISGRPHLE YKAEFTYQRV GDLSTEMIEH FFRSLSYTMG VTLHLKTKGK NDHHRVESLF KAFGRTLRQA IRVEGDTLPS SKGVL
| |
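The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, compute GC content, and translate the gene. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ only in permitted start codons). The helper names `clean_sequence`, `gc_content`, and `translate` are hypothetical, and only the first three blocks of the gene sequence are used as input here; the full sequence from the table can be pasted in the same way.

```python
# Codon table built in NCBI order (bases T, C, A, G); amino acid
# assignments are those of translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(grouped: str) -> str:
    """Remove the whitespace between ten-character blocks and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
prefix = clean_sequence("ATGAGTCAGA AGTATCTTTT TATCGATCGC")
print(translate(prefix))  # MSQKYLFIDR
print(f"GC content of this prefix: {gc_content(prefix):.1%}")
```

The ten residues produced from the prefix match the first block of the listed protein sequence. Running `gc_content` on the full 1068 bp sequence is the way to compare against the GC content field in the Gene Information table.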