Gene Information |
Locus tag | B21_01910 |
Symbol | hisB |
ID | 8116335 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 1988584 |
End bp | 1989651 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 644848125 |
Product | hypothetical protein |
Protein accession | YP_002999698 |
Protein GI | 251785394 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0131] Imidazoleglycerol-phosphate dehydratase |
TIGRFAM ID | [TIGR01261] histidinol-phosphatase; [TIGR01656] histidinol-phosphate phosphatase family domain; [TIGR01662] HAD-superfamily hydrolase, subfamily IIIA |
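
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1988584
end_bp = 1989651

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# An unspliced CDS is read in consecutive codons of 3 bases.
# Assumption: the last codon is the stop codon and encodes no residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1068 0 355
```

Under these assumptions the result agrees with the listed Gene Length (1068 bp) and Protein Length (355 aa).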
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCAGA AGTATCTTTT TATCGATCGC GATGGAACCC TGATTAGCGA ACCGCCGAGT GATTTTCAGG TGGACCGTTT TGACAAACTC GCCTTTGAAC CGGGCGTGAT CCCGGAGCTA CTGAAGCTGC AAAAAGCGGG CTACAAGCTG GTGATGATCA CTAATCAGGA TGGTCTGGGA ACACAAAGTT TCCCGCAGGC GGATTTTGAT GGCCCGCACA ACCTGATGAT GCAGATCTTC ACCTCGCAAG GCGTGCAGTT TGATGAAGTG CTGATTTGTC CGCACCTGCC CGCCGATGAG TGCGACTGCC GTAAGCCGAA AGTAAAACTG GTGGAGCGTT ATCTCGCTGA GCAAGCGATG GATCGTGCCA ATAGTTATGT GATTGGCGAT CGCGCGACCG ATATTCAACT CGCTGAAAAC ATGGGCATTA ATGGTTTACG CTACGACCGC GAAATCCTGA ACTGGCCGAT GATTGGCGAG CAACTCACGA AACGAGACCG TTACGCCCAT GTAGTGCGCA ACACCAAAGA GACGCAAATT GACGTCCAGG TGTGGCTGGA TCGCGAAGGT GGCAGCAAGA TTAACACCGG CGTTGGCTTC TTTGATCACA TGCTGGATCA GATCGCTACC CACGGCGGTT TCCGTATGGA AATCAACGTC AAAGGCGACC TCTATATCGA CGATCACCAC ACCGTCGAAG ATACCGGTCT GGCGCTGGGC GAAGCGTTAA AAATAGCTCT TGGCGACAAA CGCGGTATTT GCCGCTTCGG TTTTGTGCTG CCGATGGACG AATGCCTTGC CCGCTGCGCG CTGGATATCT CTGGTCGCCC GCATCTGGAA TATAAAGCTG AATTTACCTA CCAGCGTGTG GGCGATCTCA GCACTGAAAT GATCGAACAC TTCTTCCGCT CACTCTCTTA CACCATGGGC GTGACGCTGC ACCTGAAAAC CAAAGGTAAA AACGATCATC ACCGTGTAGA GAGCCTGTTC AAAGCGTTTG GCCGCACCCT ACGCCAGGCC ATCCGCGTGG AAGGCGACAC CCTGCCCTCG TCGAAAGGAG TGCTGTAA
|
Protein sequence | MSQKYLFIDR DGTLISEPPS DFQVDRFDKL AFEPGVIPEL LKLQKAGYKL VMITNQDGLG TQSFPQADFD GPHNLMMQIF TSQGVQFDEV LICPHLPADE CDCRKPKVKL VERYLAEQAM DRANSYVIGD RATDIQLAEN MGINGLRYDR EILNWPMIGE QLTKRDRYAH VVRNTKETQI DVQVWLDREG GSKINTGVGF FDHMLDQIAT HGGFRMEINV KGDLYIDDHH TVEDTGLALG EALKIALGDK RGICRFGFVL PMDECLARCA LDISGRPHLE YKAEFTYQRV GDLSTEMIEH FFRSLSYTMG VTLHLKTKGK NDHHRVESLF KAFGRTLRQA IRVEGDTLPS SKGVL
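
The relationship between the gene sequence and the protein sequence can be sketched in a few lines. The example below is a minimal illustration, not the database's own pipeline: it builds the codon-to-amino-acid mapping used by translation table 11, strips the spacing used in the listing, and translates until the first stop codon. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and only the first 30 bases of the gene are used to keep the example short.

```python
# Codon-to-amino-acid mapping in the usual T, C, A, G ordering.
# Translation table 11 assigns the same amino acids as the standard code;
# it differs only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon.

    Assumption: the sequence starts in frame. An alternative start codon
    (for example GTG) is not converted to M here; this gene starts with ATG.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
gene_start = "ATGAGTCAGA AGTATCTTTT TATCGATCGC"
print(translate(gene_start))  # prints: MSQKYLFIDR
```

The printed residues match the first block of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.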