Gene B21_01910 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_01910 
SymbolhisB 
ID8116335 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp1988584 
End bp1989651 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content53% 
IMG OID644848125 
Producthypothetical protein 
Protein accessionYP_002999698 
Protein GI251785394 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0131] Imidazoleglycerol-phosphate dehydratase 
TIGRFAM ID[TIGR01261] histidinol-phosphatase
[TIGR01656] histidinol-phosphate phosphatase family domain
[TIGR01662] HAD-superfamily hydrolase, subfamily IIIA 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCAGA AGTATCTTTT TATCGATCGC GATGGAACCC TGATTAGCGA ACCGCCGAGT 
GATTTTCAGG TGGACCGTTT TGACAAACTC GCCTTTGAAC CGGGCGTGAT CCCGGAGCTA
CTGAAGCTGC AAAAAGCGGG CTACAAGCTG GTGATGATCA CTAATCAGGA TGGTCTGGGA
ACACAAAGTT TCCCGCAGGC GGATTTTGAT GGCCCGCACA ACCTGATGAT GCAGATCTTC
ACCTCGCAAG GCGTGCAGTT TGATGAAGTG CTGATTTGTC CGCACCTGCC CGCCGATGAG
TGCGACTGCC GTAAGCCGAA AGTAAAACTG GTGGAGCGTT ATCTCGCTGA GCAAGCGATG
GATCGTGCCA ATAGTTATGT GATTGGCGAT CGCGCGACCG ATATTCAACT CGCTGAAAAC
ATGGGCATTA ATGGTTTACG CTACGACCGC GAAATCCTGA ACTGGCCGAT GATTGGCGAG
CAACTCACGA AACGAGACCG TTACGCCCAT GTAGTGCGCA ACACCAAAGA GACGCAAATT
GACGTCCAGG TGTGGCTGGA TCGCGAAGGT GGCAGCAAGA TTAACACCGG CGTTGGCTTC
TTTGATCACA TGCTGGATCA GATCGCTACC CACGGCGGTT TCCGTATGGA AATCAACGTC
AAAGGCGACC TCTATATCGA CGATCACCAC ACCGTCGAAG ATACCGGTCT GGCGCTGGGC
GAAGCGTTAA AAATAGCTCT TGGCGACAAA CGCGGTATTT GCCGCTTCGG TTTTGTGCTG
CCGATGGACG AATGCCTTGC CCGCTGCGCG CTGGATATCT CTGGTCGCCC GCATCTGGAA
TATAAAGCTG AATTTACCTA CCAGCGTGTG GGCGATCTCA GCACTGAAAT GATCGAACAC
TTCTTCCGCT CACTCTCTTA CACCATGGGC GTGACGCTGC ACCTGAAAAC CAAAGGTAAA
AACGATCATC ACCGTGTAGA GAGCCTGTTC AAAGCGTTTG GCCGCACCCT ACGCCAGGCC
ATCCGCGTGG AAGGCGACAC CCTGCCCTCG TCGAAAGGAG TGCTGTAA
 
Protein sequence
MSQKYLFIDR DGTLISEPPS DFQVDRFDKL AFEPGVIPEL LKLQKAGYKL VMITNQDGLG 
TQSFPQADFD GPHNLMMQIF TSQGVQFDEV LICPHLPADE CDCRKPKVKL VERYLAEQAM
DRANSYVIGD RATDIQLAEN MGINGLRYDR EILNWPMIGE QLTKRDRYAH VVRNTKETQI
DVQVWLDREG GSKINTGVGF FDHMLDQIAT HGGFRMEINV KGDLYIDDHH TVEDTGLALG
EALKIALGDK RGICRFGFVL PMDECLARCA LDISGRPHLE YKAEFTYQRV GDLSTEMIEH
FFRSLSYTMG VTLHLKTKGK NDHHRVESLF KAFGRTLRQA IRVEGDTLPS SKGVL