Gene ECD_01924 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_01924 
SymbolhisB 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp1989271 
End bp1990338 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content53% 
IMG OID 
Productimidazole glycerol-phosphate dehydratase/histidinol phosphatase 
Protein accessionACT43775 
Protein GI253978105 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCAGA AGTATCTTTT TATCGATCGC GATGGAACCC TGATTAGCGA ACCGCCGAGT 
GATTTTCAGG TGGACCGTTT TGACAAACTC GCCTTTGAAC CGGGCGTGAT CCCGGAGCTA
CTGAAGCTGC AAAAAGCGGG CTACAAGCTG GTGATGATCA CTAATCAGGA TGGTCTGGGA
ACACAAAGTT TCCCGCAGGC GGATTTTGAT GGCCCGCACA ACCTGATGAT GCAGATCTTC
ACCTCGCAAG GCGTGCAGTT TGATGAAGTG CTGATTTGTC CGCACCTGCC CGCCGATGAG
TGCGACTGCC GTAAGCCGAA AGTAAAACTG GTGGAGCGTT ATCTCGCTGA GCAAGCGATG
GATCGTGCCA ATAGTTATGT GATTGGCGAT CGCGCGACCG ATATTCAACT CGCTGAAAAC
ATGGGCATTA ATGGTTTACG CTACGACCGC GAAATCCTGA ACTGGCCGAT GATTGGCGAG
CAACTCACGA AACGAGACCG TTACGCCCAT GTAGTGCGCA ACACCAAAGA GACGCAAATT
GACGTCCAGG TGTGGCTGGA TCGCGAAGGT GGCAGCAAGA TTAACACCGG CGTTGGCTTC
TTTGATCACA TGCTGGATCA GATCGCTACC CACGGCGGTT TCCGTATGGA AATCAACGTC
AAAGGCGACC TCTATATCGA CGATCACCAC ACCGTCGAAG ATACCGGTCT GGCGCTGGGC
GAAGCGTTAA AAATAGCTCT TGGCGACAAA CGCGGTATTT GCCGCTTCGG TTTTGTGCTG
CCGATGGACG AATGCCTTGC CCGCTGCGCG CTGGATATCT CTGGTCGCCC GCATCTGGAA
TATAAAGCTG AATTTACCTA CCAGCGTGTG GGCGATCTCA GCACTGAAAT GATCGAACAC
TTCTTCCGCT CACTCTCTTA CACCATGGGC GTGACGCTGC ACCTGAAAAC CAAAGGTAAA
AACGATCATC ACCGTGTAGA GAGCCTGTTC AAAGCGTTTG GCCGCACCCT ACGCCAGGCC
ATCCGCGTGG AAGGCGACAC CCTGCCCTCG TCGAAAGGAG TGCTGTAA
 
Protein sequence
MSQKYLFIDR DGTLISEPPS DFQVDRFDKL AFEPGVIPEL LKLQKAGYKL VMITNQDGLG 
TQSFPQADFD GPHNLMMQIF TSQGVQFDEV LICPHLPADE CDCRKPKVKL VERYLAEQAM
DRANSYVIGD RATDIQLAEN MGINGLRYDR EILNWPMIGE QLTKRDRYAH VVRNTKETQI
DVQVWLDREG GSKINTGVGF FDHMLDQIAT HGGFRMEINV KGDLYIDDHH TVEDTGLALG
EALKIALGDK RGICRFGFVL PMDECLARCA LDISGRPHLE YKAEFTYQRV GDLSTEMIEH
FFRSLSYTMG VTLHLKTKGK NDHHRVESLF KAFGRTLRQA IRVEGDTLPS SKGVL