Gene SbBS512_E1210 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E1210 
SymbolhisB 
ID6270862 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp1116938 
End bp1118005 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content54% 
IMG OID641725341 
Productimidazole glycerol-phosphate dehydratase/histidinol phosphatase 
Protein accessionYP_001879855 
Protein GI187733376 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0131] Imidazoleglycerol-phosphate dehydratase
[COG0241] Histidinol phosphatase and related phosphatases 
TIGRFAM ID[TIGR01261] histidinol-phosphatase
[TIGR01656] histidinol-phosphate phosphatase family domain
[TIGR01662] HAD-superfamily hydrolase, subfamily IIIA 


Plasmid Coverage information

Num covering plasmid clones50 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCAGA AGTATCTTTT TATCGATCGC GATGGAACCC TGATTAGCGA ACCGCCGAGT 
GATTTTCAGG TGGACCGTTT TGACAAACTC GCCTTTGAAC CAGGCGTGAT CCCGGAACTG
CTGAAGCTGC AAAAAGCGGG CTATAAACTG GTGATGATCA CCAACCAGGA TGGTCTGGGG
ACGCAAAGTT TCCCGCAGGC GAATTTCGAT GGCCCGCACA ACCTGATGAT GCAGATCTTC
ACCTCGCAAG GCGTGCAGTT TGATGAAGTG CTGATTTGCC CGCACCTGCC CGCCGATGAA
TGCGACTGCC GTAAGCCGAA AGTAAAACTG GTGGAGCGTT ATCTCGCTGA GCAAGCGATG
GATCGCGCCA ACAGTTATGT GATTGGCGAT CGCGCGACCG ACATTCAACT GGCGGAAAAC
ATGGGCATTA CTGGTTTACG CTACGACCGC GAAACCCTGA ACTGGCCAAT GATTGGCGAG
CAACTCACCA GACGTGACCG TTACGCTCAC GTAGTGCGTA ATACCAAAGA GACGCAGATT
GACGTTCAGG TGTGGCTGGA TCGTGAAGGT GGCAGCAAGA TTAACACCGG CGTTGGCTTC
TTTGATCATA TGCTGGATCA GATCGCTACC CACGGCGGTT TCCGCATGGA AATCAACGTC
AAAGGCGACC TCTATATCGA CGATCACCAC ACCGTCGAAG ATACCGGCCT GGCGCTGGGC
GAAGCGTTAA AAATTGCCCT CGGCGATAAA CGCGGTATTT GTCGCTTTGG TTTTGTGCTA
CCGATGGACG AATGCCTTGC CCGCTGCGCG CTGGATATCT CTGGTCGCCC GCACCTGGAA
TATAAAGCCG AGTTTACCTA CCAGCGCGTG GGCGATCTCA GCACCGAGAT GATCGAGCAC
TTCTTCCGTT CGCTCTCTTA CACCATGGGC GTGACGCTAC ACCTGAAAAC CAAAGGTAAA
AACGATCACC ACCGTGTAGA GAGCCTGTTC AAAGCCTTTG GTCGCACCCT GCGCCAGGCC
ATCCGCGTGG AAGGCGACAC CCTGCCCTCG TCGAAAGGAG TGCTGTAA
 
Protein sequence
MSQKYLFIDR DGTLISEPPS DFQVDRFDKL AFEPGVIPEL LKLQKAGYKL VMITNQDGLG 
TQSFPQANFD GPHNLMMQIF TSQGVQFDEV LICPHLPADE CDCRKPKVKL VERYLAEQAM
DRANSYVIGD RATDIQLAEN MGITGLRYDR ETLNWPMIGE QLTRRDRYAH VVRNTKETQI
DVQVWLDREG GSKINTGVGF FDHMLDQIAT HGGFRMEINV KGDLYIDDHH TVEDTGLALG
EALKIALGDK RGICRFGFVL PMDECLARCA LDISGRPHLE YKAEFTYQRV GDLSTEMIEH
FFRSLSYTMG VTLHLKTKGK NDHHRVESLF KAFGRTLRQA IRVEGDTLPS SKGVL