Gene YpsIP31758_2431 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_2431 
SymbolhisB 
ID5387852 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp2740380 
End bp2741447 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content51% 
IMG OID640865422 
Productimidazole glycerol-phosphate dehydratase/histidinol phosphatase 
Protein accessionYP_001401400 
Protein GI153950620 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0131] Imidazoleglycerol-phosphate dehydratase
[COG0241] Histidinol phosphatase and related phosphatases 
TIGRFAM ID[TIGR01261] histidinol-phosphatase
[TIGR01656] histidinol-phosphate phosphatase family domain
[TIGR01662] HAD-superfamily hydrolase, subfamily IIIA 


Plasmid Coverage information

Num covering plasmid clones64 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCAGA AATTTCTTTT TATTGACCGC GACGGCACCA TCATTGCCGA GCCACCAACT 
GATTATCAGG TTGACCGGTT GGATAAACTG GCGCTGGAGC CTGATGTCAT TCCCGCATTG
CTGGCGTTGC AAAAAGCAGA CTACAAACTG GTGATGATCA CTAATCAGGA TGGCCTCGGC
ACCAGCAGTT TCCCGCAGGA AACCTTCGAT CCGCCACATA ACCTGATGAT GCAAATCCTG
ACGTCTCAGG GGATCAATTT TGAACAGATA CTGATTTGCC CACATCTGCC AGCCGATAAC
TGCACCTGTC GCAAACCGAA AACCGCGCTG GTAGAAAGCT ATCTGGCAGA CGGCGTGATG
AACAGTGCCA CTAGCTATGT CATCGGTGAC CGTGAAACTG ACCTACAACT GGCCGAGAAC
ATGGGTATCA GCGGGTTACG TTATCAGCGT GATGGCTTGA ACTGGACGCA AATTGCCAAA
CAACTGACCC AGCGCGACCG CCACGCCTAT GTTAATCGCG TGACCAAAGA AACCGCCATT
GACGTTAATG TTTGGCTGGA TCGCGAAGGG GGAAGCAAAA TTAAAACCGG CGTGGGCTTC
TTCGACCATA TGCTGGATCA AATCGCCACC CACGGCGGTT TTCGCATGGA TATTCAGGTC
AGCGGCGATC TGTATATCGA TGATCACCAC ACAGTGGAAG ATACCGCGCT GGCACTGGGC
GAAGCGATCA ACATCGCACT GGGTGACAAA CGGGGTATTG GCCGCTTTGG TTTTGTATTG
CCGATGGATG AGTGCCTGGC ACGCTGTGCC TTGGATATTT CTGGTCGCCC GCATTTGGAA
TACAAAGCTG AATTTAACTA CCAGCGTGTC GGCGATCTAA GCACCGAGAT GGTCGAGCAC
TTCTTCCGCT CCCTTTCGTA TGCCATGGCC TGTACCTTGC ACCTGAAAAC CAAAGGTCGC
AACGATCATC ACCGAGTAGA AAGCCTGTTT AAAGTATTTG GTCGTACCTT GCGTCAAGCC
ATTCGGGTTG AAGGCAATAC CCTGCCAAGT TCAAAAGGAG TGCTGTAA
 
Protein sequence
MSQKFLFIDR DGTIIAEPPT DYQVDRLDKL ALEPDVIPAL LALQKADYKL VMITNQDGLG 
TSSFPQETFD PPHNLMMQIL TSQGINFEQI LICPHLPADN CTCRKPKTAL VESYLADGVM
NSATSYVIGD RETDLQLAEN MGISGLRYQR DGLNWTQIAK QLTQRDRHAY VNRVTKETAI
DVNVWLDREG GSKIKTGVGF FDHMLDQIAT HGGFRMDIQV SGDLYIDDHH TVEDTALALG
EAINIALGDK RGIGRFGFVL PMDECLARCA LDISGRPHLE YKAEFNYQRV GDLSTEMVEH
FFRSLSYAMA CTLHLKTKGR NDHHRVESLF KVFGRTLRQA IRVEGNTLPS SKGVL