Gene YpAngola_A3172 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A3172 
SymbolhisB 
ID5801647 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp3357650 
End bp3358717 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content51% 
IMG OID641341005 
Productimidazole glycerol-phosphate dehydratase/histidinol phosphatase 
Protein accessionYP_001607532 
Protein GI162420147 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0131] Imidazoleglycerol-phosphate dehydratase
[COG0241] Histidinol phosphatase and related phosphatases 
TIGRFAM ID[TIGR01261] histidinol-phosphatase
[TIGR01656] histidinol-phosphate phosphatase family domain
[TIGR01662] HAD-superfamily hydrolase, subfamily IIIA 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value0.894788 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAGA AATTTCTTTT TATTGACCGC GACGGCACCA TCATTGCCGA GCCACCAACT 
GATTATCAGG TTGACCGGTT GGATAAACTG GCGCTGGAGC CTGATGTCAT TCCCGCATTA
CTGGCGTTGC AAAAAGCAGA CTACAAACTG GTGATGATCA CTAATCAGGA TGGCCTCGGC
ACCAGCAGTT TCCCGCAGGA AACCTTCGAT CCGCCACATA ACCTGATGAT GCAAATCCTG
ACGTCTCAGG GGATCAATTT TGAACAGATA CTGATTTGCC CACATCTGCC AGAGGATAAC
TGCACCTGTC GCAAACCGAA AACCGCGCTG GTAGAAAGCT ATCTGGCAGA TGGCGTGATG
AACAGCACCA ATAGCTATGT CATCGGTGAC CGTGAAACTG ACCTACAACT GGCCGAGAAC
ATGGGCATCA GCGGGTTACG TTATCAGCGT GATGGCTTGA ACTGGACGCA AATTGCCAAA
CAACTGACCC AGCGCGACCG CCACGCCTAT GTTAATCGCG TGACCAAAGA AACCGCCATT
GACGTTAATG TTTGGCTGGA TCGCGAAGGG GGAAGCAAAA TTAAAACCGG CGTGGGCTTC
TTCGACCATA TGCTGGATCA AATCGCCACC CACGGCGGTT TTCGCATGGA TATTCAGGTC
AGCGGCGATC TGTATATCGA TGATCACCAC ACAGTGGAAG ATACCGCGCT GGCACTGGGC
GAAGCGATCA ACATCGCACT GGGTGACAAA CGGGGTATTG GCCGCTTTGG TTTTGTATTG
CCGATGGATG AGTGCCTGGC ACGCTGTGCC TTGGATATTT CTGGTCGCCC GCATTTGGAA
TACAAAGCTG AATTTAACTA CCAGCGTGTC GGCGATCTAA GCACCGAGAT GGTCGAGCAC
TTCTTCCGCT CCCTTTCGTA TGCCATGGCC TGTACCTTGC ACCTGAAAAC CAAAGGTCGC
AACGATCATC ACCGAGTAGA AAGCCTGTTT AAAGTATTTG GTCGTACCTT GCGTCAAGCC
ATTCGCGTTG AAGGCAATAC CCTGCCAAGT TCAAAAGGAG TGCTGTAA
 
Protein sequence
MSQKFLFIDR DGTIIAEPPT DYQVDRLDKL ALEPDVIPAL LALQKADYKL VMITNQDGLG 
TSSFPQETFD PPHNLMMQIL TSQGINFEQI LICPHLPEDN CTCRKPKTAL VESYLADGVM
NSTNSYVIGD RETDLQLAEN MGISGLRYQR DGLNWTQIAK QLTQRDRHAY VNRVTKETAI
DVNVWLDREG GSKIKTGVGF FDHMLDQIAT HGGFRMDIQV SGDLYIDDHH TVEDTALALG
EAINIALGDK RGIGRFGFVL PMDECLARCA LDISGRPHLE YKAEFNYQRV GDLSTEMVEH
FFRSLSYAMA CTLHLKTKGR NDHHRVESLF KVFGRTLRQA IRVEGNTLPS SKGVL