Gene YpAngola_A4137 details


Gene Information

Locus tag: YpAngola_A4137
Symbol: hutH
ID: 5802617
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 4426697
End bp: 4428229
Gene Length: 1533 bp
Protein Length: 510 aa
Translation table: 11
GC content: 51%
IMG OID: 641341913
Product: histidine ammonia-lyase
Protein accession: YP_001608417
Protein GI: 162419144
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2986] Histidine ammonia-lyase
TIGRFAM ID: [TIGR01225] histidine ammonia-lyase
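
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 4426697
end_bp = 4428229

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the Gene Length (1533 bp) and Protein Length (510 aa) fields.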


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.0926785
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.0735815
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGACAA TAACACTACG TCCTGGTCAG ATGACGCTGG CTGACTTACG GCATATTTAT 
CAACATCCCG TACATATCAC ATTGGATGAA AGTGCCTATG TACCCATTCA GCAAAGTGTG
GATTGTGTGC AAGCCATATT GGCAGAACAG CGCACGGCAT ATGGCATTAA CACTGGGTTT
GGCTTGCTGG CCTCTACCCG TATCGCCACC GAAGACTTGG AAAACTTACA GCGCTCAATC
GTACTCTCTC ACGCGGCAGG AGTCGGAGAA GCGAATGATG ATGCGATTGT GCGTCTGATT
ATGGTGCTGA AAATCAATAG CCTGGCGAGA GGTTTCTCAG GTATTCGGCT GGAGGTGATT
CAGGCGCTGA TTACCTTGGT CAATGCTGGG GTTTATCCGC ATATCCCGTT AAAAGGATCA
GTGGGCGCTT CTGGCGATTT AGCTCCGCTG GCACATATGA GCTTGCTGCT ATTAGGTGAA
GGAAAAGCCC GCTATCAGGG TGAATGGTTG CCCGCACACA CGGCACTGGC GCAAGCGGGT
TTGCAGCCCC TCACACTGGC GGCGAAAGAG GGTTTGGCAC TACTTAACGG CACCCAGGTC
TCTGCCGCTT ATGCATTGCG TGGTTTATTT GAGGCCGAAG ATCTCTATGC GGCCGCTTCG
GTGTTTGGCT GCCTGACAGT GGATGCAGCA TTAGGATCCC GTAGCCCATT TGACGCCCGT
ATTCACGCCG TTCGGGGCCA ACGTGGGCAG ATTGATGCTG CCAGCACTTA TCGTCATCTG
CTTGGTGAAC GCAGTGAAAT CTCAGAATCA CACAAGAATT GTGACAAAGT GCAGGATCCA
TATTCTTTAC GCTGTCAGCC ACAGGTGATG GGCGCATGTT TAGGCCAAAT ACGTCAGGCG
GCAGAGGTGC TGGCTATTGA ATCCAATGCC GTTTCAGATA ACCCGTTGGT GTTTGCTGAA
CAGGGTGATG TCTTGTCTGG TGGGAATTTC CATGCTGAAC CGGTCGCTAT GGCAGCAGAT
AATCTGGCGT TGGCGTTGGC AGAAGTCGGT TCATTATCAG AGTGCCGTAT CTCGTTGATG
ATGGACAAGC ATATGTCTCA GTTACCTCCA TTTCTGGTAG AGAACGGTGG CGTAAATTCT
GGTTTTATGA TTGCTCAGGT TACGGCTGCG GCGTTAACCA GTGAAAATAA AGGGCTGGCA
TTCCCCGCCA GTGTCGATAG CATCCCAACA TCTGCTAATC AGGAAGATCA TGTCTCTATG
GCCCCTCGGG CGGGTAAACG CTTGTGGGAA ATGGCTGAAA ATGTACGGAA TATACTGGCT
ATCGAGTGGC TGGCTGCGTG TCAGGGGCTT GATTTGCGCA AAGGGCTAAG AACTTCCGCC
ATACTGGAGC CCGCCCGCCA ACTATTACGC CAGCACGTCA CTTACTACGA TAAAGATCGT
TTCTTTGCCC CCGATATTGA AGTTGCTAGC CAGCTTATTG CACAACGTCA TATGAATGAG
TTGATGCCAG CAAAATTACT GCCAAGTCTT TAA
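
The gene sequence is displayed in space-separated blocks of ten bases, so it needs to be cleaned before any analysis. The following is a minimal sketch, not the database's own method: the helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first displayed line is used as sample input. Running the same functions on the full sequence would be the way to compare against the GC content field of the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = "ATGAAGACAA TAACACTACG TCCTGGTCAG ATGACGCTGG CTGACTTACG GCATATTTAT"
first_line_clean = clean_sequence(first_line)

print(len(first_line_clean), "bases")
print(round(gc_percent(first_line_clean), 1), "% GC in this line only")
```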
 
Protein sequence
MKTITLRPGQ MTLADLRHIY QHPVHITLDE SAYVPIQQSV DCVQAILAEQ RTAYGINTGF 
GLLASTRIAT EDLENLQRSI VLSHAAGVGE ANDDAIVRLI MVLKINSLAR GFSGIRLEVI
QALITLVNAG VYPHIPLKGS VGASGDLAPL AHMSLLLLGE GKARYQGEWL PAHTALAQAG
LQPLTLAAKE GLALLNGTQV SAAYALRGLF EAEDLYAAAS VFGCLTVDAA LGSRSPFDAR
IHAVRGQRGQ IDAASTYRHL LGERSEISES HKNCDKVQDP YSLRCQPQVM GACLGQIRQA
AEVLAIESNA VSDNPLVFAE QGDVLSGGNF HAEPVAMAAD NLALALAEVG SLSECRISLM
MDKHMSQLPP FLVENGGVNS GFMIAQVTAA ALTSENKGLA FPASVDSIPT SANQEDHVSM
APRAGKRLWE MAENVRNILA IEWLAACQGL DLRKGLRTSA ILEPARQLLR QHVTYYDKDR
FFAPDIEVAS QLIAQRHMNE LMPAKLLPSL
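
The relationship between the two sequences can be illustrated by translating the first codons of the gene sequence. This is a sketch under stated assumptions: the codon table is built by hand from the standard amino acid assignments, which translation table 11 shares with the standard code for internal codons, and the helper name `translate` is hypothetical. Alternative start codons, which table 11 allows, are not handled here.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: amino acid assignments match the standard code, as they do
# for translation table 11; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a cleaned DNA sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, with block spacing removed.
first_codons = "ATGAAGACAATAACACTACGTCCTGGTCAG"
print(translate(first_codons))
```

The result can be compared with the first ten residues of the protein sequence shown above.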