Gene YpsIP31758_4085 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_4085 
SymbolhutH 
ID5386549 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp4607354 
End bp4608886 
Gene Length1533 bp 
Protein Length510 aa 
Translation table11 
GC content51% 
IMG OID640867114 
Producthistidine ammonia-lyase 
Protein accessionYP_001403029 
Protein GI153946871 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2986] Histidine ammonia-lyase 
TIGRFAM ID[TIGR01225] histidine ammonia-lyase 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGACAA TAACACTACG TCCTGGTCAG ATGACGCTGG CTGACTTACG GCATATTTAT 
CAACATCCCG TACATATCAC ATTGGATGAA AGTGCCTATG TACCCATTCA GCAAAGTGTG
GATTGTGTGC AAGCCATATT GGCAGAACAG CGCACGGCAT ATGGCATTAA CACTGGGTTT
GGCTTGCTGG CCTCTACCCG TATCGCCACC GAAGACTTGG AAAACTTACA GCGCTCAATC
GTACTCTCTC ACGCGGCAGG AGTCGGAGAA GCGAATGATG ATGCGATTGT GCGTCTGATT
ATGGTGCTGA AAATCAATAG CCTGGCGAGA GGTTTCTCAG GTATTCGGCT GGAGGTGATT
CAGGCGCTGA TTACCTTGGT CAATGCTGGG GTTTATCCGC ATATCCCGTT AAAAGGATCA
GTGGGCGCTT CTGGCGATTT AGCTCCGCTG GCACATATGA GCTTGCTGCT ATTAGGTGAA
GGAAAAGCCC GCTATCAGGG TGAATGGTTG CCCGCACACA CGGCACTGGC GCAAGCGGGT
TTGCAGCCCC TCACACTGGC GGCGAAAGAG GGTTTGGCAC TACTTAACGG CACCCAGGTC
TCTGCCGCTT ATGCATTGCG TGGTTTATTT GAGGCCGAAG ATCTCTATGC GGCCGCTTCG
GTGTTTGGCT GCCTGACAGT GGATGCAGCA TTAGGATCCC GTAGCCCATT TGACGCCCGT
ATTCACGCCG TTCGGGGCCA ACGTGGGCAG ATTGATGCTG CCAGCACTTA TCGTCATCTG
CTTGGTGAAC GCAGTGAAAT CTCAGAATCA CACAAGAATT GTGACAAAGT GCAGGATCCA
TATTCTTTAC GCTGTCAGCC ACAGGTGATG GGCGCATGTT TAGGCCAAAT ACGTCAGGCG
GCAGAGGTGC TGGCTATTGA ATCTAATGCC GTTTCAGATA ACCCGTTGGT GTTTGCTGAA
CAGGGTGATG TCTTGTCTGG TGGGAATTTC CATGCTGAAC CGGTCGCTAT GGCAGCAGAT
AATCTGGCGT TGGCGTTGGC AGAAATCGGT TCATTATCAG AGTGCCGTAT CTCGTTGATG
ATGGACAAGC ATATGTCTCA GTTACCTCCA TTTCTGGTAG AGAACGGTGG CGTAAATTCT
GGCTTTATGA TTGCTCAGGT TACGGCTGCG GCGTTAACCA GTGAAAATAA AGGGCTGGCA
TTCCCCGCCA GTGTCGATAG CATCCCAACA TCTGCTAATC AGGAAGATCA TGTCTCTATG
GCCCCTCGGG CGGGTAAACG CTTGTGGGAA ATGGCTGAAA ATGTACGGAA TATACTGGCT
ATCGAGTGGC TGGCTGCGTG TCAGGGGCTT GATTTGCGCA AAGGGCTAAG AACTTCCGCC
ATACTGGAGC CCGCCCGCCA ACTATTACGC CAGCACGTCA CTTACTACGA TAAAGATCGT
TTCTTTGCCC CCGATATTGA AGTTGCTAGC CAGCTTATTG CACAACGTCA TATGAATGAG
TTGATACCAG CAAAATTACT GCCAAGTCTT TAA
 
Protein sequence
MKTITLRPGQ MTLADLRHIY QHPVHITLDE SAYVPIQQSV DCVQAILAEQ RTAYGINTGF 
GLLASTRIAT EDLENLQRSI VLSHAAGVGE ANDDAIVRLI MVLKINSLAR GFSGIRLEVI
QALITLVNAG VYPHIPLKGS VGASGDLAPL AHMSLLLLGE GKARYQGEWL PAHTALAQAG
LQPLTLAAKE GLALLNGTQV SAAYALRGLF EAEDLYAAAS VFGCLTVDAA LGSRSPFDAR
IHAVRGQRGQ IDAASTYRHL LGERSEISES HKNCDKVQDP YSLRCQPQVM GACLGQIRQA
AEVLAIESNA VSDNPLVFAE QGDVLSGGNF HAEPVAMAAD NLALALAEIG SLSECRISLM
MDKHMSQLPP FLVENGGVNS GFMIAQVTAA ALTSENKGLA FPASVDSIPT SANQEDHVSM
APRAGKRLWE MAENVRNILA IEWLAACQGL DLRKGLRTSA ILEPARQLLR QHVTYYDKDR
FFAPDIEVAS QLIAQRHMNE LIPAKLLPSL