Gene YpAngola_A4138 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A4138 
SymbolhutU 
ID5802618 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp4428226 
End bp4429932 
Gene Length1707 bp 
Protein Length568 aa 
Translation table11 
GC content51% 
IMG OID641341914 
Producturocanate hydratase 
Protein accessionYP_001608418 
Protein GI162418746 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2987] Urocanate hydratase 
TIGRFAM ID[TIGR01228] urocanate hydratase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.13459 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.0575164 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAGGAAA TTGCCGTGAC GACTCAAAAC AGATTCCGAG ATAATGAGAT TCGAGCCCCG 
CAGGGTACGC AACTGACAGC GAAAAGCTGG CTGACTGAAG CCGCGCTACG CATGCTGATG
AATAACCTCG ATCCTGACGT GGCTGAGAAT CCAAAAGAAT TAGTGGTCTA CGGCGGTATT
GGTCGCGCAG CCCGTAACTG GGAATGCTAT GACAAGATTG TTGAAAGCCT GATCAATTTA
AACGATGACG AAACTTTGTT AATTCAATCG GGTAAGCCTG TCGGGATATT CAAAACCCAC
AGTAATGCGC CCAGGGTATT GATTGCCAAC TCGAATTTGG TACCTCATTG GGCTAATTGG
GAACATTTTA ATGAATTGGA CGCCAAAGGG CTGGCTATGT ATGGCCAGAT GACGGCGGGC
AGTTGGATCT ATATTGGTAG CCAAGGCATT GTACAGGGTA CTTATGAAAC CTTTGTTGAA
GCGGGTCGCC AGCATTTTGG CGGTAGCCTG AAAGGGCGTT GGGTCTTGAC TGCTGGGCTA
GGAGGAATGG GCGGCGCGCA ACCTTTGGCC GCAACGTTAG CTGGTGCATG TTCTCTGAAC
ATCGAATGCC AACAAAGCCG CATCGATTTT CGTCTCAAAA CCCGTTATGT GGATGAGCAA
GCAACCGATC TGGATGATGC TTTAGCGCGC ATCGAGAAAT ATACCGCTAC AGGTGTCGCG
GTTTCTATTG CACTGTGCGG CAATGCGGCT GAAATCTTAC CTGAGCTGGT GCGCCGTGGT
GTTCGGCCTG ATATGGTCAC CGACCAAACC AGTGCTCATG ATCCATTGAA CGGTTATCTG
CCGAAGGGTT GGAATTGGGA AGAGTACCGC CAACGCGCTC AACATGAGCC AGCGCTGGTG
ATCAATGCCG CGAAAATCTC CATGGCAGAG CATGTTGAAG CGATGTTAGC CTTCCACAAC
ATGGGTATCC CAACCTTTGA TTACGGCAAT AATATCCGTC AAATGGCCCA CGATATGGGG
GTTATTCGTG CCTTTGATTT CCCCGGTTTT GTTCCGGCGT ATATTCGTCC TCTTTTTTGT
CGTGGTATTG GCCCATTCCG TTGGGTCGCG TTGTCGGGTA ACCCAGACGA TATTTATAAA
ACCGATGCTA AGGTCAAAGC ACTGATCCCT GATGATGCAC ATTTGCATCA TTGGCTAGAT
ATGGCGCGTG AGCGTATTCG TTTTCAGGGG CTGCCAGCAC GTATTTGCTG GGTTGGTCTA
GGCCAGCGTG CCAAATTAGG TTTGGCATTT AACGAAATGG TGCGCAGCGG CGAGCTCTCT
GCGCCCGTTG TGATTGGCCG CGATCATCTG GATTCTGGAT CGGTTGCCAG CCCTAATCGT
GAAACGGAAG CGATGCAGGA TGGCTCCGAT GCGGTGTCTG ACTGGCCGCT GCTCAATGCA
TTACTGAATA CGGCTAGCGG TGCGACGTGG GTATCTCTGC ATCATGGTGG TGGCGTAGGG
ATGGGCTTCT CGCAACATTC AGGCATGGTG GTGGTTTGTG ATGGCAGTGA TGAAGCCGCT
GAACGTATTG CCAGAGTACT ACATAACGAT CCGGCTACGG GTGTGATGCG CCATGCAGAT
GCGGGTTATG ACATTGCGGT TAACTGCGCG CAAGAGCAAG GACTTAACCT ACCAATGGTT
GCCGCAACTC AGGGGAAAAA ATCATGA
 
Protein sequence
MKEIAVTTQN RFRDNEIRAP QGTQLTAKSW LTEAALRMLM NNLDPDVAEN PKELVVYGGI 
GRAARNWECY DKIVESLINL NDDETLLIQS GKPVGIFKTH SNAPRVLIAN SNLVPHWANW
EHFNELDAKG LAMYGQMTAG SWIYIGSQGI VQGTYETFVE AGRQHFGGSL KGRWVLTAGL
GGMGGAQPLA ATLAGACSLN IECQQSRIDF RLKTRYVDEQ ATDLDDALAR IEKYTATGVA
VSIALCGNAA EILPELVRRG VRPDMVTDQT SAHDPLNGYL PKGWNWEEYR QRAQHEPALV
INAAKISMAE HVEAMLAFHN MGIPTFDYGN NIRQMAHDMG VIRAFDFPGF VPAYIRPLFC
RGIGPFRWVA LSGNPDDIYK TDAKVKALIP DDAHLHHWLD MARERIRFQG LPARICWVGL
GQRAKLGLAF NEMVRSGELS APVVIGRDHL DSGSVASPNR ETEAMQDGSD AVSDWPLLNA
LLNTASGATW VSLHHGGGVG MGFSQHSGMV VVCDGSDEAA ERIARVLHND PATGVMRHAD
AGYDIAVNCA QEQGLNLPMV AATQGKKS