Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YpAngola_A4138 |
Symbol | hutU |
ID | 5802618 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pestis Angola |
Kingdom | Bacteria |
Replicon accession | NC_010159 |
Strand | - |
Start bp | 4428226 |
End bp | 4429932 |
Gene Length | 1707 bp |
Protein Length | 568 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641341914 |
Product | urocanate hydratase |
Protein accession | YP_001608418 |
Protein GI | 162418746 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2987] Urocanate hydratase |
TIGRFAM ID | [TIGR01228] urocanate hydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.13459 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.0575164 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAAGGAAA TTGCCGTGAC GACTCAAAAC AGATTCCGAG ATAATGAGAT TCGAGCCCCG CAGGGTACGC AACTGACAGC GAAAAGCTGG CTGACTGAAG CCGCGCTACG CATGCTGATG AATAACCTCG ATCCTGACGT GGCTGAGAAT CCAAAAGAAT TAGTGGTCTA CGGCGGTATT GGTCGCGCAG CCCGTAACTG GGAATGCTAT GACAAGATTG TTGAAAGCCT GATCAATTTA AACGATGACG AAACTTTGTT AATTCAATCG GGTAAGCCTG TCGGGATATT CAAAACCCAC AGTAATGCGC CCAGGGTATT GATTGCCAAC TCGAATTTGG TACCTCATTG GGCTAATTGG GAACATTTTA ATGAATTGGA CGCCAAAGGG CTGGCTATGT ATGGCCAGAT GACGGCGGGC AGTTGGATCT ATATTGGTAG CCAAGGCATT GTACAGGGTA CTTATGAAAC CTTTGTTGAA GCGGGTCGCC AGCATTTTGG CGGTAGCCTG AAAGGGCGTT GGGTCTTGAC TGCTGGGCTA GGAGGAATGG GCGGCGCGCA ACCTTTGGCC GCAACGTTAG CTGGTGCATG TTCTCTGAAC ATCGAATGCC AACAAAGCCG CATCGATTTT CGTCTCAAAA CCCGTTATGT GGATGAGCAA GCAACCGATC TGGATGATGC TTTAGCGCGC ATCGAGAAAT ATACCGCTAC AGGTGTCGCG GTTTCTATTG CACTGTGCGG CAATGCGGCT GAAATCTTAC CTGAGCTGGT GCGCCGTGGT GTTCGGCCTG ATATGGTCAC CGACCAAACC AGTGCTCATG ATCCATTGAA CGGTTATCTG CCGAAGGGTT GGAATTGGGA AGAGTACCGC CAACGCGCTC AACATGAGCC AGCGCTGGTG ATCAATGCCG CGAAAATCTC CATGGCAGAG CATGTTGAAG CGATGTTAGC CTTCCACAAC ATGGGTATCC CAACCTTTGA TTACGGCAAT AATATCCGTC AAATGGCCCA CGATATGGGG GTTATTCGTG CCTTTGATTT CCCCGGTTTT GTTCCGGCGT ATATTCGTCC TCTTTTTTGT CGTGGTATTG GCCCATTCCG TTGGGTCGCG TTGTCGGGTA ACCCAGACGA TATTTATAAA ACCGATGCTA AGGTCAAAGC ACTGATCCCT GATGATGCAC ATTTGCATCA TTGGCTAGAT ATGGCGCGTG AGCGTATTCG TTTTCAGGGG CTGCCAGCAC GTATTTGCTG GGTTGGTCTA GGCCAGCGTG CCAAATTAGG TTTGGCATTT AACGAAATGG TGCGCAGCGG CGAGCTCTCT GCGCCCGTTG TGATTGGCCG CGATCATCTG GATTCTGGAT CGGTTGCCAG CCCTAATCGT GAAACGGAAG CGATGCAGGA TGGCTCCGAT GCGGTGTCTG ACTGGCCGCT GCTCAATGCA TTACTGAATA CGGCTAGCGG TGCGACGTGG GTATCTCTGC ATCATGGTGG TGGCGTAGGG ATGGGCTTCT CGCAACATTC AGGCATGGTG GTGGTTTGTG ATGGCAGTGA TGAAGCCGCT GAACGTATTG CCAGAGTACT ACATAACGAT CCGGCTACGG GTGTGATGCG CCATGCAGAT GCGGGTTATG ACATTGCGGT TAACTGCGCG CAAGAGCAAG GACTTAACCT ACCAATGGTT GCCGCAACTC AGGGGAAAAA ATCATGA
|
Protein sequence | MKEIAVTTQN RFRDNEIRAP QGTQLTAKSW LTEAALRMLM NNLDPDVAEN PKELVVYGGI GRAARNWECY DKIVESLINL NDDETLLIQS GKPVGIFKTH SNAPRVLIAN SNLVPHWANW EHFNELDAKG LAMYGQMTAG SWIYIGSQGI VQGTYETFVE AGRQHFGGSL KGRWVLTAGL GGMGGAQPLA ATLAGACSLN IECQQSRIDF RLKTRYVDEQ ATDLDDALAR IEKYTATGVA VSIALCGNAA EILPELVRRG VRPDMVTDQT SAHDPLNGYL PKGWNWEEYR QRAQHEPALV INAAKISMAE HVEAMLAFHN MGIPTFDYGN NIRQMAHDMG VIRAFDFPGF VPAYIRPLFC RGIGPFRWVA LSGNPDDIYK TDAKVKALIP DDAHLHHWLD MARERIRFQG LPARICWVGL GQRAKLGLAF NEMVRSGELS APVVIGRDHL DSGSVASPNR ETEAMQDGSD AVSDWPLLNA LLNTASGATW VSLHHGGGVG MGFSQHSGMV VVCDGSDEAA ERIARVLHND PATGVMRHAD AGYDIAVNCA QEQGLNLPMV AATQGKKS
|
| |