Gene Information |
Locus tag | ECH74115_4163 |
Symbol | hyuA |
ID | 6966881 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 3852744 |
End bp | 3854141 |
Gene Length | 1398 bp |
Protein Length | 465 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643387909 |
Product | phenylhydantoinase |
Protein accession | YP_002272348 |
Protein GI | 209396584 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0044] Dihydroorotase and related cyclic amidohydrolases |
TIGRFAM ID | [TIGR02033] D-hydantoinase |
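The coordinate and length fields above are mutually consistent, and a short check makes the arithmetic explicit. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 3852744
end_bp = 3854141

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the reported protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 1398 465
```

Both results match the Gene Length (1398 bp) and Protein Length (465 aa) fields.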
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | TTGGAGTTTG CTATGCGCGT ATTGATCAAA AACGGCACTG TCGTTAACGC AGATGGACAA GCCAAACAGG ATTTGCTGAT TGAAAGCGGG ATTGTTCGCC AGTTGGGCAA CAATATTTCG CCGCAGCTCC CGTATGAAGA AATTGATGCC ACTGGCTGTT ACGTTTTCCC TGGCGGCGTG GATGTCCATA CGCATTTCAA TATTGATGTC GGCATCGCGC GCAGTTGTGA TGATTTTTTT ACCGGTACCC GCGCAGCTGC GTGTGGCGGT ACAACAACCA TTATTGACCA TATGGGATTT GGCCCAAACG GCTGTCGGTT ACGCCATCAA CTGGAGGTTT ATCGTGGTTA TGCCGCCCAT AAAGCGGTCA TCGATTACAG CTTTCACGGT GTGATCCAGC ACATTAATCA CGCCATCCTC GACGAAATCC CGATGATGGT CGAGGAAGGG CTGAGCAGTT TTAAACTCTA TTTAACCTAT CAATACAAAC TCAACGATGA CGAGGTTTTG CAGGCATTAC GCCGTCTGCA TGAATCCGGC GCGCTGACCA CCGTGCACCC GGAAAATGAT GCGGCTATCG CCAGCAAGCG GGCGGAATTT ATCGCCGCAG GGTTAACCGC GCCGCGCTAT CATGCCTTGA GTCGCCCTCT GGAATGCGAA GCGGAAGCCA TCGCCCGCAT GATTAACCTG GCACAAATTG CCGGTAACGC CCCGCTCTAT ATCGTGCACC TGTCTAACGG CTTAGGTCTG GATTATCTGC GTCTTGCCCG TGCGAATCAC CAGCCAGTCT GGGTTGAAAC CTGCCCACAA TATCTCCTGT TGGACGAACG CAGTTACGAT ACAGAAGACG GCATGAAGTT CATTCTTAGC CCACCGCTGC GTAACGTACG CGAGCAGGAC AAACTGTGGT GTGGCATCAG CGATGGTGCG ATTGACGTGG TGGCGACCGA TCACTGCACC TTCTCGATGG CTCAACGCCT GCAAATTTCT AAAGGCGATT TCAGTCGCTG CCCAAATGGC TTACCCGGTG TGGAAAACCG CATGCAGTTA CTGTTTTCCA GTGGCGTGAT GACGGGACGT ATAACACCGG AACGCTTTGT TGAATTAACC AGCGCAATGC CCGCCAGGCT GTTTGGCCTG TGGCCGCAAA AAGGATTATT AGCGCCCGGT TCCGATGGCG ACGTGGTGAT TATCGACCCA CGTCAGAGCC AACAAATTCA GCATCGCCAT CTCCACGACA ACGCCGACTA CTCGCCATGG GAGGGTTTTA CCTGTCAGGG CGCGATTGTC AGAACCTTAT CCCGTGGTGA AACGATTTTC TGTGACGGCA CCTTTACAGG CAAAGCCGGG CGAGGTCGTT TCCTGCGACG CAAACCGTTT GTCCCTCCCG TGCTCTAA
Protein sequence | MEFAMRVLIK NGTVVNADGQ AKQDLLIESG IVRQLGNNIS PQLPYEEIDA TGCYVFPGGV DVHTHFNIDV GIARSCDDFF TGTRAAACGG TTTIIDHMGF GPNGCRLRHQ LEVYRGYAAH KAVIDYSFHG VIQHINHAIL DEIPMMVEEG LSSFKLYLTY QYKLNDDEVL QALRRLHESG ALTTVHPEND AAIASKRAEF IAAGLTAPRY HALSRPLECE AEAIARMINL AQIAGNAPLY IVHLSNGLGL DYLRLARANH QPVWVETCPQ YLLLDERSYD TEDGMKFILS PPLRNVREQD KLWCGISDGA IDVVATDHCT FSMAQRLQIS KGDFSRCPNG LPGVENRMQL LFSSGVMTGR ITPERFVELT SAMPARLFGL WPQKGLLAPG SDGDVVIIDP RQSQQIQHRH LHDNADYSPW EGFTCQGAIV RTLSRGETIF CDGTFTGKAG RGRFLRRKPF VPPVL
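The protein sequence can be related to the gene sequence by translating it with translation table 11, in which alternative start codons such as TTG are read as methionine at the first position. The following is a minimal sketch rather than the database's own method: `translate_cds`, `CODON_TABLE`, and `START_CODONS` are hypothetical names introduced here, and the input is assumed to be an in-frame coding sequence containing only A, C, G, and T.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons of translation table 11; each is read as M at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    # Remove the spaces that separate the 10-base groups in the record.
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]  # raises KeyError on non-ACGT input
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGGAGTTTG CTATGCGCGT ATTGATCAAA"))  # MEFAMRVLIK
```

The printed residues match the first ten residues of the protein sequence in the record, including the leading M produced by the TTG start codon.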