Gene Information |
Locus tag | EcSMS35_3006 |
Symbol | hyuA |
ID | 6143431 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3088997 |
End bp | 3090394 |
Gene Length | 1398 bp |
Protein Length | 465 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641617875 |
Product | phenylhydantoinase |
Protein accession | YP_001745026 |
Protein GI | 170682443 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0044] Dihydroorotase and related cyclic amidohydrolases |
TIGRFAM ID | [TIGR02033] D-hydantoinase |
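
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative and not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the Start bp / End bp fields of this record.
start_bp, end_bp = 3088997, 3090394
length_bp = gene_length_bp(start_bp, end_bp)
length_aa = protein_length_aa(length_bp)
print(length_bp, length_aa)  # 1398 465
```

Under these assumptions the computed values agree with the Gene Length (1398 bp) and Protein Length (465 aa) fields.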
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | TTGGAGTTTG CTATGCGCGT ATTGATCAAA AACGGCACTG TCGTTAACGC AGATGGACAA GCCAAACAGG ATTTGCTGAT TGAAAGCGGG ATTGTTCGCC AGTTGGGCAA CAATATTTCG CCACAGCTCC CGTATGAAGA AATTGATGCC ACTGGCTGTT ACGTTTTCCC TGGCGGCGTG GATGTCCATA CGCATTTCAA TATTGATGTC GGCATCGCGC GCAGTTGTGA TGATTTTTTT ACCGGTACCC GCGCAGCTGC GTGTGGCGGT ACAACAACCA TTATTGACCA TATGGGATTT GGCCCGAATG GCTGCCGGTT ACGCCATCAA CTGGAGGTTT ATCGTGGTTA TGCCGCCCAT AAGGCGGTCA TCGACTACAG CTTTCACGGT GTGATCCAGC ACATTAATCA CGCCATCCTC GACGAAATTC CGATGATGGT CGAAGAAGGA CTGAGCAGTT TTAAACTCTA TTTAACCTAT CAATACAAAC TCAACGATGA CGAAGTTTTG CAGGCGTTAC GCCGCTTGCA TGAGTCCGGC GCGCTGACCA CCGTGCACCC GGAAAATGAT GCAGCTATCG CCAGCAAGCG GGCGGAGTTT ATTGCCGCAG GGTTAACCGC GCCGCGCTAT CACGCCTTGA GTCGCCCTCT GGAATGCGAA GCGGAAGCCA TCGCCCGCAT GATTAACCTG GCACAAATTG CCGGTAACGC CCCGCTCTAT ATCGTGCACC TGTCTAACGG CTTAGGTCTG GATTATCTGC GTCTTGCCCG TGCGAATCAC CAGCCAGTCT GGGTTGAAAC CTGCCCACAA TATCTCCTGT TGGACGAACG CAGTTACGAT ACAGAAGATG GCATGAAGTT CATTCTTAGC CCACCGCTGC GTAACGTACG CGAGCAGGAC AAACTGTGGT GTGGCATCAG CGATGGTGCG ATTGACGTGG TAGCGACCGA TCACTGCACA TTCTCGATGG CTCAACGCCT GCAAATTTCT AAAGGCGATT TCAGTCGCTG CCCAAATGGC TTACCCGGTG TGGAAAACCG CATGCAGTTA CTGTTTTCCA GTGGCGTTAT GACGGGACGT ATAACACCGG AACGTTTTGT TGAATTAACC AGCGCGATGC CCGCCAGGCT GTTTGGCCTG TGGCCGCAAA AAGGATTATT AGCGCCCGGT TCCGATGGCG ACGTGGTGAT TATCGACCCA CGTCAGAGTC AACAAATTCA GCATCGCCAT CTCCACGACA ACGCCGACTA CTCGCCATGG GAGGGTTTTA CCTGTCAGGG CGCGATTGTC AGAACTTTAT CCCGTGGTGA AACGATTTTC TGTGACGGCA CTTTTACAGG CAAAGCCGGG CGAGGTCGTT TCCTGCGACG CAAACCGTTT GTCCCTCCCG TGCTCTAA
Protein sequence | MEFAMRVLIK NGTVVNADGQ AKQDLLIESG IVRQLGNNIS PQLPYEEIDA TGCYVFPGGV DVHTHFNIDV GIARSCDDFF TGTRAAACGG TTTIIDHMGF GPNGCRLRHQ LEVYRGYAAH KAVIDYSFHG VIQHINHAIL DEIPMMVEEG LSSFKLYLTY QYKLNDDEVL QALRRLHESG ALTTVHPEND AAIASKRAEF IAAGLTAPRY HALSRPLECE AEAIARMINL AQIAGNAPLY IVHLSNGLGL DYLRLARANH QPVWVETCPQ YLLLDERSYD TEDGMKFILS PPLRNVREQD KLWCGISDGA IDVVATDHCT FSMAQRLQIS KGDFSRCPNG LPGVENRMQL LFSSGVMTGR ITPERFVELT SAMPARLFGL WPQKGLLAPG SDGDVVIIDP RQSQQIQHRH LHDNADYSPW EGFTCQGAIV RTLSRGETIF CDGTFTGKAG RGRFLRRKPF VPPVL
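
The gene sequence begins with TTG rather than ATG, while the protein sequence begins with M. This is consistent with translation table 11, where several alternative codons can act as initiators and are read as methionine at the first position. The sketch below illustrates that on the first 30 and last 9 nucleotides of the gene sequence. It is a minimal example, not the database's own translation code: the codon table is the standard one (table 11 uses the same amino acid assignments), the initiator set is an assumption based on the NCBI description of table 11, and `translate_cds` is a hypothetical helper.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumed initiator codons for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str, initiator: bool = True) -> str:
    """Translate a DNA fragment in frame 1, stopping at the first stop codon.

    If `initiator` is True and the first codon is a recognised start codon,
    it is translated as M regardless of its usual meaning.
    """
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-mers
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and initiator and codon in START_CODONS:
            aa = "M"
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


head = "TTGGAGTTTG CTATGCGCGT ATTGATCAAA"  # first 30 nt of the gene sequence
tail = "GTGCTCTAA"                         # last 9 nt of the gene sequence

print(translate_cds(head))                   # MEFAMRVLIK
print(translate_cds(tail, initiator=False))  # VL
```

Both fragments match the corresponding ends of the protein sequence listed above (MEFAMRVLIK at the N-terminus, VL at the C-terminus).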