Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A4576 |
Symbol | hpaB |
ID | 5595197 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 4584194 |
End bp | 4585756 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640923670 |
Product | 4-hydroxyphenylacetate 3-monooxygenase, oxygenase component |
Protein accession | YP_001461110 |
Protein GI | 157163792 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2368] Aromatic ring hydroxylase |
TIGRFAM ID | [TIGR02310] 4-hydroxyphenylacetate 3-monooxygenase, oxygenase component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 58 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACCAG AAGATTTCCG CGCCAGTACC CAACGTCCGT TCACCGGGGA AGAGTATCTG AAAAGCCTGC AGGATGGTCG CGAGATCTAT ATCTATGGCG AGCGAGTGAA AGACGTCACT ACTCATCCGG CATTTCGTAA TGCGGCAGCG TCTGTTGCCC AACTGTACGA CGCGCTGCAC AAACCGGAGA TGCAGGACTC TCTGTGCTGG AACACCGACA CCGGCAGCGG CGGCTATACC CATAAATTCT TCCGCGTGGC GAAAAGTGCC GACGACCTGC GCCAGCAACG CGACGCCATC GCTGAGTGGT CACGCCTGAG CTATGGCTGG ATGGGCCGTA CCCCAGACTA CAAAGCTGCT TTCGGTTGCG CACTGGGCGC GAATCCGGGC TTTTACGGTC AGTTCGAGCA GAACGCCCGT AACTGGTACA CCCGTATTCA GGAAACTGGC CTCTACTTTA ACCACGCGAT TGTTAACCCA CCGATCGATC GTCATTTGCC GACCGATAAA GTAAAAGACG TTTACATCAA GCTGGAAAAA GAGACTGACG CCGGGATTAT CGTCAGCGGT GCGAAAGTGG TTGCCACCAA CTCGGCGCTG ACTCACTACA ACATGATTGG CTTCGGCTCG GCACAAGTGA TGGGCGAAAA CCCGGACTTC GCACTGATGT TCGTTGCGCC AATGGATGCC GATGGCGTGA AATTAATCTC CCGCGCCTCT TATGAGATGG TCGCGGGTGC TACCGGCTCG CCATACGACT ACCCGCTCTC CAGCCGCTTC GATGAGAACG ATGCGATTCT GGTGATGGAT AACGTGCTGA TCCCATGGGA AAACGTACTG ATCTACCGCG ATTTCGATCG CTGCCGTCGC TGGACAATGG AAGGCGGTTT TGCCCGTATG TATCCGCTGC AAGCCTGTGT GCGCCTGGCA GTGAAACTCG ACTTCATTAC AGCACTGCTG AAAAAATCAC TCGAATGTAC CGGCACCCTG GAGTTCCGTG GTGTACAGGC CGATCTCGGT GAAGTGGTGG CGTGGCGCAA CACCTTCTGG GCATTGAGTG ACTCGATGTG TTCTGAAGCG ACGCCGTGGG TCAACGGGGC TTATTTACCG GATCATGCCG CACTGCAAAC CTATCGCGTA CTGGCACCAA TGGCCTACGC GAAGATCAAA AACATTATCG AACGCAACGT TACCAGTGGC CTGATCTATC TCCCTTCCAG TGCCCGTGAC CTGAACAATC CGCAGATCGA CCAGTATCTG GCGAAGTATG TGCGCGGTTC GAATGGTATG GATCACGTCC AGCGCATCAA GATCCTCAAA CTGATGTGGG ACGCCATTGG CAGCGAGTTT GGTGGTCGCC ACGAACTGTA TGAAATCAAC TACTCCGGTA GCCAGGATGA GATTCGCCTA CAGTGTCTGC GCCAGGCACA AAGCTCCGGC AATATGGACA AGATGATGGC GATGGTTGAT CGCTGCCTGT CGGAATACGA CCAGAACGGC TGGACTGTGC CGCACTTGCA CAACAACGAC GATATCAACA TGCTGGATAA GCTGCTGAAA TAA
|
Protein sequence | MKPEDFRAST QRPFTGEEYL KSLQDGREIY IYGERVKDVT THPAFRNAAA SVAQLYDALH KPEMQDSLCW NTDTGSGGYT HKFFRVAKSA DDLRQQRDAI AEWSRLSYGW MGRTPDYKAA FGCALGANPG FYGQFEQNAR NWYTRIQETG LYFNHAIVNP PIDRHLPTDK VKDVYIKLEK ETDAGIIVSG AKVVATNSAL THYNMIGFGS AQVMGENPDF ALMFVAPMDA DGVKLISRAS YEMVAGATGS PYDYPLSSRF DENDAILVMD NVLIPWENVL IYRDFDRCRR WTMEGGFARM YPLQACVRLA VKLDFITALL KKSLECTGTL EFRGVQADLG EVVAWRNTFW ALSDSMCSEA TPWVNGAYLP DHAALQTYRV LAPMAYAKIK NIIERNVTSG LIYLPSSARD LNNPQIDQYL AKYVRGSNGM DHVQRIKILK LMWDAIGSEF GGRHELYEIN YSGSQDEIRL QCLRQAQSSG NMDKMMAMVD RCLSEYDQNG WTVPHLHNND DINMLDKLLK
|
| |