Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3710 |
Symbol | |
ID | 6064673 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 4060577 |
End bp | 4062139 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641603128 |
Product | 4-hydroxyphenylacetate 3-monooxygenase, oxygenase subunit |
Protein accession | YP_001726648 |
Protein GI | 170021694 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2368] Aromatic ring hydroxylase |
TIGRFAM ID | [TIGR02310] 4-hydroxyphenylacetate 3-monooxygenase, oxygenase component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.47122 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.996028 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACCAG AAGATTTCCG CGCCAGTACC CAACGTCCGT TCACCGGGGA AGAGTATCTG AAAAGCCTGC AGGATGGTCG CGAGATCTAT ATCTATGGCG AGCGAGTGAA AGACGTCACC ACTCATCCGG CATTTCGTAA TGCGGCAGCG TCTGTTGCCC AACTGTACGA CGCGCTACAC AAACCGGAGA TGCAGGACTC TCTGTGCTGG AACACCGACA CCGGTAGCGG CGGCTATACC CATAAATTCT TCCGCGTGGC GAAAAGTGCC GACGACCTGC GCCAGCAACG CGACGCCATC GCTGAGTGGT CACGCCTGAG CTATGGCTGG ATGGGCCGTA CCCCAGACTA CAAAGCCGCT TTCGGTTGCG CACTGGGCGC GAATCCGGGC TTTTACGGTC AGTTCGAGCA GAACGCCCGT AACTGGTATA CCCGTATTCA GGAAACTGGC CTCTACTTTA ACCACGCGAT TGTTAACCCA CCGATCGATC GTCATTTGCC GACCGATAAA GTGAAAGACG TTTACATCAA GCTGGAAAAA GAGACTGACG CCGGGATTAT CGTCAGCGGT GCGAAAGTGG TTGCCACCAA CTCGGCGCTG ACTCACTACA ACATGATTGG CTTCGGCTCG GCACAAGTGA TGGGCGAAAA CCCGGACTTC GCACTGATGT TCGTTGCGCC AATGGATGCC GATGGCGTGA AATTAATCTC CCGCGCCTCT TATGAGATGG TCGCGGGTGC TACCGGCTCG CCGTACGACT ACCCGCTCTC CAGCCGCTTC GATGAGAACG ATGCGATTCT GGTGATGGAT AACGTGCTGA TCCCATGGGA AAACGTGCTG ATCTACCGCG ATTTCGATCG CTGCCGTCGC TGGACAATGG AAGGCGGTTT TGCCCGTATG TATCCGCTGC AAGCCTGTGT GCGCCTGGCA GTGAAATTAG ACTTCATTAC GGCACTGCTG AAAAAATCAC TCGAATGTAC CGGCACCCTG GAGTTCCGTG GTGTGCAGGC CGATCTCGGT GAAGTGGTGG CGTGGCGCAA CACCTTCTGG GCATTGAGTG ACTCGATGTG TTCAGAAGCG ACGCCGTGGG TCAACGGGGC TTATTTACCG GATCATGCCG CACTGCAAAC CTATCGCGTA CTGGCACCAA TGGCCTACGC GAAGATCAAA AACATTATCG AACGCAACGT TACCAGTGGC CTGATCTATC TCCCTTCCAG TGCCCGTGAC CTGAACAATC CGCAGATCGA CCAGTATCTG GCGAAGTATG TGCGCGGTTC GAACGGTATG GATCACGTCC AGCGCATCAA GATCCTCAAA CTGATGTGGG ACGCCATTGG CAGCGAATTT GGTGGTCGTC ACGAGCTGTA TGAAATCAAC TACTCCGGTA GCCAGGATGA GATTCGCCTG CAGTGTCTGC GCCAGGCACA AAGCTCCGGC AATATGGACA AGATGATGGC GATGGTTGAT CGCTGCCTGT CGGAATACGA CCAGAACGGC TGGACTGTGC CGCACCTGCA CAACAACGAC GATATCAACA TGCTGGATAA GCTGCTGAAA TAA
|
Protein sequence | MKPEDFRAST QRPFTGEEYL KSLQDGREIY IYGERVKDVT THPAFRNAAA SVAQLYDALH KPEMQDSLCW NTDTGSGGYT HKFFRVAKSA DDLRQQRDAI AEWSRLSYGW MGRTPDYKAA FGCALGANPG FYGQFEQNAR NWYTRIQETG LYFNHAIVNP PIDRHLPTDK VKDVYIKLEK ETDAGIIVSG AKVVATNSAL THYNMIGFGS AQVMGENPDF ALMFVAPMDA DGVKLISRAS YEMVAGATGS PYDYPLSSRF DENDAILVMD NVLIPWENVL IYRDFDRCRR WTMEGGFARM YPLQACVRLA VKLDFITALL KKSLECTGTL EFRGVQADLG EVVAWRNTFW ALSDSMCSEA TPWVNGAYLP DHAALQTYRV LAPMAYAKIK NIIERNVTSG LIYLPSSARD LNNPQIDQYL AKYVRGSNGM DHVQRIKILK LMWDAIGSEF GGRHELYEIN YSGSQDEIRL QCLRQAQSSG NMDKMMAMVD RCLSEYDQNG WTVPHLHNND DINMLDKLLK
|
| |