Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3702 |
Symbol | |
ID | 6066149 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 4052210 |
End bp | 4053499 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641603120 |
Product | 4-hydroxyphenylacetate degradation bifunctional isomerase/decarboxylase, HpaG1 subunit |
Protein accession | YP_001726640 |
Protein GI | 170021686 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) |
TIGRFAM ID | [TIGR02303] 4-hydroxyphenylacetate degradation bifunctional isomerase/decarboxylase, C-terminal subunit [TIGR02305] 4-hydroxyphenylacetate degradation bifunctional isomerase/decarboxylase, N-terminal subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.188189 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGGCA CTATCTTCGC CGTAGCGTTG AACCATCGCA GCCAACTTGA TGCATGGCAG GAAGCGTTCC AGCAATCCCC CTACAAAGCC CCGCCTAAAA CTGCGGTCTG GTTTATTAAA CCGCGCAATA CGGTGATTGG TTGCGGTGAA CCGATTCCCT TTCCACAGGG TGAAAAGGTA CTGAGCGGTG CGACTGTTGC GCTGATTGTG GGAAAAACGG CGACGAAAGT ACGTGAAGAA GATGCGGCAG AGTACATCGC CGGATATGCG CTGGCTAACG ACGTCAGCCT GCCGGAAGAG AGCTTTTACC GCCCGGCAAT CAAAGCAAAA TGTCGTGATG GATTCTGCCC CATTGGCGAA ACCGTGGCTC TCAGCAATGT CGATAATCTG ACCATCTATA CCGAGATCAA CGGGCGTCCT GCCGATCACT GGAATACCGC CGATTTACAA CGTAACGCCG CGCAGTTGCT GAGCGCCCTG AGCGAATTTG CCACACTGAA TCCAGGCGAT GCCATTCTGC TCGGCACGCC ACAGGCGCGC GTGGAAATAC AGCCAGGCGA TCGCGTTCGT GTTCTCGCAG AAGGTTTCCC GCCGCTGGAA AATCCGGTAG TGGACGAACG TGAAGTGACC ACGCGCAAGA GCTTCCCAAC GCAGCCACAC CCGCACGGCA CGCTGTTTGC CCTCGGCCTG AACTACGCCG ACCACGCCAG CGAACTGGAA TTTAAGCCAC CGGAAGAACC GCTGGTGTTC CTGAAAGCGC CAAATACCCT CACTGGCGAT AACCAGACCT CCGTGCGTCC AAACAATATT GAATACATGC ACTATGAAGC GGAGCTGGTG GTAGTTATTG GCAAACAGGC GCGTAACGTC AGCGAAGCCG ATGCCATGGA TTATGTCGCG GGCTACACCG TGTGTAACGA CTACGCCATT CGCGACTATC TGGAAAACTA CTACCGCCCT AACCTGCGGG TAAAAAGCCG CGACGGACTG ACGCCGATGC TTTCAACCAT CGTGCCGAAA GAGGCGATCC CGGACCTGCA TAATCTGACC CTTCGCACCT TCGTCAACGG CGAGTTACGC CAGCAAGGCA CCACCGCCGA TCTGATCTTC AGCGTGCCCT TCCTGATCGC CTACTTAAGC GAATTTATGA CCCTGAATCC GGGCGACATG ATCGCCACCG GCACACCAAA AGGCTTATCT GACGTGGTGC CTGGCGATGA AGTAGTGGTG GAAGTAGAAG GCGTGGGCTG CCTGGTGAAC CGAATTGTGA GTGAGGAAAC AGCGAAATGA
|
Protein sequence | MKGTIFAVAL NHRSQLDAWQ EAFQQSPYKA PPKTAVWFIK PRNTVIGCGE PIPFPQGEKV LSGATVALIV GKTATKVREE DAAEYIAGYA LANDVSLPEE SFYRPAIKAK CRDGFCPIGE TVALSNVDNL TIYTEINGRP ADHWNTADLQ RNAAQLLSAL SEFATLNPGD AILLGTPQAR VEIQPGDRVR VLAEGFPPLE NPVVDEREVT TRKSFPTQPH PHGTLFALGL NYADHASELE FKPPEEPLVF LKAPNTLTGD NQTSVRPNNI EYMHYEAELV VVIGKQARNV SEADAMDYVA GYTVCNDYAI RDYLENYYRP NLRVKSRDGL TPMLSTIVPK EAIPDLHNLT LRTFVNGELR QQGTTADLIF SVPFLIAYLS EFMTLNPGDM IATGTPKGLS DVVPGDEVVV EVEGVGCLVN RIVSEETAK
|
| |