Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E4890 |
Symbol | hpaG |
ID | 6271551 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 4562575 |
End bp | 4563864 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641728622 |
Product | 4-hydroxyphenylacetate degradation bifunctional isomerase/decarboxylase |
Protein accession | YP_001883016 |
Protein GI | 187731132 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) |
TIGRFAM ID | [TIGR02303] 4-hydroxyphenylacetate degradation bifunctional isomerase/decarboxylase, C-terminal subunit [TIGR02305] 4-hydroxyphenylacetate degradation bifunctional isomerase/decarboxylase, N-terminal subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 44 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGGCA CTATCTTCGC CGTAGCGTTG AACCATCGCA GCCAACTTGA TGCATGGCAG GAAGCGTTCC AGCAATCCCC CTACAAAGCC CCGCCTAAAA CTGCGGTCTG GTTTATTAAA CCGCGCAATA CGGTGATTGG TTGCGGTGAA CCGATTCCCT TTCCACAGGG TGAAAAGGTA CTGAGCGGTG CGACTGTTGC GCTGATTGTG GGAAAAACGG CGACGAAAGT ACGTGAAGAA GATGCGGCAG AGTACATCGC CGGATATGCG CAGGCTAACG ACGTCAGCCT GCCGGAAGAG AGCTTTTACC GCCCGGCAAT CAAAGCAAAA TGTCGTGATG GATTCTGCCC CATTGGCGAA ACCGTGGCTC TCAGCAATGT CGATAATCTG ACCATCTATA CCGAGATCAA CGGGCGTCCT GCCGATCACT GGAATACCGC CGATTTACAA CGTAACGCCG CGCAGTTGCT GAGCGCCCTG AGCGAATTTG CCACGCTGAA TCCAGGCGAC GCCATTCTGC TCGGCACGCC ACAGGCGCGC GTGGAAATAC AGCCAGGCGA GCGCGTTCGT GTTCTCGCAG AAGGTTTCCC GCCGCTGGAA AATCCGGTAG TGGACGAACG TGAAGTGACC ACGCGCAAGA GCTTCCCAAC GCAGCCACAC CCGCACGGCA CGCTGTTTGC CCTCGGCCTG AACTACGCCG ACCACGCCAG CGAACTGGAA TTTAAGCCAC CGGAAGAACC GCTGGTGTTC CTGAAAGCGC CAAATACCCT CACTGGCGAT AACCAGACCT CCGTGCGTCC AAACAATATT GAATACATGC ACTATGAAGC GGAGCTGGTG GTAGTTATTG GCAAACAGGC GCGTAACGTC AGCGAAGCCG ATGCCATGGA TTATGTCGCG GGCTACACCG TGTGTAACGA CTACGCCATT CGTGACTATC TGGAAAACTA CTACCGCCCT AACCTGCGGG TAAAAAGCCG CGACGGACTG ACGCCGATGC TTTCAACCAT CGTGCCGAAA GAGGCGATCC CGGACCCGCA TAATCTGACC CTTCGCACCT TCGTCAACGG CGAGTTACGC CAGCAAGGCA CCACCGCCGA TCTGATCTTC AGCGTGCCCT TCCTGATCGC CTACTTAAGT GAATTTATGA CCCTGAATCC GGGCGACATG ATCGCCACCG GCACACCCAA AGGCTTATCT GACGTGGTGC CTGGCGATGA AGTAGTGGTG GAAGTAGAAG GCGTGGGCCG CCTGGTGAAC CGAATTGTGA GTGAGGATAC AGCGAAATGA
|
Protein sequence | MKGTIFAVAL NHRSQLDAWQ EAFQQSPYKA PPKTAVWFIK PRNTVIGCGE PIPFPQGEKV LSGATVALIV GKTATKVREE DAAEYIAGYA QANDVSLPEE SFYRPAIKAK CRDGFCPIGE TVALSNVDNL TIYTEINGRP ADHWNTADLQ RNAAQLLSAL SEFATLNPGD AILLGTPQAR VEIQPGERVR VLAEGFPPLE NPVVDEREVT TRKSFPTQPH PHGTLFALGL NYADHASELE FKPPEEPLVF LKAPNTLTGD NQTSVRPNNI EYMHYEAELV VVIGKQARNV SEADAMDYVA GYTVCNDYAI RDYLENYYRP NLRVKSRDGL TPMLSTIVPK EAIPDPHNLT LRTFVNGELR QQGTTADLIF SVPFLIAYLS EFMTLNPGDM IATGTPKGLS DVVPGDEVVV EVEGVGRLVN RIVSEDTAK
|
| |