Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3270 |
Symbol | gtdA |
ID | 6971767 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3003416 |
End bp | 3004444 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643387083 |
Product | gentisate 1,2-dioxygenase |
Protein accession | YP_002271547 |
Protein GI | 209399809 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3435] Gentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR02272] gentisate 1,2-dioxygenase |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 0.395363 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGATA ACAATCAAAA TAGCCGCGAG CAATTTTATC AGCATATTTC CGGGCAAAAC CTGACCCCGC TGTGGGAGTC ACTGCACCAT CTGGTGCCGA AAACGCCCAA CGCTAACTGT GCGCCAGCGT ACTGGAATTA TCAGGAGATC CGCCCGCTGC TACTGGAGAG CGGCGGCCTG ATTGGTGCGA AAGAGGCCGT GCGCCGCGTG CTGGTGCTGG AAAACCCGGC GCTGCGCGGG CAGTCTTCCA TTACCGCGAC TTTATATGCC GGTTTGCAAC TGATCATGCC GGGCGAAGTG GCACCGAGTC ATCGTCATAA TCAGTCGGCG CTGCGTTTTA TTGTTGAAGG CAAAGGGGCA TTTACCGCCG TTGATGGCGA ACGCACGCCA ATGAATGAAG GCGATTTTAT CCTGACCCCG CAGTGGCGCT GGCACGATCA CGGTAACCCT GGCGACGAAC CGGTTATCTG GCTCGATGGG CTGGATCTGC CATTAGTGAA TATTCTGGGC TGCGGTTTTG CCGAAGATTA TCCGGAAGAG CAACAACCGG TAACGCGTAA AGAGGGAGAT TATCTGCCGC GTTACGCTGC CAATATGTTG CCGCTGCGGC ATCAGACGGG GAACTCCTCG CCGATCTTTA ACTATCGTTA TGACCGCAGC CGCGAAGTGC TGCACGATTT AACCCGACTG GGCGATGCCG ACGAGTGGGA TGGCTACAAA ATGCGCTACG TCAACCCGGT CACCGGCGGC TATCCGATGC CGTCGATGGG CGCTTTCCTG CAACTGCTGC CGAAAGGGTT TGCCTCACGC GTTGCGCGTA CCACTGACAG CACCATCTAC CATGTGGTGG AAGGTAGCGG GCAGGTCATC ATCGGCAATG AAACCTTCAG CTTTAGTGCA AAAGATATCT TCGTGGTGCC GACCTGGCAT GGCGTGTCGT TCCAGACCAC GCAAGATTCC GTTTTATTTA GTTTTTCGGA CAGACCGGTA CAAGAAGCCC TGGGGCTGTT CCGCGAAGCG CGTTATTAA
|
Protein sequence | MTDNNQNSRE QFYQHISGQN LTPLWESLHH LVPKTPNANC APAYWNYQEI RPLLLESGGL IGAKEAVRRV LVLENPALRG QSSITATLYA GLQLIMPGEV APSHRHNQSA LRFIVEGKGA FTAVDGERTP MNEGDFILTP QWRWHDHGNP GDEPVIWLDG LDLPLVNILG CGFAEDYPEE QQPVTRKEGD YLPRYAANML PLRHQTGNSS PIFNYRYDRS REVLHDLTRL GDADEWDGYK MRYVNPVTGG YPMPSMGAFL QLLPKGFASR VARTTDSTIY HVVEGSGQVI IGNETFSFSA KDIFVVPTWH GVSFQTTQDS VLFSFSDRPV QEALGLFREA RY
|
| |