Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_2433 |
Symbol | gtdA |
ID | 5590259 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 2415446 |
End bp | 2416474 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640926094 |
Product | gentisate 1,2-dioxygenase |
Protein accession | YP_001463489 |
Protein GI | 157155116 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3435] Gentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR02272] gentisate 1,2-dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGATA ACAATCAAAA TAGCCGCGAA CAATTTTATC AGCACATTTC CGGGCAAAAC CTGACCCCGC TGTGGGAGTC ACTACACCAT CTGGTGCCGA AAACGCCCAA CGCTAACTGT GCGCCAGCGT ACTGGAATTA TCAGGAGATC CGCCCGCTGC TGCTGGAGAG CGGCAGCCTG ATTGGCGCGA AAGAGGCGGT GCGCCGCGTG CTGGTGCTGG AAAACCCGGC GCTGCGCGGG CAGTCTTCCA TCACCGCGAC TTTATATGCC GGTTTGCAAC TGATCATGCC GGGCGAAGTG GCACCGAGTC ATCGCCATAA CCAGTCGGCG CTGCGTTTTA TTGTTGAAGG CAAAGGGGCA TTTACCGCCG TTGACGGCGA ACGCACGCCA ATGAATGAGG GCGATTTTAT CCTCACTCCG CAGTGGCGCT GGCACGATCA CGGTAACCCT GGCGACGAAC CAGTTATCTG GCTCGATGGG CTGGATCTGC CATTAGTGAA TACTCTGGGC TGCGGTTTTG CCGAAGATTA CCCGGAAGAG CAACAACCGG TAACGCGTAA AGAGGGCGAT TATCTGCCGC GTTACGCTGC CAATATGTTG CCGCTGCGCC ATCAGACCGG GAACTCCTCA CCCATCTTTA ACTACCGTTA TGACCGCAGC CGCGAAGTGC TGCACGATTT AACCCGCCTG GGCGATGCCG ATGAGTGGGA TGGCTACAAA ATGCGCTACG TCAACCCGGT CACCGGTGGC TATCCGATGC CGTCGATGGG CGCTTTCCTG CAACTGCTGC CGAAAGGGTT CGCCTCCCGC GTGGCGCGTA CCACTGACAG CACCATCTAT CACGTGGTGG AAGGTGGCGG GCAGGTCACT ATTGGCAATG AAACCTTCAG CTTTAGTGCA AAAGATATCT TCGTGGTGCC GACCTGGCAC GGCGTGTCGT TCCAGACCAC GCAAGATACC GTGTTATTCA GTTTTTCGGA CAGACCGGTA CAAGAAGCCC TGGGACTGTT CCGCGAAGCG CGTTATTAA
|
Protein sequence | MTDNNQNSRE QFYQHISGQN LTPLWESLHH LVPKTPNANC APAYWNYQEI RPLLLESGSL IGAKEAVRRV LVLENPALRG QSSITATLYA GLQLIMPGEV APSHRHNQSA LRFIVEGKGA FTAVDGERTP MNEGDFILTP QWRWHDHGNP GDEPVIWLDG LDLPLVNTLG CGFAEDYPEE QQPVTRKEGD YLPRYAANML PLRHQTGNSS PIFNYRYDRS REVLHDLTRL GDADEWDGYK MRYVNPVTGG YPMPSMGAFL QLLPKGFASR VARTTDSTIY HVVEGGGQVT IGNETFSFSA KDIFVVPTWH GVSFQTTQDT VLFSFSDRPV QEALGLFREA RY
|
| |