Gene Information |
Locus tag | SeD_A2525 |
Symbol | gtdA |
ID | 6873092 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 2404894 |
End bp | 2405931 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642785605 |
Product | gentisate 1,2-dioxygenase |
Protein accession | YP_002216263 |
Protein GI | 198243492 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3435] Gentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR02272] gentisate 1,2-dioxygenase |
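The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(2404894, 2405931)
print(length)                     # 1038
print(protein_length_aa(length))  # 345
```

Both results agree with the Gene Length (1038 bp) and Protein Length (345 aa) fields.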
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 77 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGAAA TAAATCAGAA CGTAAAAGAT AGCCGTCAGC AGTATTACCA GCATATTTCC GGGCAGAATC TGACGCCGCT GTGGGAATCG TTACATCACC TGGTACCGCA GACGCCAAAC GCCAACTGCG CGCCGGCCTA CTGGAATTAT CAGGAAATTC GTCCGCTACT GATGGAAAGC GGCAATGTCA TTGGCGCGAA AGAGGCGATC CGCCGGGTGC TGGTGCTGGA AAATCCGGCA TTGCGCGGTC AGTCGTCGAT CACGGCGACC TTATACGCTG GTTTACAGCT GATCCTGCCC GGCGAAGTCG CGCCGAGTCA TCGCCATAAC CAGTCGGCGC TGCGTTTTAT CGTCGAAGGT AAAGGCGCAT TTACCGCGGT GGACGGCGAG CGCACGCCAA TGCATACCGG CGATTTTATC CTGACGCCGC AGTGGCGCTG GCACGATCAT GGTAATCCGG GGTCAGAGCC GGTGGTATGG CTGGATGGTC TGGATCTGCC GTTAGTCAAT CTCCTGGGCT GTGGTTTTGC GGAAGACTAT CCCGAAGATC AGCAGCCGGT GACGCGAAAA GAGGGCGATT ATCTGCCGCG CTATGCAGCG AATATGCTGC CGCTGCGCCA CCAGCGCGGG AACTCGTCGC CGATTTTCAA CTACCGTTAC GACCGCAGTC GCGAGGCGCT GCACGATCTG ACCCGTATGG GCGATCCGGA TGAGTGGGAA GGTTACAAGC TGCGTTACGT TAATCCCGTC ACCGGCGGTT ATCCGATGCC GTCGATGGGC GCGTTCCTGC AACTGTTGCC AAAAGGCTTT GCCTCGCGTG TGGCGCGGAG CACCGACAGC ACTATCTACC ACGTCGTTGA AGGGGCAGGG CTGGTCACTA TCGGCAACGA AACTTTTCAT TTTTCCGCAA AAGACATTTT TGTGGCGCCG ACCTGGCATG AGGTGTCGTT TCGCAGCAGC GAAGACACGG TGTTATTCAG CTTTTCGGAC AAGCCGGTTC AGGAAGCCCT GGGGCTGTTC CGCGAAGCAC GTTATTAA
|
Protein sequence | MSEINQNVKD SRQQYYQHIS GQNLTPLWES LHHLVPQTPN ANCAPAYWNY QEIRPLLMES GNVIGAKEAI RRVLVLENPA LRGQSSITAT LYAGLQLILP GEVAPSHRHN QSALRFIVEG KGAFTAVDGE RTPMHTGDFI LTPQWRWHDH GNPGSEPVVW LDGLDLPLVN LLGCGFAEDY PEDQQPVTRK EGDYLPRYAA NMLPLRHQRG NSSPIFNYRY DRSREALHDL TRMGDPDEWE GYKLRYVNPV TGGYPMPSMG AFLQLLPKGF ASRVARSTDS TIYHVVEGAG LVTIGNETFH FSAKDIFVAP TWHEVSFRSS EDTVLFSFSD KPVQEALGLF REARY
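The sequences above are printed in space-separated blocks of ten characters, so the blocks have to be joined before any computation. As a hedged illustration, the sketch below shows one way to compute a GC fraction comparable to the GC content field. The helper name `gc_fraction` is hypothetical, and the exact method the database uses to derive its 57% figure (for example, its rounding) is not stated in the record.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence.

    Whitespace (such as the 10-base block separators used above) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two blocks of the gene sequence above: 5 G/C bases out of 20.
print(gc_fraction("ATGTCTGAAA TAAATCAGAA"))  # 0.25
```

Passing the full Gene sequence value to the same function gives the fraction for the whole gene.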