Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeSA_A2417 |
Symbol | gtdA |
ID | 6518395 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 |
Kingdom | Bacteria |
Replicon accession | NC_011094 |
Strand | - |
Start bp | 2294815 |
End bp | 2295852 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642747475 |
Product | gentisate 1,2-dioxygenase |
Protein accession | YP_002115268 |
Protein GI | 194736424 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3435] Gentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR02272] gentisate 1,2-dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGAAA TAAATCAGAA CGTAAAAGAT AGCCGTCAGC AGTATTACCA GCATATTTCC GGGCAGAATC TGACGCCGCT GTGGGAGTCG TTACATCACC TGGTACCGCA GACGCCAAAC GCCAACTGCG CGCCGGCCTA CTGGAATTAT CAGGAAATTC GTCCGCTACT GATGGAAAGC GGCAATGTCA TTGGCGCGAA AGAGGCGATC CGCCGGGTGC TGGTGCTGGA AAATCCGGCA TTGCGCGGTC AGTCGTCGAT CACGGCGACC TTATACGCTG GTTTACAGCT GATCCTGCCC GGCGAAGTCG CGCCGAGTCA TCGCCATAAC CAGTCGGCGC TGCGTTTTAT CGTCGAAGGT AAAGGCGCAT TTACCGCGGT GGACGGCGAG CGCACGCCAA TGCATACCGG CGATTTTATC CTGACGCCGC AGTGGCGCTG GCACGATCAT GGTAATCCGG GGTCAGAGCC GGTGGTATGG CTGGATGGTC TGGATCTGCC GTTAGTCAAC CTCCTGGGCT GTGGGTTTGC GGAAGACTAT CCCGAAGATC AGCAGCCGGT AACGCGAAAA GAGGGCGATT ATCTGCCGCG CTATGCAGCG AATATGCTGC CGCTGCGCCA CCAGCGCGGG AATTCGTCGC CGATTTTCAA CTACCGTTAC GACCGCAGTC GCGAGGCGCT GCACGATCTG ACCCGTATGG GCGATCCGGA TGAGTGGGAA GGTTACAAGC TGCGTTACGT TAATCCCGTC ACCGGCGGTT ATCCGATGCC GTCAATGGGC GCGTTCCTGC AACTGCTGCC AAAAGGCTTT GCCTCGCGTG TGGCGCGGAG CACCGACAGC ACTATCTACC ACGTCGTTGA AGGGGCAGGG CAGGTCACTA TCGGCAACGA AACTTTTCAT TTTTCCGCAA AAGACATTTT TGTGGCGCCG ACCTGGCATG AGGTGTCGTT TCGCAGCAGT GAAGACACGG TGTTATTCAG CTTTTCGGAC AAGCCGGTTC AGGAAGCCCT GGGGCTGTTC CGCGAAGCAC GTTATTAA
|
Protein sequence | MSEINQNVKD SRQQYYQHIS GQNLTPLWES LHHLVPQTPN ANCAPAYWNY QEIRPLLMES GNVIGAKEAI RRVLVLENPA LRGQSSITAT LYAGLQLILP GEVAPSHRHN QSALRFIVEG KGAFTAVDGE RTPMHTGDFI LTPQWRWHDH GNPGSEPVVW LDGLDLPLVN LLGCGFAEDY PEDQQPVTRK EGDYLPRYAA NMLPLRHQRG NSSPIFNYRY DRSREALHDL TRMGDPDEWE GYKLRYVNPV TGGYPMPSMG AFLQLLPKGF ASRVARSTDS TIYHVVEGAG QVTIGNETFH FSAKDIFVAP TWHEVSFRSS EDTVLFSFSD KPVQEALGLF REARY
|
| |