Gene SeD_A2525 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A2525 
SymbolgtdA 
ID6873092 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp2404894 
End bp2405931 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content57% 
IMG OID642785605 
Productgentisate 1,2-dioxygenase 
Protein accessionYP_002216263 
Protein GI198243492 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3435] Gentisate 1,2-dioxygenase 
TIGRFAM ID[TIGR02272] gentisate 1,2-dioxygenase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones77 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGAAA TAAATCAGAA CGTAAAAGAT AGCCGTCAGC AGTATTACCA GCATATTTCC 
GGGCAGAATC TGACGCCGCT GTGGGAATCG TTACATCACC TGGTACCGCA GACGCCAAAC
GCCAACTGCG CGCCGGCCTA CTGGAATTAT CAGGAAATTC GTCCGCTACT GATGGAAAGC
GGCAATGTCA TTGGCGCGAA AGAGGCGATC CGCCGGGTGC TGGTGCTGGA AAATCCGGCA
TTGCGCGGTC AGTCGTCGAT CACGGCGACC TTATACGCTG GTTTACAGCT GATCCTGCCC
GGCGAAGTCG CGCCGAGTCA TCGCCATAAC CAGTCGGCGC TGCGTTTTAT CGTCGAAGGT
AAAGGCGCAT TTACCGCGGT GGACGGCGAG CGCACGCCAA TGCATACCGG CGATTTTATC
CTGACGCCGC AGTGGCGCTG GCACGATCAT GGTAATCCGG GGTCAGAGCC GGTGGTATGG
CTGGATGGTC TGGATCTGCC GTTAGTCAAT CTCCTGGGCT GTGGTTTTGC GGAAGACTAT
CCCGAAGATC AGCAGCCGGT GACGCGAAAA GAGGGCGATT ATCTGCCGCG CTATGCAGCG
AATATGCTGC CGCTGCGCCA CCAGCGCGGG AACTCGTCGC CGATTTTCAA CTACCGTTAC
GACCGCAGTC GCGAGGCGCT GCACGATCTG ACCCGTATGG GCGATCCGGA TGAGTGGGAA
GGTTACAAGC TGCGTTACGT TAATCCCGTC ACCGGCGGTT ATCCGATGCC GTCGATGGGC
GCGTTCCTGC AACTGTTGCC AAAAGGCTTT GCCTCGCGTG TGGCGCGGAG CACCGACAGC
ACTATCTACC ACGTCGTTGA AGGGGCAGGG CTGGTCACTA TCGGCAACGA AACTTTTCAT
TTTTCCGCAA AAGACATTTT TGTGGCGCCG ACCTGGCATG AGGTGTCGTT TCGCAGCAGC
GAAGACACGG TGTTATTCAG CTTTTCGGAC AAGCCGGTTC AGGAAGCCCT GGGGCTGTTC
CGCGAAGCAC GTTATTAA
 
Protein sequence
MSEINQNVKD SRQQYYQHIS GQNLTPLWES LHHLVPQTPN ANCAPAYWNY QEIRPLLMES 
GNVIGAKEAI RRVLVLENPA LRGQSSITAT LYAGLQLILP GEVAPSHRHN QSALRFIVEG
KGAFTAVDGE RTPMHTGDFI LTPQWRWHDH GNPGSEPVVW LDGLDLPLVN LLGCGFAEDY
PEDQQPVTRK EGDYLPRYAA NMLPLRHQRG NSSPIFNYRY DRSREALHDL TRMGDPDEWE
GYKLRYVNPV TGGYPMPSMG AFLQLLPKGF ASRVARSTDS TIYHVVEGAG LVTIGNETFH
FSAKDIFVAP TWHEVSFRSS EDTVLFSFSD KPVQEALGLF REARY