Gene SeSA_A2417 details


Gene Information

Locus tag: SeSA_A2417
Symbol: gtdA
ID: 6518395
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633
Kingdom: Bacteria
Replicon accession: NC_011094
Strand:
Start bp: 2294815
End bp: 2295852
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 57%
IMG OID: 642747475
Product: gentisate 1,2-dioxygenase
Protein accession: YP_002115268
Protein GI: 194736424
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3435] Gentisate 1,2-dioxygenase
TIGRFAM ID: [TIGR02272] gentisate 1,2-dioxygenase
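
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based, inclusive positions on the replicon and that the CDS ends in a single stop codon; the function names are hypothetical and not part of the record.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp):
    """Residues encoded by a CDS, assuming the final codon is a stop codon."""
    return length_bp // 3 - 1


length = gene_length_bp(2294815, 2295852)
print(length)                      # 1038, matching the Gene Length field
print(protein_length_aa(length))   # 345, matching the Protein Length field
```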


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGAAA TAAATCAGAA CGTAAAAGAT AGCCGTCAGC AGTATTACCA GCATATTTCC 
GGGCAGAATC TGACGCCGCT GTGGGAGTCG TTACATCACC TGGTACCGCA GACGCCAAAC
GCCAACTGCG CGCCGGCCTA CTGGAATTAT CAGGAAATTC GTCCGCTACT GATGGAAAGC
GGCAATGTCA TTGGCGCGAA AGAGGCGATC CGCCGGGTGC TGGTGCTGGA AAATCCGGCA
TTGCGCGGTC AGTCGTCGAT CACGGCGACC TTATACGCTG GTTTACAGCT GATCCTGCCC
GGCGAAGTCG CGCCGAGTCA TCGCCATAAC CAGTCGGCGC TGCGTTTTAT CGTCGAAGGT
AAAGGCGCAT TTACCGCGGT GGACGGCGAG CGCACGCCAA TGCATACCGG CGATTTTATC
CTGACGCCGC AGTGGCGCTG GCACGATCAT GGTAATCCGG GGTCAGAGCC GGTGGTATGG
CTGGATGGTC TGGATCTGCC GTTAGTCAAC CTCCTGGGCT GTGGGTTTGC GGAAGACTAT
CCCGAAGATC AGCAGCCGGT AACGCGAAAA GAGGGCGATT ATCTGCCGCG CTATGCAGCG
AATATGCTGC CGCTGCGCCA CCAGCGCGGG AATTCGTCGC CGATTTTCAA CTACCGTTAC
GACCGCAGTC GCGAGGCGCT GCACGATCTG ACCCGTATGG GCGATCCGGA TGAGTGGGAA
GGTTACAAGC TGCGTTACGT TAATCCCGTC ACCGGCGGTT ATCCGATGCC GTCAATGGGC
GCGTTCCTGC AACTGCTGCC AAAAGGCTTT GCCTCGCGTG TGGCGCGGAG CACCGACAGC
ACTATCTACC ACGTCGTTGA AGGGGCAGGG CAGGTCACTA TCGGCAACGA AACTTTTCAT
TTTTCCGCAA AAGACATTTT TGTGGCGCCG ACCTGGCATG AGGTGTCGTT TCGCAGCAGT
GAAGACACGG TGTTATTCAG CTTTTCGGAC AAGCCGGTTC AGGAAGCCCT GGGGCTGTTC
CGCGAAGCAC GTTATTAA
 
Protein sequence
MSEINQNVKD SRQQYYQHIS GQNLTPLWES LHHLVPQTPN ANCAPAYWNY QEIRPLLMES 
GNVIGAKEAI RRVLVLENPA LRGQSSITAT LYAGLQLILP GEVAPSHRHN QSALRFIVEG
KGAFTAVDGE RTPMHTGDFI LTPQWRWHDH GNPGSEPVVW LDGLDLPLVN LLGCGFAEDY
PEDQQPVTRK EGDYLPRYAA NMLPLRHQRG NSSPIFNYRY DRSREALHDL TRMGDPDEWE
GYKLRYVNPV TGGYPMPSMG AFLQLLPKGF ASRVARSTDS TIYHVVEGAG QVTIGNETFH
FSAKDIFVAP TWHEVSFRSS EDTVLFSFSD KPVQEALGLF REARY
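
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how the two relate, the Python below strips the block spacing, computes a GC fraction, and translates codons up to the first stop. It assumes the amino acid assignments of translation table 11 are the same as those of the standard genetic code (the two tables differ in which codons may act as start codons); the helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table built in TCAG order; the amino acid assignments are those of the
# standard genetic code, assumed here to be shared with translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text):
    """Remove the spaces and line breaks of the 10-character block layout."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "ATGTCTGAAA TAAATCAGAA CGTAAAAGAT AGCCGTCAGC AGTATTACCA GCATATTTCC"
dna = clean(first_line)
print(translate(dna))             # MSEINQNVKDSRQQYYQHIS
print(round(gc_fraction(dna), 2))
```

Passing the full gene sequence through `clean` and `translate` in the same way gives a translation that can be compared with the listed protein sequence, and `gc_fraction` can be compared with the GC content field.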