Gene SNSL254_A2368 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A2368 
SymbolgtdA 
ID6484033 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp2285573 
End bp2286610 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content56% 
IMG OID642737708 
Productgentisate 1,2-dioxygenase 
Protein accessionYP_002041450 
Protein GI194444288 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3435] Gentisate 1,2-dioxygenase 
TIGRFAM ID[TIGR02272] gentisate 1,2-dioxygenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0793193 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones73 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGAAA TAAATCAGAA CGTAAAAGAT AGCCGTCAAC AGTATTACCA GCATATTTCC 
GGGCAGAATC TGACGCCGCT GTGGGAGTCG TTACATCACC TGGTACCGCA GACGCCAAAC
GCCAACTGCG CGCCGGCCTA CTGGAATTAT CAGGAAATTC GTCCGCTACT GATGGAAAGC
GGCAATGTCA TTGGCGCGAA AGAGGCGATC CGCCGGGTGC TGGTGCTGGA AAATCCGGCA
TTGCGCGGTC AGTCGTCGAT TACGGCGACC TTATACGCTG GTTTACAGCT GATCCTGCCC
GGCGAAGTCG CGCCGAGTCA TCGCCATAAC CAGTCGGCGC TGCGTTTTAT CGTCGAAGGT
AAAGGCGCAT TTACCGCGGT GGACGGCGAG CGCACGCCAA TGCATACCGG CGATTTTATC
CTGACGCCGC AGTGGCGCTG GCACGATCAT GGTAATCCGG GATCAGAGCC GGTGGTATGG
CTGGATGGTC TGGATCTGCC GTTAGTCAAT CTCCTGGGCT GTGGTTTTGC GGAAGACTAT
CCCGAAGATC AGCAGCCGGT GACGCGAAAA GAGGGCGATT ATCTGCCGCG CTATGCAGCG
AATATGCTGC CGCTGCGCCA CCAGCGCGGG AACTCGTCGC CGATTTTCAA CTACCGTTAC
GACCGCAGTC GCGAGGCGTT GCACGATCTG ACCCGTATGG GCGATCCGGA TGAGTGGGAA
GGTTACAAGC TGCGTTACGT TAATCCCGTC ACCGGCGGTT ATCCGATGCC GTCGATGGGC
GCGTTCCTGC AACTGTTGCC AAAAGGCTTT GCCTCGCGTG TGGCGCGGAG CACCGACAGC
ACTATCTACC ACGTCGTTGA AGGGGCAGGA CTGGTCACTA TCGGCAACGA AACTTTTCAT
TTTTCCGCAA AAGACATTTT TGTGGCGCCG ACCTGGCATG AGGTGTCGTT TCGCAGCAGC
GAAGACACAG TGTTATTCAG TTTTTCGGAC AAGCCGGTTC AGGAAGCCCT GGGGCTGTTC
CGCGAAGCAC GTTATTAA
 
Protein sequence
MSEINQNVKD SRQQYYQHIS GQNLTPLWES LHHLVPQTPN ANCAPAYWNY QEIRPLLMES 
GNVIGAKEAI RRVLVLENPA LRGQSSITAT LYAGLQLILP GEVAPSHRHN QSALRFIVEG
KGAFTAVDGE RTPMHTGDFI LTPQWRWHDH GNPGSEPVVW LDGLDLPLVN LLGCGFAEDY
PEDQQPVTRK EGDYLPRYAA NMLPLRHQRG NSSPIFNYRY DRSREALHDL TRMGDPDEWE
GYKLRYVNPV TGGYPMPSMG AFLQLLPKGF ASRVARSTDS TIYHVVEGAG LVTIGNETFH
FSAKDIFVAP TWHEVSFRSS EDTVLFSFSD KPVQEALGLF REARY