Gene SeHA_C2413 details


Gene Information

Locus tag: SeHA_C2413
Symbol: gtdA
ID: 6489538
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 2321937
End bp: 2322974
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 57%
IMG OID: 642742597
Product: gentisate 1,2-dioxygenase
Protein accession: YP_002046232
Protein GI: 194451398
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3435] Gentisate 1,2-dioxygenase
TIGRFAM ID: [TIGR02272] gentisate 1,2-dioxygenase
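
The start and end coordinates, the gene length, and the protein length in this record agree with one another, and a few lines of arithmetic confirm it. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon contributes one residue.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(2321937, 2322974)
    print(length, protein_length(length))  # prints: 1038 345
```

Under these assumptions, 2322974 - 2321937 + 1 gives 1038 bp, which is 346 codons: 345 residues plus the stop codon.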


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 97
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGAAA TAAATCAGAA CGTAAAAGAT AGCCGTCAGC AGTATTACCA GCATATTTCC 
GGGCAGAATC TGACGCCGCT GTGGGAATCG TTACATCACC TGGTACCGCA GACGCCAAAC
GCCAACTGCG CGCCGGCCTA CTGGAATTAT CAGGAAATTC GTCCGCTACT GATGGAAAGC
GGCAATGTCA TTGGCGCGAA AGAGGCGATC CGCCGGGTGC TGGTGCTGGA AAATCCGGCA
TTGCGCGGTC AGTCGTCGAT CACGGCGACC TTATATGCTG GTTTACAGCT GATCCTGCCC
GGCGAAGTCG CGCCGAGTCA TCGCCATAAC CAGTCGGCGC TGCGTTTTAT CGTCGAAGGT
AAAGGCGCAT TTACCGCGGT GGACGGCGAG CGCACGCCAA TGCATACCGG CGATTTTATC
CTGACGCCGC AGTGGCGCTG GCACGATCAT GGTAATCCGG GATCAGAGCC GGTGGTATGG
CTGGATGGTC TGGATCTGCC GTTAGTCAAC CTCCTGGGCT GTGGGTTTGC GGAAGACTAT
CCCGAAGATC AGCAGCCGGT GACGCGAAAA GAGGGCGATT ATCTGCCGCG CTATGCAGCG
AATATGCTGC CGCTGCGCCA CCAGCGCGGG AACTCGTCGC CGATTTTCAA CTACCGTTAC
GACCGCAGTC GCGAGGCGCT GCACGATCTG ACCCGTATGG GCGATCCGGA TGAGTGGGAA
GGCTACAAGC TGCGTTACGT TAATCCCGTC ACCGGCGGTT ATCCGATGCC GTCGATGGGC
GCGTTCCTGC AACTGCTGCC AAAAGGCTTT GCCTCGCGTG TGGCGCGGAG CACCGACAGC
ACTATCTACC ACGTCGTTGA AGGGGCAGGG CAGGTCACTA TCGGCAACGA AACTTTTCAT
TTTTCCGCAA AAGACATTTT TGTGGCGCCG ACCTGGCATG AGGTGTCGTT TCGCAGCAGC
GAAGACACGG TGTTATTCAG CTTTTCGGAC AAGCCGGTTC AGGAAGCCCT GGGGCTGTTC
CGCGAAGCAC GTTATTAA
 
Protein sequence
MSEINQNVKD SRQQYYQHIS GQNLTPLWES LHHLVPQTPN ANCAPAYWNY QEIRPLLMES 
GNVIGAKEAI RRVLVLENPA LRGQSSITAT LYAGLQLILP GEVAPSHRHN QSALRFIVEG
KGAFTAVDGE RTPMHTGDFI LTPQWRWHDH GNPGSEPVVW LDGLDLPLVN LLGCGFAEDY
PEDQQPVTRK EGDYLPRYAA NMLPLRHQRG NSSPIFNYRY DRSREALHDL TRMGDPDEWE
GYKLRYVNPV TGGYPMPSMG AFLQLLPKGF ASRVARSTDS TIYHVVEGAG QVTIGNETFH
FSAKDIFVAP TWHEVSFRSS EDTVLFSFSD KPVQEALGLF REARY
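
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon, and the GC content can be recomputed the same way. The following is a minimal sketch, not the method the database itself uses. It assumes the sequence is pasted as plain text with the 10-base spacing shown above, and it relies on translation table 11 assigning the same amino acids to codons as the standard code (the tables differ in which start codons are permitted, which does not matter here because the gene begins with ATG). The helper names are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    seq = clean(dna)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First 30 bases and final 18 bases of the gene sequence listed above.
    print(translate("ATGTCTGAAA TAAATCAGAA CGTAAAAGAT"))  # prints: MSEINQNVKD
    print(translate("CGCGAAGCAC GTTATTAA"))  # prints: REARY
```

The two fragments reproduce the first ten and last five residues of the listed protein. Passing the full gene sequence to translate and gc_percent allows comparison with the complete protein sequence and the GC content reported in the Gene Information section.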