Gene SeD_A0147 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A0147 
Symbol 
ID6874109 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp157529 
End bp158518 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content54% 
IMG OID642783395 
Productaldo-keto reductase YakC 
Protein accessionYP_002214089 
Protein GI198243518 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones77 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAATATC GTACATTAGG CGCAAACGGA CCGCGAGTGT CAGCCATCGG ACTGGGATGT 
ATGGGCATGA GCGCATTTTA CGGCGCTCAT GACGACAGCA CCTCAATTAA GACGCTACAT
TATGCGTTAG ATCAGGGGGT AACACTGCTC GATACCGCAG ATATGTATGG CCCTTATACC
AATGAAAGGT TAGTTGGAAG AGCCATCGCC GATCGTCGCG ATCGGGTATT TTTAGCGACG
AAATTTGGTA TCGTTCTCGA CCCTGCTAAC CCTATGGCGC GTGGCGTCAA TGGCAGACCG
GAGTACGTTC GCCGTAGTTG TGAGCAAAGC CTGCAACGCC TGGGGGTCGA TCATATCGAT
CTGTACTATC AACATCGCGT TGATCCATCA GTTCCCATAG GAGAGACTGT CGGTGCAATG
GCGGACCTGG TGCGCGAGGG AAAAGTGCGT TATCTCGGGC TATCCGAAGC ATCAACGCAA
ACGCTGGAAC GCGCCCATAA CGTTCACCCT ATTACCGCGC TGCAAAGTGA GTATTCGCTT
TGGTCCCGCG AAGCGGAAAT TTCAGCACTT TCCACCTGTG AACGGTTGGG TATAGGATTC
GTCGCTTACA GCCCGCTGGG ACGCGGATTT CTGACCGGTA CGATTAAAAC GCCAGAAGAT
TTTGCTGCGA ATGACTTCCG TCGCACAAAT CCCAGGTTCA TGGGTGAGAA CTTCTCGCGC
AATTTACGTC TGGCTGAAGC AATAAAACAA ATGGCACGCG AAAAAGAGTG TACCCCCGCA
CAATTAGCGC TGGCCTGGCT GCTGGCCCGC AACAGGCACA TCGTTCCCAT TCCCGGCACC
CGCCACTGCG CCAGGGTGGA TGAAAACCTC GGCGCGTTAT CACTGACCCT AAGCCCGCAG
GAGCTGACGG CAATTGAGGC GGTTTTTCCT CACGACGCCG CGGCCGGCCC CCGCTACTGG
CCGGAAATTA TGTCGACATT AAATCGCTAA
 
Protein sequence
MQYRTLGANG PRVSAIGLGC MGMSAFYGAH DDSTSIKTLH YALDQGVTLL DTADMYGPYT 
NERLVGRAIA DRRDRVFLAT KFGIVLDPAN PMARGVNGRP EYVRRSCEQS LQRLGVDHID
LYYQHRVDPS VPIGETVGAM ADLVREGKVR YLGLSEASTQ TLERAHNVHP ITALQSEYSL
WSREAEISAL STCERLGIGF VAYSPLGRGF LTGTIKTPED FAANDFRRTN PRFMGENFSR
NLRLAEAIKQ MAREKECTPA QLALAWLLAR NRHIVPIPGT RHCARVDENL GALSLTLSPQ
ELTAIEAVFP HDAAAGPRYW PEIMSTLNR