Gene SeD_A4926 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A4926 
Symbol 
ID6872865 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp4759782 
End bp4761152 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content57% 
IMG OID642787800 
Productaldehyde dehydrogenase (NAD) family protein 
Protein accessionYP_002218393 
Protein GI198242417 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones64 
Fosmid unclonability p-value0.813756 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTACC AGACAGTGAA TCCTGCCAAT AATCAGCTCA TTAAAGAGTA TCCCCCGCAC 
ACGGGCGCGG ATATTGAAAC CGCGCTGCAA AAAGCTGACG CGCTCTATCA CTCCGACTGG
TCCAAGGGAG AGATTGACCA ACGTCTGCCG GTACTGCATA AGCTGGCTGA CTTGATCGAC
AGCCGTGTTG AAGAACTGGC AAAAATCGCC AGCCAGGAGA TGGGCAAGCT CATCGAGCAG
AGCCGTGGCG AAGTCAAACT GTGTGCGCAG ATCGCTCGCT ATTATGCGGA TAACGCGAAG
CAGTTTCTTG CCCCGGTGCC TTATAAAACC GAGTTTGGCG ACGCGTGGGT AGAACATCAT
CCGATTGGCG TCATCATGGC TGTTGAGCCG TGGAACTTCC CGTACTATCA GTTGATGCGT
GTGCTGGCGC CGAACCTGGC CGCTGGTAAC CCGGTGCTGG CGAAACATGC CAGCATCGTA
CCGCACTGCG CCGAGACATT TGCCCATCTG GTGCGTGAAG CCGGCGCGCC GGAAGGCGCA
TGGACCAACC TGTTTATTTC CTCCGATCAG GTGGCGAACA TCATCGCCGA CCCGCGCGTG
CAGGGCGCGG CGCTGACCGG CTCTGAAAAA GCGGGGAGCG CCGTGGCGGC ACAGGCGGCG
AAGCACATTA AAAAATCGAC GCTGGAACTG GGCGGGAACG ATGTGTTCGT CGTGCTGGAC
GATGCCGATC TTGAGAAAGC GGTGAAAATT GGCGTGCAGG CACGGCTCAC TAATGCGGGG
CAGGTATGTA CGGCGGCGAA GCGCTTTATC CTGCATGAGA AAATCGCCGA TCAATTCCTC
AGCCAGTTCA CCGAGGCGTT CAGGAAGGTG AAGGTGGGGG ATCAGATGGA CGCTTCTACC
GAACTGGGGC CGCTGTCGTC GAAAGATGCT CTGGATACAC TGACCAGACA GGTCGAGGAA
GCGGTGAAAA ATGGCGCGAC GCTGCACGTT GGCGGCACGC CGCTGGAAAG CAAAGGCAAC
TTCTTTGAGC CGACCATTCT GACCAATATT ACGCGTGACA ACCCGGCGTA CTTTGAAGAG
TTCTTCGGCC CGGTGGCGCA GATGTATGTG GTGAAAGACG ATGATGAGGC GGTAAAACTC
GCCAACGATT CCCACTACGG CCTGGGCGGC GCAGTGTTTA GTCAGAATAT CGAACGCGCT
AAACGCATGG CCTCCCGGAT TGAAACCGGG ATGGTTTATA TCAACTGGCT GACCGACACC
GCAGCGGAGC TGCCGTTCGG CGGCGTTAAG CGTTCGGGCT TCGGACGCGA GCTATCGGAT
CTGGGGATTA AGGAGTTTGT GAACCAGAAG CTGGTAGTGG TGCGCCGCTA A
 
Protein sequence
MAYQTVNPAN NQLIKEYPPH TGADIETALQ KADALYHSDW SKGEIDQRLP VLHKLADLID 
SRVEELAKIA SQEMGKLIEQ SRGEVKLCAQ IARYYADNAK QFLAPVPYKT EFGDAWVEHH
PIGVIMAVEP WNFPYYQLMR VLAPNLAAGN PVLAKHASIV PHCAETFAHL VREAGAPEGA
WTNLFISSDQ VANIIADPRV QGAALTGSEK AGSAVAAQAA KHIKKSTLEL GGNDVFVVLD
DADLEKAVKI GVQARLTNAG QVCTAAKRFI LHEKIADQFL SQFTEAFRKV KVGDQMDAST
ELGPLSSKDA LDTLTRQVEE AVKNGATLHV GGTPLESKGN FFEPTILTNI TRDNPAYFEE
FFGPVAQMYV VKDDDEAVKL ANDSHYGLGG AVFSQNIERA KRMASRIETG MVYINWLTDT
AAELPFGGVK RSGFGRELSD LGIKEFVNQK LVVVRR