Gene SeAg_B4853 details


Gene Information

Locus tag: SeAg_B4853
Symbol:
ID: 6793847
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Agona str. SL483
Kingdom: Bacteria
Replicon accession: NC_011149
Strand:
Start bp: 4730628
End bp: 4731998
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 57%
IMG OID: 642778917
Product: aldehyde dehydrogenase (NAD) family protein
Protein accession: YP_002149478
Protein GI: 197251125
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
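
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal sketch of that check, assuming Start bp and End bp are 1-based inclusive positions (an assumption that agrees with the listed Gene Length but is not stated in the record) and that the CDS ends in a single stop codon. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length = gene_length(4730628, 4731998)
print(length, protein_length(length))
```

With the values from this record the sketch prints 1371 456, which matches the Gene Length and Protein Length fields.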


Plasmid Coverage information

Num covering plasmid clones: 40
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTTACC AGACAGTGAA TCCTGCCAAT AATCAGCTCA TTAAAGAGTA TCCCCCGCAT 
ACGGACGCGG ATATTGAAGC CGCGCTGCAA AAAGCTGACG CGCTCTATCA CTCCGACTGG
TCCAGGGGAG AGATTGACCA ACGTCTGCCG GTACTGCATA AGCTGGCTGA CTTGATCGAC
AGCCGTGTTG AAGAACTGGC AAAAATCGCC AGCCAGGAGA TGGGCAAGCT CATCGAGCAG
AGCCGTGGCG AAGTCAAACT GTGTGCGCAG ATCGCTCGCT ATTATGCGGA TAACGCGAAG
CAGTTTCTTG CCCCGGTGCC TTATAAAACC GAGTTTGGCG ACGCGTGGGT AGAACATCAT
CCGATTGGCG TCATCATGGC CGTTGAGCCG TGGAACTTCC CGTACTATCA GTTGATGCGT
GTGCTGGCGC CGAACCTGGC CGCTGGTAAC CCGGTGCTGG CGAAACATGC CAGCATCGTA
CCGCACTGCG CCGAGACGTT TGCCCATCTG GTGCGTGAAG CCGGCGCGCC GGAAGGCGCA
TGGACCAACC TGTTTATTTC CTCCGATCAG GTGGCGAATA TCATCGCCGA CCCGCGCGTG
CAGGGCGCGG CGCTGACCGG ATCTGAAAAA GCGGGGAGCG CCGTGGCGGC ACAGGCGGCG
AAGCACATTA AAAAATCGAC GCTGGAACTG GGCGGGAACG ATGTGTTCGT CGTGCTGGAC
GATGCCGATC TTGAGAAAGC GGTGAAAATT GGCGTGCAGG CACGGCTCAC TAATGCAGGA
CAGGTATGTA CGGCGGCGAA GCGCTTTATC CTGCATGAGA AAATCGCCGA TCAATTCCTC
AGCCAGTTCA CCGAGGCGTT CAGGAAGGTG AAGGTGGGGG ATCAGATGGA TGCTTCTACC
GAACTGGGGC CGCTGTCGTC GAAAGATGCG CTGGATACAT TGACCAGACA GGTCGAGGAA
GCGGTGAAAA ATGGCGCGAC GCTGCACGTT GGCGGCAAGC CGCTGGAAAG CAAAGGCAAC
TTCTTTGAGC CGACGATTCT GACCAATATT ACGCGTGACA ACCCGGCGTA CTTTGAAGAG
TTCTTCGGCC CGGTGGCGCA GATGTATGTG GTGAAAGACG ATGACGAGGC GGTAAAACTC
GCCAACGATT CCCACTACGG CCTGGGCGGC GCGGTGTTTA GTCAGGATAT TGAGCGTGCT
AAACGCATGG CCTCCCGGAT TGAAACCGGG ATGGTTTATA TCAACTGGCT GACCGACACC
GCAGCGGAGC TGCCTTTCGG CGGCGTTAAG CGTTCGGGCT TCGGACGCGA GCTATCGGAT
CTGGGGATTA AGGAGTTTGT GAACCAGAAG CTGGTAGTGG TGCGCCGCTA A
 
Protein sequence
MAYQTVNPAN NQLIKEYPPH TDADIEAALQ KADALYHSDW SRGEIDQRLP VLHKLADLID 
SRVEELAKIA SQEMGKLIEQ SRGEVKLCAQ IARYYADNAK QFLAPVPYKT EFGDAWVEHH
PIGVIMAVEP WNFPYYQLMR VLAPNLAAGN PVLAKHASIV PHCAETFAHL VREAGAPEGA
WTNLFISSDQ VANIIADPRV QGAALTGSEK AGSAVAAQAA KHIKKSTLEL GGNDVFVVLD
DADLEKAVKI GVQARLTNAG QVCTAAKRFI LHEKIADQFL SQFTEAFRKV KVGDQMDAST
ELGPLSSKDA LDTLTRQVEE AVKNGATLHV GGKPLESKGN FFEPTILTNI TRDNPAYFEE
FFGPVAQMYV VKDDDEAVKL ANDSHYGLGG AVFSQDIERA KRMASRIETG MVYINWLTDT
AAELPFGGVK RSGFGRELSD LGIKEFVNQK LVVVRR
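
The protein sequence can be checked against the gene sequence by translating the latter. The following is a minimal sketch, assuming the amino acid assignments of NCBI translation table 11 (the table named in the Translation table field), which are the same as those of the standard code. The function name `translate` is hypothetical, and only the first 30 bases of the gene sequence are translated here.

```python
from itertools import product

# Codon order used by the NCBI genetic code tables: bases T, C, A, G,
# with the first codon position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGGCTTACC AGACAGTGAA TCCTGCCAAT"))
```

This prints MAYQTVNPAN, the first ten residues of the protein sequence listed in this record.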