Gene SNSL254_A4876 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A4876 
Symbol 
ID6485211 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp4744294 
End bp4745664 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content57% 
IMG OID642740087 
Productaldehyde dehydrogenase (NAD) family protein 
Protein accessionYP_002043764 
Protein GI194446590 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones63 
Fosmid unclonability p-value0.570732 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTACC AGACAGTGAA TCCTGCCAAT AATCAGCTCA TTAAAGAGTA CCCCCCGCAC 
ACGGACGCGG ATATTGAAGC CGCGCTGCAA AAAGCTGACG CGCTCTATCA CTCCGACTGG
TCCAAAGGAG AGATTGACCA ACGTCTGCCG GTACTGCATA AGCTGGCTGA CTTGATCGAC
AGCCGTGTTG AAGAACTGGC AAAAATCGCC AGCCAGGAGA TGGGCAAGCT CATCGAGCAG
AGCCGTGGCG AAGTCAAACT GTGTGCGCAG ATCGCTCGCT ATTATGCGGA TAACGCGAAG
CAGTTTCTTG CCCCGGTGCC TTATAAAACC GAGTTTGGCG ACGCGTGGGT AGAACATCAT
CCGATTGGCG TCATCATGGC CGTTGAGCCG TGGAACTTCC CGTACTATCA GTTGATGCGT
GTGCTGGCGC CGAACTTGGC CGCTGGTAAC CCGGTGCTGG CGAAACATGC CAGCATCGTA
CCGCACTGCG CCGAGACGTT TGCCCATCTG GTGCGTGAAG CCGGCGCGCC GGAAGGCGCA
TGGACCAACC TGTTTATTTC CTCCGATCAG GTGGCGAACA TCATCGCCGA CCCGCGCGTG
CAGGGTGCGG CGCTGACCGG ATCTGAAAAA GCGGGGAGCG CCGTGGCGGC ACAGGCGGCG
AAGCACATTA AAAAATCGAC GCTGGAACTG GGCGGGAACG ATGTGTTCGT CGTGCTGGAC
GATGCCGATC TTGAGAAAGC GGTGAAAATT GGCGTGCAGG CACGGCTCAC TAATGCAGGA
CAGGTATGTA CGGCGGCGAA GCGCTTTATC CTGCATGAGA AAATCGCCGA TCAATTCCTC
AGCCAGTTCA CCGAGGCGTT CAGGAAGGTG AAGGTGGGGG ATCAGATGGA CGCTTCTACC
GAACTGGGGC CGCTGTCGTC GAAAGATGCT CTGGATACAC TGACCAGACA GGTCGAGGAA
GCGGTGAAAA ATGGCGCGAC GCTGCACGTT GGCGGCACGC CGCTGGAAAG CAAAGGCAAC
TTCTTTGAGC CGACCATTCT GACCAATATT ACGCGTGACA ACCCGGCGTA CTTTGAAGAG
TTCTTCGGCC CGGTGGCGCA GATGTATGTG GTGAAAGACG ATGATGAGGC GGTAAAACTC
GCCAACGATT CCCACTACGG CCTGGGCGGC GCGGTGTTTA GTCAGGATAT TGAGCGTGCT
AAACGCATGG CCTCCCGGAT TGAAACCGGG ATGGTTTATA TCAACTGGCT GACCGACACC
GCAGCGGAGC TGCCTTTCGG CGGCGTTAAG CGTTCGGGCT TCGGACGCGA GCTATCGGAT
CTGGGGATTA AGGAGTTTGT GAACCAGAAG CTGGTAGTGG TGCGCCGCTA A
 
Protein sequence
MAYQTVNPAN NQLIKEYPPH TDADIEAALQ KADALYHSDW SKGEIDQRLP VLHKLADLID 
SRVEELAKIA SQEMGKLIEQ SRGEVKLCAQ IARYYADNAK QFLAPVPYKT EFGDAWVEHH
PIGVIMAVEP WNFPYYQLMR VLAPNLAAGN PVLAKHASIV PHCAETFAHL VREAGAPEGA
WTNLFISSDQ VANIIADPRV QGAALTGSEK AGSAVAAQAA KHIKKSTLEL GGNDVFVVLD
DADLEKAVKI GVQARLTNAG QVCTAAKRFI LHEKIADQFL SQFTEAFRKV KVGDQMDAST
ELGPLSSKDA LDTLTRQVEE AVKNGATLHV GGTPLESKGN FFEPTILTNI TRDNPAYFEE
FFGPVAQMYV VKDDDEAVKL ANDSHYGLGG AVFSQDIERA KRMASRIETG MVYINWLTDT
AAELPFGGVK RSGFGRELSD LGIKEFVNQK LVVVRR