Gene SeHA_C4926 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C4926 
Symbol 
ID6489418 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp4805804 
End bp4807174 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content57% 
IMG OID642744971 
Productaldehyde dehydrogenase (NAD) family protein 
Protein accessionYP_002048543 
Protein GI194449598 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.515013 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones87 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTACC AGACAGTGAA TCCTGCCAAT AATCAGCTCA TTAAAGAGTA TCCCCCGCAC 
ACGGACGCGG ATATTGAAGC CGCGCTGCAA AAAGCTGACG CGCTCTATCA CTCCGACTGG
TCCAGGGGAG AGATTGACCA ACGTTTGCCG GTACTGCATA AGCTGGCTGA CTTGATCGAC
AGCCGTGTTG AAGAACTGGC AAAAATCGCC AGCCAGGAGA TGGGCAAGCT CATCGAGCAG
AGCCGTGGCG AAGTCAAACT GTGTGCGCAG ATCGCTCGCT ATTATGCGGA TAACGCGAAG
CAGTTTCTTG CCCCGGTGCC TTATAAAACC GAGTTTGGCG ACGCGTGGGT AGAACATCAT
CCGATTGGCG TCATCATGGC TGTTGAGCCG TGGAACTTCC CGTACTATCA GTTGATGCGT
GTGCTGGCGC CGAACCTGGC CGCAGGTAAC CCGGCGCTGG CGAAACATGC CAGCATCGTA
CCGCACTGCG CCGAGACGTT TGCCCATCTG GTGCGTGAAG CCGGCGCGCC GGAAGGCGCA
TGGACCAACC TGTTTATTTC CTCCGATCAG GTGGCGAACA TCATCGCCGA CCCGCGCGTG
CAGGGCGCGG CGCTGACCGG ATCTGAAAAA GCGGGGAGCG CCGTGGCGGC ACAGGCGGCG
AAGCACATTA AAAAATCGAC GCTGGAACTG GGCGGGAACG ATGTGTTCGT CGTGCTGGAC
GATGCCGATC TTGAGAAAGC GGTGAAAATT GGCGTGCAGG CACGACTCAC TAATGCAGGG
CAGGTATGTA CGGCGGCGAA GCGCTTTATC CTGCATGAGA AAATCGCCGA TCAATTCCTC
AGCCAGTTCA CCGAGGCGTT CAGGAAGGTG AAGGTGGGGG ATCAGATGGA TGCTTCTACC
GAACTGGGGC CGCTGTCGTC GAAAGATGCG CTGGATACAT TGACCAGACA GGTCGAGGAA
GCGGTGAAAA ATGGCGCGAC GCTGCACGTT GGCGGCAAGC CGCTGGAAAG CAAAGGCAAC
TTCTTTGAGC CGACCATTCT GACCCACATT ACGCGTGACA ACCCGGCGTA CTTTGAAGAG
TTCTTCGGCC CGGTGGCGCA GATGTATGTG GTGAAAGACG ATGACGAGGC GGTAAAACTC
GCCAACGATT CCCACTACGG CCTGGGCGGC GCGGTGTTTA GTCAGAATAT CGAACGCGCT
AAACGCATGG CCTCCCGGAT TGAAACCGGG ATGGTTTATA TCAACTGGCT GACCGACACC
GCAGCGGAGC TGCCGTTCGG CGGCGTTAAG CGTTCGGGCT TCGGACGCGA GCTATCGGAT
CTGGGGATTA AGGAGTTTGT GAACCAGAAG CTGGTAGTGG TGCGCCGCTA A
 
Protein sequence
MAYQTVNPAN NQLIKEYPPH TDADIEAALQ KADALYHSDW SRGEIDQRLP VLHKLADLID 
SRVEELAKIA SQEMGKLIEQ SRGEVKLCAQ IARYYADNAK QFLAPVPYKT EFGDAWVEHH
PIGVIMAVEP WNFPYYQLMR VLAPNLAAGN PALAKHASIV PHCAETFAHL VREAGAPEGA
WTNLFISSDQ VANIIADPRV QGAALTGSEK AGSAVAAQAA KHIKKSTLEL GGNDVFVVLD
DADLEKAVKI GVQARLTNAG QVCTAAKRFI LHEKIADQFL SQFTEAFRKV KVGDQMDAST
ELGPLSSKDA LDTLTRQVEE AVKNGATLHV GGKPLESKGN FFEPTILTHI TRDNPAYFEE
FFGPVAQMYV VKDDDEAVKL ANDSHYGLGG AVFSQNIERA KRMASRIETG MVYINWLTDT
AAELPFGGVK RSGFGRELSD LGIKEFVNQK LVVVRR