Gene SeSA_A4778 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A4778 
Symbol 
ID6516136 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp4637996 
End bp4639366 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content57% 
IMG OID642749710 
Productsuccinate-semialdehyde dehydrogenase (NADP+) 
Protein accessionYP_002117442 
Protein GI194736366 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTACC AGACAGTGAA TCCTGCCAAT AATCAGCTCA TTAAAGAGTA CCCCCCGCAC 
ACGGACGCGG ATATTGAAGC CGCGCTGCAA AAAGCTGACG CGCTCTATCA CTCCGACTGG
TCCAAAGGAG AGATTGACCA ACGTCTGCCG GTACTGCATA AACTGGCTGA CTTGATCGAC
AGCCGTGTTG AAGAACTGGC AAAAATCGCC AGCCAGGAGA TGGGTAAGCT CATCGAGCAG
AGCCGTGGCG AAGTCAAACT GTGTGCGCAG ATCGCTCGCT ATTATGCGGA TAACGCGAAG
CAGTTTCTTG CCCCGGTGCC TTATAAAACC GAGTTTGGCG ACGCGTGGGT AGAACATCAT
CCGATTGGCG TCATCATGGC CGTTGAGCCG TGGAACTTCC CGTACTATCA GTTGATGCGT
GTGCTGGCGC CGAACCTGGC CGCAGGTAAC CCGGTGCTGG CGAAACATGC CAGCATCGTA
CCACACTGCG CCGAGACATT TGCCCATCTG GTGCGTGAAG CCGGCGCGCC GGACGGCGCA
TGGACCAACC TGTTTATTTC CTCCGATCAG GTGGCGAACA TCATCGCCGA CCCGCGCGTG
CAGGGCGCGG CGCTGACCGG ATCTGAAAAA GCGGGGAGCG CCGTGGCGGC ACAGGCGGCG
AAGCACATTA AAAAATCGAC GCTGGAACTG GGCGGGAACG ATGTGTTCGT CGTGCTGGAC
GATGCCGATC TTGAGAAAGC GGTGAAAATT GGCGTGCAGG CACGACTCAC TAATGCAGGG
CAGGTATGTA CGGCGGCGAA GCGCTTTATC CTACATGAGA AAATCGCCGA TCAATTCCTC
AGCCAGTTCA CCGAGGCGTT CAGGAAGGTG AAGGTGGGGG ATCAGATGGA CGCTTCTACC
GAACTGGGGC CGCTGTCGTC GAAAGATGCG CTGGAAACAT TGACCAGACA GGTCGAGGAA
GCGGTGAAAA ATGGCGCGAC GCTGCACGTT GGCGGCAAGC CGCTGGAAAG CAAAGGCAAC
TTCTTTGAGC CGACGATTCT GACCAATATT ACGCGTGACA ACCCGGCGTA CTTTGAAGAG
TTCTTCGGCC CGGTGGCGCA GATGTATGTG GTGAAAGACG ATGACGAGGC GGTAAAACTC
GCCAACGATT CCCACTACGG GCTGGGCGGC GCGGTGTTTA GTCAGGATAT CGAACGCGCT
AAACGCATGG CCTCCCGGAT TGAAACCGGG ATGGTTTATA TCAACTGGCT GACCGACACC
GCAGCGGAGC TGCCGTTCGG CGGCGTTAAG CGTTCGGGCT TCGGACGCGA GCTATCGGAT
CTGGGGATTA AGGAGTTTGT GAACCAGAAG CTGGTAGTGG TGCGCCGCTA A
 
Protein sequence
MAYQTVNPAN NQLIKEYPPH TDADIEAALQ KADALYHSDW SKGEIDQRLP VLHKLADLID 
SRVEELAKIA SQEMGKLIEQ SRGEVKLCAQ IARYYADNAK QFLAPVPYKT EFGDAWVEHH
PIGVIMAVEP WNFPYYQLMR VLAPNLAAGN PVLAKHASIV PHCAETFAHL VREAGAPDGA
WTNLFISSDQ VANIIADPRV QGAALTGSEK AGSAVAAQAA KHIKKSTLEL GGNDVFVVLD
DADLEKAVKI GVQARLTNAG QVCTAAKRFI LHEKIADQFL SQFTEAFRKV KVGDQMDAST
ELGPLSSKDA LETLTRQVEE AVKNGATLHV GGKPLESKGN FFEPTILTNI TRDNPAYFEE
FFGPVAQMYV VKDDDEAVKL ANDSHYGLGG AVFSQDIERA KRMASRIETG MVYINWLTDT
AAELPFGGVK RSGFGRELSD LGIKEFVNQK LVVVRR