Gene SeHA_C3372 details


Gene Information

Locus tag: SeHA_C3372
Symbol:
ID: 6490487
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 3281339
End bp: 3282823
Gene Length: 1485 bp
Protein Length: 494 aa
Translation table: 11
GC content: 51%
IMG OID: 642743505
Product: phenylacetaldehyde dehydrogenase
Protein accession: YP_002047120
Protein GI: 194451378
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 74
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAGCGACG TGACATTACT TGCCGAAGTA ACGACTTTTT TACGCCAACG ACACGGACAA 
TTTATTGCAG GTGAACGTCA GGCCGGAAAC GGCACGAACT TCTCGGTCAC TAACCCAGCC
ACCGGCAAAA TCATCGCCGA CGTTGTGTCG GCAACCCCTG CGCAGGCAGA AGAGGCCATG
CAGAGCGCCA GACGGGCGTT TGATGTCTGG CGTAAAATGC CAACGTTACA ACGCGGCGCA
TTACTGCTGA AACTGGCTGA TACTCTTGCC GCTCATCGTG AAGAGTTAGC TCAACTGGAA
AGCGTCTGTT CAGGTAAAAC GATTATGCTG TCGCGCGGTC TTGAACTCGA TCAGTCAGTG
GCCTTCCTGC GTTACTTTGC CGGTTGGGCA GGAAAAATAA CCGGTGAAAC GCTGAATGTC
TCCCTGCCAT CAATGGGAGA AGAGAGATAC ACAGCGTTTA CCCAACGCCA ACCCATTGGC
GTGGTCGTCG GTATTGTGCC GTGGAATTTC TCAATTATGA TTGCTATCTG GAAACTGGCC
GCAGCGCTGG TATGTGGCTG CACCATCGTC ATTAAACCAA GTGAATATAC CCCGCTGACA
CTGCTGCGAG TCGCTGAGCT GGCTAAAGAG GCAGGTTTCC CTGATGGCGT AATTAACGTG
GTAAACGGTG CTGGCGGTGA GATAGCGCAA CAGCTGATCG CGCATCCAGA TTGCGCCAAA
GTGAGTTTCA CCGGGTCAGT CGCGACAGGT GAGAAAGTCC GGCGTTCGGC AACATCGTCA
GGAAAACGCG TTACCCTCGA ACTAGGAGGG AAAAATGCGG CGCTGTTTCT CAATGATCTC
ACGGCACAAG CCATGGTCAA CGGTATTCTT GAAGCCGGTT ATCTGAATCA AGGGCAAATT
TGTGCTGCCG CAGAGCGTTT TTATCTGCCC CAGGAAAAAC TGGATACGGT CATGACGCTC
CTCAGACAAC GGTTATCGGA GATCGTGCCC GGCTCGCCTT TAGATGAAAA AACGGTGATG
GGCCCGCTGG CGAATCAGGT TCAGCTTGAA AAAGTGCTGC GTCTGATTCA ACGTGCACGG
GAAGAAGGGG ATACCATTGT TTATGGCGGT GAAACTTTAC CCGGCGAAGG GTACTTTTTA
CAGCCGACAG CGGTAAAAGT GCGTAGTAAA AACAGTACGC TGATGCACGA GGAGACCTTT
GGCCCTGTCT GTAGCTTTAT CGGTTATCAG AATGAAAAAG AGGCGCTTTC GCATATCAAC
GATTCGCCAT TCGGCCTTGC TGCAAGTGTG TGGTCGGAAA ATATATCTAA GGCATTACGC
TACGCTGAAG ATATTGATGC TGGCATGGTG TGGGTCAATA TGCATACCTT CCTCGATCCC
GCGGTACCCT TTGGAGGGAT GAAAGGATCG GGCATAGGTC GTGAATTTGG CAGCGCGTTT
ATTGATGACT ATACCGAACT TAAATCTGTC ATGGTCCGTT ATTAA
 
Protein sequence
MSDVTLLAEV TTFLRQRHGQ FIAGERQAGN GTNFSVTNPA TGKIIADVVS ATPAQAEEAM 
QSARRAFDVW RKMPTLQRGA LLLKLADTLA AHREELAQLE SVCSGKTIML SRGLELDQSV
AFLRYFAGWA GKITGETLNV SLPSMGEERY TAFTQRQPIG VVVGIVPWNF SIMIAIWKLA
AALVCGCTIV IKPSEYTPLT LLRVAELAKE AGFPDGVINV VNGAGGEIAQ QLIAHPDCAK
VSFTGSVATG EKVRRSATSS GKRVTLELGG KNAALFLNDL TAQAMVNGIL EAGYLNQGQI
CAAAERFYLP QEKLDTVMTL LRQRLSEIVP GSPLDEKTVM GPLANQVQLE KVLRLIQRAR
EEGDTIVYGG ETLPGEGYFL QPTAVKVRSK NSTLMHEETF GPVCSFIGYQ NEKEALSHIN
DSPFGLAASV WSENISKALR YAEDIDAGMV WVNMHTFLDP AVPFGGMKGS GIGREFGSAF
IDDYTELKSV MVRY