Gene Information |
Locus tag | SeD_A3474 |
Symbol | |
ID | 6874461 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 3336034 |
End bp | 3337518 |
Gene Length | 1485 bp |
Protein Length | 494 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642786468 |
Product | phenylacetaldehyde dehydrogenase |
Protein accession | YP_002217105 |
Protein GI | 198244557 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
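The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; both assumptions match the listed values. The function names are illustrative and not part of any database API.

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
START_BP = 3336034
END_BP = 3337518


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon (assumed to be included).
    return gene_len // 3 - 1


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, protein_length(n))  # 1485 494
```

The result agrees with the Gene Length (1485 bp) and Protein Length (494 aa) fields.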
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 85 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGCGACG TGACATTACT TGCCGAAGTA ACGACTTTTT TACGCCAACG ACACGGACAA TTTATTGCAG GTGAACGTCA GGCCGGAAAC GGCACGAACT TCTCGGTCAC TAACCCAGCC ACCGGCAAAA TCATCGCCGA CGTTGTGTCG GCAACCCCTG CGCAGGCAGA AGAGGCCATG CAGAGCGCCA GACGGGCGTT TGATGTCTGG CGTAAAATGC CAACGTTACA ACGCGGCGCA TTACTGCTGA AACTGGCTGA TACTCTTGCC GCTCATCGTG AAGAGTTAGC TCAACTGGAA AGCGTCTGTT CAGGTAAAAC GATTATGCTG TCGCGCGGTC TTGAACTCGA TCAGTCAGTG GCCTTCCTGC GTTACTTTGC CGGTTGGGCA GGAAAAATAA CCGGTGAAAC GCTGAATGTC TCCCTGCCAT CAATGGGAGA AGAGAGATAC ACCGCGTTTA CCCAACGCCA ACCCATTGGG GTGGTCGTCG GTATTGTGCC GTGGAATTTC TCCATTATGA TTGCTATCTG GAAGCTGGCC GCAGCGCTGG TATGTGGCTG CACCATCGTC ATTAAACCAA GTGAATATAC CCCGCTGACA CTGCTGCGAG TCGCTGAACT GGCTAAAGAG GCAGGTTTCC CTGATGGCGT AATTAACGTG GTAAACGGTG CTGGCGGTGA GATAGCGCAA CAGCTGATCG CGCATCCAGA TTGCGCCAAA GTGAGTTTCA CCGGGTCAGT CGCGACAGGT GAGAAAGTCC GGCGTTCGGC AACATCGTCA GGAAAACGCG TTACCCTCGA ACTGGGAGGG AAAAATGCGG CGCTGTTTCT CAATGATCTC TCGGCACAAG CCATGGTCAA CGGTATTCTT GAAGCCGGTT ATCTGAATCA AGGACAGATT TGTGCTGCCG CAGAGCGTTT TTATCTGCCC CAGGAAAAAC TGGATACAGT CATGACGCTC CTCAGGCAAC GGTTATCGGA GATCGTGCCC GGGTCGCCTT TAGATGAAAA AACTGTGATG GGCCCGCTGG CGAATCAGGT TCAGCTTGAA AAAGTGCTGC GTCTGATTCA ACGCGCACGG GAAGAAGGGG ATACCATTGT TTACGGCGGT GAAACTTTAC CCGGCGAAGG GTACTTTTTA CAGCCGACGG CGGTAAAAGT GCGTAGTAAA AACAGTACGC TGATGCACGA GGAGACCTTT GGCCCTGTCT GTAGCTTTAT CGGTTATCAG AATGAAAAAG AGGCGCTTTC GCATATCAAC GCTTCGCCAT TCGGCCTTGC TGCAAGTGTG TGGTCGGAAA ATATGTCTAA GGCATTACGC TACGCTGAAG ATATTGATGC TGGCATGGTG TGGGTCAATA TGCATACCTT CCTCGATCCC GCGGTACCCT TTGGAGGGAT GAAAGGATCG GGTAGTGGTC GTGAATTTGG CAGCGCGTTT ATTGATGACT ATACCGAACT TAAATCTGTC ATGGTCCGTT ATTAA
Protein sequence | MSDVTLLAEV TTFLRQRHGQ FIAGERQAGN GTNFSVTNPA TGKIIADVVS ATPAQAEEAM QSARRAFDVW RKMPTLQRGA LLLKLADTLA AHREELAQLE SVCSGKTIML SRGLELDQSV AFLRYFAGWA GKITGETLNV SLPSMGEERY TAFTQRQPIG VVVGIVPWNF SIMIAIWKLA AALVCGCTIV IKPSEYTPLT LLRVAELAKE AGFPDGVINV VNGAGGEIAQ QLIAHPDCAK VSFTGSVATG EKVRRSATSS GKRVTLELGG KNAALFLNDL SAQAMVNGIL EAGYLNQGQI CAAAERFYLP QEKLDTVMTL LRQRLSEIVP GSPLDEKTVM GPLANQVQLE KVLRLIQRAR EEGDTIVYGG ETLPGEGYFL QPTAVKVRSK NSTLMHEETF GPVCSFIGYQ NEKEALSHIN ASPFGLAASV WSENMSKALR YAEDIDAGMV WVNMHTFLDP AVPFGGMKGS GSGREFGSAF IDDYTELKSV MVRY
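The protein sequence follows from the gene sequence under translation table 11, and the GC content is the fraction of G and C bases. The sketch below is a minimal, dependency-free illustration and not the database's own pipeline. It relies on the fact that table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in permitted start codons), and it assumes the input holds only A, C, G and T plus the whitespace used to group bases in tens. The names `clean`, `gc_fraction` and `translate` are hypothetical helpers.

```python
# Codon table in the conventional TCAG ordering (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDDDEEEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spacing between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate whole codons from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]  # raises KeyError on ambiguous bases such as N
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 18 bases of the gene sequence above.
    print(translate("ATGAGCGACG TGACATTA"))  # MSDVTL
```

Passing the full gene sequence to `translate` and `gc_fraction` should give values comparable to the Protein sequence and GC content fields.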