Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A3379 |
Symbol | |
ID | 6486239 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 3278307 |
End bp | 3279791 |
Gene Length | 1485 bp |
Protein Length | 494 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642738670 |
Product | phenylacetaldehyde dehydrogenase |
Protein accession | YP_002042390 |
Protein GI | 194442770 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 68 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGACG TGACATTACT TGCCGAAGTA ACGACTTTTT TACGCCAACG ACACGGACAA TTTATTGCAG GTGAACGTCA GGCCGGAAAC GGCACGAACT TCTCGGTCAC TAACCCAGCC ACCGGCAAAA TCATCGCCGA CGTTGTGTCG GCAACCCCTG CGCAGGCAGA AGAGGCCATG CAGAGCGCCA GACGGGCGTT TGATGTCTGG CGTAAAATGC CAACGTTACA ACGCGGCACA TTACTGCTGA AACTGGCTGA TATTCTTGCC GCTCATCGTG AAGAGTTAGC TCAACTGGAA AGCGTCTGTT CAGGTAAAAC GATTACGCTG TCGCGCGGTC TTGAACTCGA TCAGTCAGTG GCCTTCCTGC GTTACTTTGC CGGTTGGGCA GGAAAAATAA CCGGTGAAAC GCTGAATGTC TCCCTGCCAT CAATGGGAGA AGAGAGATAC ACCGCGTTTA CCCAACGCCA ACCCATTGGT GTGGTCGTCG GTATTGTGCC GTGGAATTTC TCCATTATGA TTGCTATCTG GAAGCTGGCC GCAGCGCTGG TATGTGGCTG CACCATCGTC ATTAAACCAA GTGAATATAC CCCGCTGACA CTGCTGCGAG TCGCTGAACT GGCGAAAGAG GCAGGTTTCC CTGATGGCGT AATTAACGTG GTAAACGGTG CTGGCGGTGA GATAGCGCAA CAGCTGATCG CGCATCCAGA TTGCGCCAAA GTGAGTTTCA CCGGGTCAGT CGCGACAGGT GAGAAAGTCC GGCGTTCGGC AACATCGTCA GGAAAACGCG TTACCCTCGA ACTGGGAGGG AAAAATGCGG CGCTGTTTCT CAATGATCTC ACGGCACAAG CCATGGTCAA CGGTATTCTT GAAGCCGGTT ATCTGAATCA AGGACAGATT TGTGCTGCCG CAGAGCGTTT TTATCTGCCC CAGGAAAAAC TGGATACGGT CATGACGCTC CTCAGACAAC GGTTATCGGA GATCGTGCCC GGGTCGCCTT TAGATGAAAA AACTGTGATG GGCCCGCTGG CGAATCAGGT TCAGCTTGAA AAAGTGCTGC GTCTGATTCA ACGCGCACGG GAAGAAGGGG ATACCATTGT TTACGGCGGT GAAACTTTAC CCGGCGAAGG GTACTTTTTA CAGCCGACGG CGGTAAAAGT GCGTAGTAAA AACAGTACGC TGATGCACGA GGAGACCTTT GGCCCTGTCT GTAGCTTTAT CGGTTATCAG AATGAAAAAG AGGCGCTTTC GCATATCAAC GCTTCGCCAT TCGGCCTTGC TGCAAGTGTG TGGTCGGAAA ATATGTCTAA GGCATTACGC TACGCTGAAG ATATTGATGC TGGCATGGTG TGGGTCAATA TGCATACCTT CCTCGATCCC GCGGTACCCT TTGGAGGGAT GAAAGGATCG GGTAGTGGTC GTGAATTTGG CAGCGCGTTT ATTGATGACT ATACCGAACT TAAATCTGTC ATGGTCCGTT ATTAA
|
Protein sequence | MSDVTLLAEV TTFLRQRHGQ FIAGERQAGN GTNFSVTNPA TGKIIADVVS ATPAQAEEAM QSARRAFDVW RKMPTLQRGT LLLKLADILA AHREELAQLE SVCSGKTITL SRGLELDQSV AFLRYFAGWA GKITGETLNV SLPSMGEERY TAFTQRQPIG VVVGIVPWNF SIMIAIWKLA AALVCGCTIV IKPSEYTPLT LLRVAELAKE AGFPDGVINV VNGAGGEIAQ QLIAHPDCAK VSFTGSVATG EKVRRSATSS GKRVTLELGG KNAALFLNDL TAQAMVNGIL EAGYLNQGQI CAAAERFYLP QEKLDTVMTL LRQRLSEIVP GSPLDEKTVM GPLANQVQLE KVLRLIQRAR EEGDTIVYGG ETLPGEGYFL QPTAVKVRSK NSTLMHEETF GPVCSFIGYQ NEKEALSHIN ASPFGLAASV WSENMSKALR YAEDIDAGMV WVNMHTFLDP AVPFGGMKGS GSGREFGSAF IDDYTELKSV MVRY
|
| |