Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeAg_B0155 |
Symbol | |
ID | 6794044 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | - |
Start bp | 163551 |
End bp | 164540 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642774459 |
Product | aldo-keto reductase yakc (NADP+) |
Protein accession | YP_002145123 |
Protein GI | 197247778 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000000886712 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAATATC GTACATTAGG CGCAAACGGA CCGCGAGTGT CAGCCATCGG ACTGGGATGT ATGGGCATGA GCGCATTTTA CGGCGCTCAT GACGACAGCA CCTCAATTAA GACGCTACAT TATGCGTTAG ATCAGGGGGT AACACTGCTC GATACCGCAG ATATGTATGG CCCTTATACC AATGAAAGGT TAGTTGGAAG AGCCATCGCC GATCGTCGCG ATCGGGTATT TTTAGCGACG AAATTTGGTA TCGTTCTCGA CCCTGCTAAC CCTATGGCGC GTGGCGTCAA TGGCAGACCG GAGTACGTTC GCCGTAGTTG TGAGCAAAGC CTGCAACGCC TGGGGGTCGA TCATATCGAT CTGTACTACC AACATCGCGT TGATCCATCA GTCCCCATAG AAGAGACTGT CGGTGCAATG GCGGACCTGG TGCGCGAGGG AAAAGTGCGT TATCTCGGGC TATCCGAAGC ATCAACGCAA ACGCTGGAAC GCGCCCATAA CGTTCACCCT ATTACCGCGC TGCAAAGTGA GTATTCGCTT TGGTCCCGCG AAGCGGAAAT TTCAGCACTT TCCACCTGTG AACGGTTGGG TATAGGATTC GTCGCTTACA GCCCGCTGGG ACGCGGATTT CTGACCGGTA CGATTAAAAC GCCAGAAGAT TTTGCTGCGA ATGACTTCCG TCGCACAAAT CCCAGGTTCA TGGGTGAGAA CTTCTCGCGC AATTTACGTC TGGCTGAAGC AATAAAACAA ATGGCACGCG AAAAAGAGTG TACCCCCGCA CAATTAGCGC TGGCCTGGCT GCTGGCCCGC AACAGGCACA TCGTTCCCAT TCCCGGCACC CGCCACTGCG CCAGAGTGGA TGAAAACCTC GGCGCGTTAT CACTGATCCT CAGCCCGCAG GAGCTGGCGG CAATTGAGGC GGTTTTTCCT CACGACGCCG CCGCCGGCCC CCGCTACTGG CCGGAAATTA TGTCGACATT AAATCGCTAA
|
Protein sequence | MQYRTLGANG PRVSAIGLGC MGMSAFYGAH DDSTSIKTLH YALDQGVTLL DTADMYGPYT NERLVGRAIA DRRDRVFLAT KFGIVLDPAN PMARGVNGRP EYVRRSCEQS LQRLGVDHID LYYQHRVDPS VPIEETVGAM ADLVREGKVR YLGLSEASTQ TLERAHNVHP ITALQSEYSL WSREAEISAL STCERLGIGF VAYSPLGRGF LTGTIKTPED FAANDFRRTN PRFMGENFSR NLRLAEAIKQ MAREKECTPA QLALAWLLAR NRHIVPIPGT RHCARVDENL GALSLILSPQ ELAAIEAVFP HDAAAGPRYW PEIMSTLNR
|
| |