Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A4834 |
Symbol | |
ID | 6484260 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 4708304 |
End bp | 4709323 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642740047 |
Product | hypothetical protein |
Protein accession | YP_002043725 |
Protein GI | 194445237 |
COG category | [R] General function prediction only |
COG ID | [COG1064] Zn-dependent alcohol dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.809336 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 68 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGATAA TAAAAAGCTA TGCCGCCAAA GAGGCGGGCG GCGAACTCGA ACTCTATGAA TATGACGCGG GAGAACTCCA ACCGGAAGAT GTCGAGGTAC GGGTCGACTA CTGCGGGATC TGCCATTCCG ATCTGTCAAT GATCGACAAT GAATGGGGGT TCTCTCAATA CCCTCTGGTT GCCGGACATG AGGTCATCGG TCGGTTGGCC GCACTCGGTA GCGCGGCACA GGATAAGGGA CTAAAAGTCG GCCAGCGCGT TGGAATCGGC TGGACGGCGC GCAGCTGCGG ACACTGCGAT GCCTGTATCA GCGGCAATCA AATTAACTGC CTGGAAGGGG CAGTGCCCAC TATCCTCAAT CGTGGCGGTT TTGCCGAGAA GCTTCGCGCA GGCTGGCAGT GGGTAATTCC TCTTCCGGAG AATATGGATA TGGCGTCCGC AGGCCCGCTG TTATGTGGCG GCATTACGGT CTTTAAACCG CTACTGATGC ACCATATTAC TGCTACCAGC CGCGTTGGCG TCATCGGTAT CGGCGGGCTG GGGCATATCG CCATAAAGCT GTTACACGCA ATGGGCTGCG AAGTCACCGC GTTCAGCTCC AATCCATCGA AGGAGCAGGA GGTGCTGGCG ATGGGGGCCA ATAACGTGGT GAACAGCCGC GATCCGGAAG CGTTAAAAGC ACTGGCGGGC CAGTTCGATC TCATTATTAA CACGGTCAAC GTCGATCTCG ACTGGCAGCC CTACTTCGAA GCGCTGACCT ATGGCGGCAA CTTCCATACC GTTGGGGCCG TATTGAAGCC GCTGCCCGTA CCGGCGTTTA CATTGATTGC CGGCGATCGC AGTATCTCAG GCTCGGCAAC CGGAACGCCA TATGAACTTC GCAAACTGAT GAAATTCGCC GGACGCAGCA AAGTCGCGCC CACCACGGAA CTGTTCGCAA TGTCACAAAT CAACGAGGCT ATTCAGCACG TTCGCGACGG CAAAGCCCGC TATCGTGTAG TGCTAAAAGC TGACTTCTGA
|
Protein sequence | MTIIKSYAAK EAGGELELYE YDAGELQPED VEVRVDYCGI CHSDLSMIDN EWGFSQYPLV AGHEVIGRLA ALGSAAQDKG LKVGQRVGIG WTARSCGHCD ACISGNQINC LEGAVPTILN RGGFAEKLRA GWQWVIPLPE NMDMASAGPL LCGGITVFKP LLMHHITATS RVGVIGIGGL GHIAIKLLHA MGCEVTAFSS NPSKEQEVLA MGANNVVNSR DPEALKALAG QFDLIINTVN VDLDWQPYFE ALTYGGNFHT VGAVLKPLPV PAFTLIAGDR SISGSATGTP YELRKLMKFA GRSKVAPTTE LFAMSQINEA IQHVRDGKAR YRVVLKADF
|
| |