Gene Information |
Locus tag | SNSL254_A0711 |
Symbol | |
ID | 6484764 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 721475 |
End bp | 722470 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 642736124 |
Product | Sel1 repeat family |
Protein accession | YP_002039897 |
Protein GI | 194445235 |
COG category | [R] General function prediction only |
COG ID | [COG0790] FOG: TPR repeat, SEL1 subfamily |
TIGRFAM ID | |
|
|
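The coordinate and length fields above are mutually consistent, which a short check can confirm. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record: Start bp 721475, End bp 722470.
length = gene_length(721475, 722470)
print(length, protein_length(length))  # prints: 996 331
```

Both results match the Gene Length (996 bp) and Protein Length (331 aa) fields of the record.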
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.556165 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 67 |
Fosmid unclonability p-value | 0.833851 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCATT CCATAACCAG TCATCCCTGC GACAACGTAT CTTTAGCACA ATTAACCGAA CTGGCGCAGT CAGGAAATAG TGAAGCTCAA TATATATTAG GCCGTTTATA TAATGACGAA CGTATAGATG GCAGCGAAGA GGATAAGCTC TCTTTTTATT GGTTGCAGCA GGCAGCTGAA CAAGGACACT GCGAGGCACA ATATTGGCTC GGCTTACGAT ACAAAGACAC GCCTACCGAC ATGAAAGATA ATACGCTGGC TTTATTCTGG TCAGAAAAAG CCGCACAGCA AGGACACCGC CACGCTTTCA ACACATTAGG CTGGGTTCAG GAAGGAGAAA CCGGGATGGC GCCAGATTAT GCCCAGGCCG TTGCCTGGTA TCGCAAAGGA GCAGAACAGA GCCACAACCT TGCGCAATAT AATCTCGGGA GAATGTATCA TTCAGGAACT GGCGTAGAGC AAAATGATAC ACAGGCACTC TACTGGTTTA AACAGGCAGC ATTACAAGGC CATTGCGCCA GCCAGGAAAG ACTGGCGTAT ATGTATGGCA ATGGGAAAGG ATGTCGGAAA AATTTATCCC TTGCCGCCCT TTGGTACAAG AAGAGCGCGC TACAAGAGAG CAGCTACTCG CAATATCAGA TGGGCTATTG TTATTACATC GGAAAAGGTA TCAAACAAGA TTACCAGCAG GCAATATACT GGTTTCGAAA AGCGGCAGAC CAGGGTGACA ATGATGCCTA TAACAGTATC GGCTGGATGT ACAAATGCGG TCATGGCGTC GAGCAAAATT ATTCGCTAGC ACTGGAATGG TTCCACAAAT CAGCAGAATG TAACAATTCA TCGGGCTGGT ACAACCTCGG ATGCATGTAC AGAGATGGAT ACGGTACCGC ACAAGACCTA CAACAAGCGC TCTACTGGTT CAAAAAAGCA CAGCCCACGG GCAAATGGAA CGTCGACGAA GAGATCCGCA AACTGGAAGC CCAACTGCAC GCTTAA
|
Protein sequence | MNHSITSHPC DNVSLAQLTE LAQSGNSEAQ YILGRLYNDE RIDGSEEDKL SFYWLQQAAE QGHCEAQYWL GLRYKDTPTD MKDNTLALFW SEKAAQQGHR HAFNTLGWVQ EGETGMAPDY AQAVAWYRKG AEQSHNLAQY NLGRMYHSGT GVEQNDTQAL YWFKQAALQG HCASQERLAY MYGNGKGCRK NLSLAALWYK KSALQESSYS QYQMGYCYYI GKGIKQDYQQ AIYWFRKAAD QGDNDAYNSI GWMYKCGHGV EQNYSLALEW FHKSAECNNS SGWYNLGCMY RDGYGTAQDL QQALYWFKKA QPTGKWNVDE EIRKLEAQLH A
|
| |
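The gene sequence is shown in space-separated blocks of 10 bases, so the blocks have to be joined before any analysis. The following is a minimal sketch of that clean-up followed by translation and a GC count. It assumes the displayed sequence is already in coding orientation (it begins with ATG and ends with TAA although the gene lies on the minus strand), and it uses the standard codon assignments, which translation table 11 shares for amino acids. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Standard codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Join the 10-base display blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Characters other than A, C, G, T raise KeyError in this sketch.
    """
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First three display blocks of the gene sequence in this record.
print(translate("ATGAATCATT CCATAACCAG TCATCCCTGC"))  # prints: MNHSITSHPC
```

Passing the full Gene sequence field to `translate` should give a string that can be compared with the Protein sequence field once `clean` has been applied to it, and `gc_fraction` can be compared with the reported GC content.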