Gene Information |
Locus tag | SeAg_B0697 |
Symbol | |
ID | 6793029 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | - |
Start bp | 692358 |
End bp | 693353 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642774975 |
Product | sel1 repeat-containing family protein |
Protein accession | YP_002145630 |
Protein GI | 197251446 |
COG category | [R] General function prediction only |
COG ID | [COG0790] FOG: TPR repeat, SEL1 subfamily |
TIGRFAM ID | |
|
|
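The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration, not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that does not count toward the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information table above.
start_bp = 692358
end_bp = 693353
gene_length_bp = 996
protein_length_aa = 331

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
total_codons = span_bp // 3
coding_codons = total_codons - 1

def record_is_consistent() -> bool:
    """Return True if the coordinates, gene length and protein length agree."""
    return (
        span_bp == gene_length_bp
        and span_bp % 3 == 0
        and coding_codons == protein_length_aa
    )

print(span_bp, coding_codons, record_is_consistent())
```

Under those assumptions the span equals the listed gene length of 996 bp, and 996 / 3 - 1 gives the listed protein length of 331 aa.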
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.10663 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCATT CCATAACCAG TCATCCCTGC GACAACGTAT CTTTAGCACA ATTAACCGAA CTGGCGCAGT CAGGAAATAG TGAAGCTCAA TATATATTAG GCCGTTTATA TAATGACGAA CGTATAGATG GCAGCGAAGA GGATAAGCTC TCTTTTTATT GGCTACAGCA GGCGGCCGAG CAAGGGCATT GCGAGGCGCA ATATTGGCTC GGCTTACGAT ATTCAGACAC GCCTACCAGC ATGAAAGATA ATGCCAAAGC CTCATACTGG TTGGAAAAAG CAGCAAAGCA AGGGCATAAG CTTGCGCCTA ACGACCTGGG GTGGGTCCTG GAAGGAGAAA CAGGAAGTGA ACCAGATTAC GCTCAGGCAG TATTCTGGTA TCGCGTCGGT ACGGAACGCG GGCACAGCTA TGCGCAAAAT AATCTCGGCA AAATGTATGA AGGAGGTGAC GGTGTTGAGA AGAATCATCA ACTGGCCTTT TATTGGTACA AACAGGCGGC CTTACAAGGT GACGCTACCG CCCAGGAGAA TCTGGCAGAT ATGTATTGGG ACGGTCGCGG CACGACAAAA AACCTACGCC TGGCTACCTT ATGGTATTTG AGAAGTGCGC TACAGGATGA AGTCCATTCC CAATTCCAGC TTGGCTGCGC GTATAGCGAA GGGGAAGGCG TTAAGCAGGA TTATCAGCAG GCAATGCACT GGTATCAACA AGCTGCGGCG CAGGGAGATA GCAATGCTTA CGTTAATATC GGCTGGATGT ACAAACAAGG ACACGGTGTC GAGCGTGACG ATGAAGAAGC ACTTAGCTGG TTTCATCGGG CGGCGGAAGC TGGCAACGTT ACCGCATGGT ATAACCTGGG TTTTATGTAC CGCGACGGGC GCGGTACCGC AGTGGATGTG AAGCAGGCGC TCTACTGGTT CAAAAAAGCA CAGCCCACGG GCAAATGGAA CGTCGACGAA GAGATCCGCA AACTGGAAGC CCAACTGCAC GCTTAA
|
Protein sequence | MNHSITSHPC DNVSLAQLTE LAQSGNSEAQ YILGRLYNDE RIDGSEEDKL SFYWLQQAAE QGHCEAQYWL GLRYSDTPTS MKDNAKASYW LEKAAKQGHK LAPNDLGWVL EGETGSEPDY AQAVFWYRVG TERGHSYAQN NLGKMYEGGD GVEKNHQLAF YWYKQAALQG DATAQENLAD MYWDGRGTTK NLRLATLWYL RSALQDEVHS QFQLGCAYSE GEGVKQDYQQ AMHWYQQAAA QGDSNAYVNI GWMYKQGHGV ERDDEEALSW FHRAAEAGNV TAWYNLGFMY RDGRGTAVDV KQALYWFKKA QPTGKWNVDE EIRKLEAQLH A
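The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, compute a GC fraction, and translate DNA to protein. It is an illustration under stated assumptions: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, only the first 30 bases of the gene sequence are used, and the sequence is assumed to be given in coding orientation (it begins with ATG and ends with TAA even though the gene lies on the minus strand). Translation table 11 assigns the same amino acids to codons as the standard table, so the usual compact codon string is used here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); table 11 shares these
# amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()

def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)

def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First three blocks of the gene sequence, copied from the record.
prefix = clean("ATGAATCATT CCATAACCAG TCATCCCTGC")
print(translate(prefix))  # MNHSITSHPC, the first block of the protein sequence
```

Pasting the full gene sequence into `clean` and passing the result to `gc_fraction` and `translate` would allow the listed GC content and protein sequence to be compared against computed values.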