Gene SeAg_B0697 details


Gene Information

Locus tag: SeAg_B0697
Symbol:
ID: 6793029
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Agona str. SL483
Kingdom: Bacteria
Replicon accession: NC_011149
Strand:
Start bp: 692358
End bp: 693353
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 51%
IMG OID: 642774975
Product: sel1 repeat-containing family protein
Protein accession: YP_002145630
Protein GI: 197251446
COG category: [R] General function prediction only
COG ID: [COG0790] FOG: TPR repeat, SEL1 subfamily
TIGRFAM ID:
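
The start, end, gene length, and protein length fields above agree with one another. A minimal sketch of the arithmetic, assuming 1-based inclusive coordinates and a final codon that is a stop codon (both are assumptions about the record, and the function names are illustrative only):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(692358, 693353)
print(length)                  # 996
print(protein_length(length))  # 331
```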


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.10663
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCATT CCATAACCAG TCATCCCTGC GACAACGTAT CTTTAGCACA ATTAACCGAA 
CTGGCGCAGT CAGGAAATAG TGAAGCTCAA TATATATTAG GCCGTTTATA TAATGACGAA
CGTATAGATG GCAGCGAAGA GGATAAGCTC TCTTTTTATT GGCTACAGCA GGCGGCCGAG
CAAGGGCATT GCGAGGCGCA ATATTGGCTC GGCTTACGAT ATTCAGACAC GCCTACCAGC
ATGAAAGATA ATGCCAAAGC CTCATACTGG TTGGAAAAAG CAGCAAAGCA AGGGCATAAG
CTTGCGCCTA ACGACCTGGG GTGGGTCCTG GAAGGAGAAA CAGGAAGTGA ACCAGATTAC
GCTCAGGCAG TATTCTGGTA TCGCGTCGGT ACGGAACGCG GGCACAGCTA TGCGCAAAAT
AATCTCGGCA AAATGTATGA AGGAGGTGAC GGTGTTGAGA AGAATCATCA ACTGGCCTTT
TATTGGTACA AACAGGCGGC CTTACAAGGT GACGCTACCG CCCAGGAGAA TCTGGCAGAT
ATGTATTGGG ACGGTCGCGG CACGACAAAA AACCTACGCC TGGCTACCTT ATGGTATTTG
AGAAGTGCGC TACAGGATGA AGTCCATTCC CAATTCCAGC TTGGCTGCGC GTATAGCGAA
GGGGAAGGCG TTAAGCAGGA TTATCAGCAG GCAATGCACT GGTATCAACA AGCTGCGGCG
CAGGGAGATA GCAATGCTTA CGTTAATATC GGCTGGATGT ACAAACAAGG ACACGGTGTC
GAGCGTGACG ATGAAGAAGC ACTTAGCTGG TTTCATCGGG CGGCGGAAGC TGGCAACGTT
ACCGCATGGT ATAACCTGGG TTTTATGTAC CGCGACGGGC GCGGTACCGC AGTGGATGTG
AAGCAGGCGC TCTACTGGTT CAAAAAAGCA CAGCCCACGG GCAAATGGAA CGTCGACGAA
GAGATCCGCA AACTGGAAGC CCAACTGCAC GCTTAA
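
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation. A minimal sketch of that cleanup and of a GC-content calculation follows; the helper names are hypothetical, and the short example string is just the first block of the sequence above, not the full gene:

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


first_block = clean_sequence("ATGAATCATT ")
print(gc_percent(first_block))  # 20.0
```

Running `gc_percent(clean_sequence(...))` on the full gene sequence pasted from this record should give a value that can be compared with the GC content field reported above.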
 
Protein sequence
MNHSITSHPC DNVSLAQLTE LAQSGNSEAQ YILGRLYNDE RIDGSEEDKL SFYWLQQAAE 
QGHCEAQYWL GLRYSDTPTS MKDNAKASYW LEKAAKQGHK LAPNDLGWVL EGETGSEPDY
AQAVFWYRVG TERGHSYAQN NLGKMYEGGD GVEKNHQLAF YWYKQAALQG DATAQENLAD
MYWDGRGTTK NLRLATLWYL RSALQDEVHS QFQLGCAYSE GEGVKQDYQQ AMHWYQQAAA
QGDSNAYVNI GWMYKQGHGV ERDDEEALSW FHRAAEAGNV TAWYNLGFMY RDGRGTAVDV
KQALYWFKKA QPTGKWNVDE EIRKLEAQLH A