Gene Information |
Locus tag | SNSL254_pSN254_0013 |
Symbol | |
ID | 4929517 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_009140 |
Strand | + |
Start bp | 8485 |
End bp | 9462 |
Gene Length | 978 bp |
Protein Length | 325 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642572317 |
Product | signal peptide peptidase SppA domain-containing protein |
Protein accession | YP_001101892 |
Protein GI | 134047167 |
COG category | [O] Posttranslational modification, protein turnover, chaperones; [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0616] Periplasmic serine proteases (ClpP class) |
TIGRFAM ID | [TIGR00706] signal peptide peptidase SppA, 36K type |
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.81268 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.0000208532 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAAAAGG AAAAACTAAT GGCAGAAAAC AAAGAGATCC CCTGGGAGAA AGAACTCATT GAGAAGTACA TGTTCACCCT GCATAAAGAG CAGGTGAAAG ACCGGCGCTG GCGGACCATG TTGCGTGTGC TTCGCGCATC TGGTTTCGTG CTTCTCATGA TCGGCTTCAT CATTTTGGCA TCTAATCCGG GTGGGATGCC GTGGCAAAGC GCCAAGGCAG GAGCCCCTCA CACCGCGTAT ATCAACATCC GTGGTGAAAT TGCTGCTGGC ACTCTGGCCG ATGCTGATCA CCTTATCCCG TCCATCCAAG CTGCATTCGA CAACCCGAAC TCACAAGCTA TCGTGCTTCG CATAAACAGC CCTGGCGGTA GCCCGGTTCA AGCAGGACGG ATTTATGACG AAGTGAAGGC GCAGCGAGCC CTTCATCCGG AGAAAAAGGT CTACGCCATC ATTGATGACA TCGGTGCCTC TGGCGGTTAC TACATCGCCT CTGCTGCGGA TGAAATCTAT GCTGACCGCG CCAGCCTTGT CGGTTCTATC GGCGTCATCA GCTCGGGGTT TGGATTCACC GGCTTGATGG ACAAGCTCGG CATCGAGCGC CGGGCTATCA CTTCCGGAGA GCACAAAGCG CTTCTCGACC CATTCTCCCC TCTTACCTCT GACATGAAGA AATTCTGGGA GGGCGTTCTA TCGAAAACCC ACCAGCAGTT CATCGAACGA GTGAAGGCTG GGCGGGGTGA TCGACTGAAA GACGACCCAG AGGTGTTTTC TGGATTGCTC TGGAACGGGG AGCAGGCCAA AGACATTGGG CTGATTGATG GCCTGGGTAG TTTGAACTCC GTGGCGCGAG ACGTCATCCA CCAGAGCAAC TTGGTGGACT ACACACCAAC CGAAGACATC ATCCGGCGAC TGACCCAACG AGCGAAGCTC GAAGCCAGTT CCTTCGTGCA AGAACTCAGC GCTGTGAAAG TTTACTGA
Protein sequence | MQKEKLMAEN KEIPWEKELI EKYMFTLHKE QVKDRRWRTM LRVLRASGFV LLMIGFIILA SNPGGMPWQS AKAGAPHTAY INIRGEIAAG TLADADHLIP SIQAAFDNPN SQAIVLRINS PGGSPVQAGR IYDEVKAQRA LHPEKKVYAI IDDIGASGGY YIASAADEIY ADRASLVGSI GVISSGFGFT GLMDKLGIER RAITSGEHKA LLDPFSPLTS DMKKFWEGVL SKTHQQFIER VKAGRGDRLK DDPEVFSGLL WNGEQAKDIG LIDGLGSLNS VARDVIHQSN LVDYTPTEDI IRRLTQRAKL EASSFVQELS AVKVY
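
The length fields in this record can be cross-checked against the coordinates and sequences. The following is a minimal Python sketch, not part of the source database. It assumes the Start bp and End bp values are 1-based and inclusive (which is consistent with the stated 978 bp gene length) and that the CDS ends in a single stop codon. The helper names `clean_sequence`, `gene_length`, `protein_length`, and `gc_percent` are hypothetical and introduced only for illustration.

```python
def clean_sequence(text: str) -> str:
    """Remove the whitespace that splits the record's sequences into 10-character blocks."""
    return "".join(text.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS: codon count minus one stop codon (assumption)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    length = gene_length(8485, 9462)
    print(length, protein_length(length))  # prints: 978 325

    # Only the first two blocks of the gene sequence, to show the cleaning step.
    first_blocks = "ATGCAAAAGG AAAAACTAAT"
    print(len(clean_sequence(first_blocks)), gc_percent(first_blocks))  # prints: 20 25.0
```

Passing the full Gene sequence value to `gc_percent` gives a figure to compare with the GC content field, and `len(clean_sequence(...))` applied to the Protein sequence value can be compared with the Protein Length field.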