Gene Information |
Locus tag | SNSL254_A2689 |
Symbol | |
ID | 6486880 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 2608716 |
End bp | 2610179 |
Gene Length | 1464 bp |
Protein Length | 487 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642738021 |
Product | TPR repeat-containing protein YfgC |
Protein accession | YP_002041755 |
Protein GI | 194444843 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
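The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon (both assumptions agree with the values in this record). The `gc_percent` helper shows one common way to compute GC content from a sequence written in space-separated blocks; it is not necessarily how the database derived its 57% figure.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


def gc_percent(raw_sequence: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    seq = "".join(raw_sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    length = gene_length(2608716, 2610179)
    print(length)                           # 1464, matches "Gene Length"
    print(expected_protein_length(length))  # 487, matches "Protein Length"
```

Passing the full gene sequence from the Sequence section to `gc_percent` gives a value that can be compared with the reported GC content.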
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.534359 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 76 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTTCAGGC AGTTGAAAAA AAACCTGGTG GCAACCCTCA TTGCAGCATT GGCTCTCGGT CAGGTCGCGC CCGCATTTGC CGACCCTGCC GACACGCTGC CCGATATGGG AACCTCGGCA GGAAGCACGC TTTCTATCGG ACAAGAGATG CAAATGGGCG ACTTTTATGT ACGCCAGCTA CGCGGTAGCG CGCCGTTAAT CAACGATCCG CTGCTGGTAC AATACATTAA CGCGCTGGGT ATGCGTCTGG TCTCGCACGC CGACTCCGTC AAAACGCCCT TCCATTTTTT CTTGATCAAT AATGACGAAA TCAACGCCTT CGCGTTCTTT GGCGGCAATG TGGTGCTGCA CTCGGCGCTT TTTCGCTACG CGGATAACGA AAGCCAGCTA GCTTCAGTCA TGGCGCATGA AATCTCCCAC GTGACGCAGC GCCACCTGGC GCGCGCGATG GAAGATCAAA AGCGCAGCGC GCCGCTTACC TGGGTGGGCG CGCTGGGTTC CATTTTGCTG GCCATGGCCA GCCCACAGGC CGGTATGGCG GCGCTAACCG GTACTCTGGC GGGAACGCGC CAGGGAATGA TAAGTTTCAC CCAGCAAAAT GAGCAAGAAG CCGACCGTAT TGGTATTCAG GTACTGCAAC GCGCCGGATT TGACCCACAG GCGATGCCCT CTTTCCTCGA AAAACTGCTC GACCAGGCGC GTTACTCCAC GCGCCCGCCT GAAATTTTGC TCACTCACCC CTTACCGGAA AGCCGCCTTG CGGATGCCCG CAACCGTGCC AACCAGATGC GCCCGGTCGT GGTGCAATCT TCCGCCGACT TTTATTTCGC CAAAGCGCGC GCCCTGGGAA TGTACAATTC CGGACGTAAC CAGCTCACCA GCGACCTGCT GGATCAGTGG TCTAAAGGCA ACGTACGTCA GCAACATGCG GCGCAATATG GCCGGGCGTT GCAGGCGATG GAAGCGAGCA AGTACGATGA AGCGCGCAAA ACGTTGCAGC CGCTATTAAG CGCGGAACCG AATAATGCCT GGTATCTTGA CCTCGCTACC GATATTGACC TGGGGCAGAA AAGAGCCAAC GACGCGATTA ATCGCCTGAA AAATGCCCGC GATCTGCGCG TTAATCCTGT GCTGCAGTTA AACCTCGCCA ATGCGTACCT CCAGGGAGGC CAGCCGAAAG CGGCGGAAAC CATTCTGAAT CGCTACACCT TTAGCCATAA AGATGACGGT AACGGCTGGG ATCTGCTTGC TCAGGCCGAA GCCGCGCTGA ACAACCGCGA TCAGGAGCTG GCGGCGCGCG CTGAAAGTTA TGCGCTGGCG GGACGACTGG ATCAGGCAAT TTCACTGCTC AGTAGCGCCA GCGCCCAGGC AAAACTGGGT AGCCAGCAAC AGGCGCGTTA CGATGCGCGT ATCGACCAGT TGCGCCAGTT ACAGGAACGC TTCAAGCCAT ACACGAAAAT GTAA
Protein sequence | MFRQLKKNLV ATLIAALALG QVAPAFADPA DTLPDMGTSA GSTLSIGQEM QMGDFYVRQL RGSAPLINDP LLVQYINALG MRLVSHADSV KTPFHFFLIN NDEINAFAFF GGNVVLHSAL FRYADNESQL ASVMAHEISH VTQRHLARAM EDQKRSAPLT WVGALGSILL AMASPQAGMA ALTGTLAGTR QGMISFTQQN EQEADRIGIQ VLQRAGFDPQ AMPSFLEKLL DQARYSTRPP EILLTHPLPE SRLADARNRA NQMRPVVVQS SADFYFAKAR ALGMYNSGRN QLTSDLLDQW SKGNVRQQHA AQYGRALQAM EASKYDEARK TLQPLLSAEP NNAWYLDLAT DIDLGQKRAN DAINRLKNAR DLRVNPVLQL NLANAYLQGG QPKAAETILN RYTFSHKDDG NGWDLLAQAE AALNNRDQEL AARAESYALA GRLDQAISLL SSASAQAKLG SQQQARYDAR IDQLRQLQER FKPYTKM
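The relationship between the gene sequence and the protein sequence can be reproduced with a short translation routine. The following is a minimal sketch, not the database's own pipeline; the names `CODON_TABLE` and `translate` are hypothetical. It relies on the fact that translation table 11 (bacterial, archaeal and plant plastid code) assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. Since this gene begins with ATG, alternative start codons are not handled here.

```python
# Codon table in the conventional compact layout: bases ordered T, C, A, G,
# with the first codon position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(raw_dna: str) -> str:
    """Translate a coding sequence up to (not including) the first stop codon.

    Whitespace between 10-base blocks is ignored. Codons containing
    characters other than A, C, G, T are reported as 'X'.
    Assumption: the sequence starts in frame with an ATG start codon, so
    table 11's alternative start codons need no special treatment.
    """
    dna = "".join(raw_dna.split()).upper()
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE.get(dna[pos:pos + 3], "X")
        if amino_acid == "*":  # stop codon ends translation
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence in this record.
    print(translate("ATGTTCAGGC AGTTGAAAAA AAACCTGGTG"))  # MFRQLKKNLV
```

Applying `translate` to the complete gene sequence and stripping the spaces from the protein sequence field allows a direct string comparison of the two.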