Gene Information |
Locus tag | SeD_A1143 |
Symbol | |
ID | 6873456 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 1136974 |
End bp | 1138734 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642784327 |
Product | lon protease (S16) proteolytic domain-containing protein |
Protein accession | YP_002215001 |
Protein GI | 198245826 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1067] Predicted ATP-dependent protease |
TIGRFAM ID | |
|
|
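The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 1136974
end_bp = 1138734
gene_length_bp = 1761
protein_length_aa = 586

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends in one stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # True
print(remainder == 0)                            # True
print(expected_protein_aa == protein_length_aa)  # True
```

Under these assumptions the reported Gene Length (1761 bp) and Protein Length (586 aa) agree with the coordinates.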
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0143883 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 72 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGACCATTA CGAAACTTGC ATGGCGTGAT CTGGTTCCGG ATAGCGAAAG CTATCAGGAG ATATTTGCAC AGCCACACGC GACTGACGAA AACGACACCT TACTCAGTGA TACTCAGCCA CGACTGCAAT TTGCGCTTGA GCAACTTATA CAGCCGTGGG CATCATCCTC TTTTATGCTG ACTAAAGCGC CTGAAGAGCA AGAGTATCTC ACTTTACTTT CAGATGCCGT CCGCGCTCTG CAAACCGATG CCGGACAATT AACCGGCGGA CATTATGACG TTTCCGGGCA TACTGTTCAT TACCGCGCCG CGCAGAATGC GCAAGACAAC TTTGCCACCG TCACACAAGT CGTCAGCGCG GACTGGGTCG AAGCCGAACA GCTCTTTGGT TGCCTGCGGC AGTATAACGG CGACATTATC CTGCAGCCGG GACTGGTTCA TCAGGCGAAC GGCGGCGTGC TGATTATTTC CTTACGAACC CTTCTGGCGC AGCCGTTACT GTGGATGCGT CTGAAAGCCA TCGTTAGCCG CGAGCGTTTT GACTGGGTGG CCTTTGACGA GTCGCGTCCA TTACCGGTCT CCGTGCCATC AATGCCGCTC AAACTGAAGG TGATTCTGGT TGGCGAACGT GAATCACTGG CTGATTTTCA GGAGATGGAA CCGGAGCTCG CGGAACAGGC TATCTACAGT GAATTTGAAG ACAATTTACA GATAGCGGAC GCAGAAGCTA TGACCCTGTG GTGTCAATGG GTGACGCGTA TCGCTTTACG CGATAATTTG CCCCCTCCGG CACCGGACGC CTGGCCCGTC CTGATACGCG AGGCTGTGCG CTATACCGGC GAACAGGATA CGCTGCCTCT TTGCCCACTG TGGATAGCCC GCCAGTTTAA GGAGGCGTCG CCTTTATGCG AAGGCGATAC CTGCGGCGCA GAAGCGCTCA GCCTGATGCT TGCCCGACGC GAATGGCGAG AAGGCTTTCT GGCGGAGCGG ATGCAGGATG AGATTCTGCA AGAGCAGATC CTGATTGAAA CCGAAGGCGA ACGCGTTGGA CAAATCAATG CGCTTTCCGT CATTGAGTTT CCCGGGCATC CGCGCGCCTT TGGCGAACCG TCGCGAATTA GCTGTGTTGT GCATATCGGC GATGGCGAAT TTAACGATAT TGAGCGCAAG GCCGAACTTG GCGGGAATAT CCACGCTAAG GGAATGATGA TTATGCAGGC CTTCCTGATG TCGGAGTTGC AGCTGGAGCA ACAAATTCCC TTCTCTGCCT CGTTAACCTT TGAGCAGTCC TACAGCGAAG TGGATGGCGA TAGCGCCTCA ATGGCGGAAT TATGTGCGCT CATCAGCGCG CTGGCCAATG TGCCGGTGAA TCAAAACATT GCGATTACCG GCTCGGTCGA TCAGTTTGGT CGCGCGCAAC CGGTGGGTGG GCTAAACGAA AAAATTGAAG GTTTCTTCGC CATCTGCGAG CAGCGGGAAT TAAACGGTAA ACAGGGCGTG ATTATCCCTG CAGCCAATGT CCGCCATCTC AGTCTTAAAT CTGAACTGCT GCAAGCGGTT AAAGAAGAGA AGTTCACTAT CTGGGCGGTA GACGACGTGA CCGACGCCTT ACCGCTACTG TTAAATCTGG TGTGGGATGG CGAAGGTCAA ACGACGTTGA TGCAGACTAT CCAGGAGCGT ATCGCGCAGG CGACGCAACA GGAAGGCCGT CATCGTTTCC CGTGGCCATT ACGTTGGCTG AACGCTTTTA TTCCGAACTG A
|
Protein sequence | MTITKLAWRD LVPDSESYQE IFAQPHATDE NDTLLSDTQP RLQFALEQLI QPWASSSFML TKAPEEQEYL TLLSDAVRAL QTDAGQLTGG HYDVSGHTVH YRAAQNAQDN FATVTQVVSA DWVEAEQLFG CLRQYNGDII LQPGLVHQAN GGVLIISLRT LLAQPLLWMR LKAIVSRERF DWVAFDESRP LPVSVPSMPL KLKVILVGER ESLADFQEME PELAEQAIYS EFEDNLQIAD AEAMTLWCQW VTRIALRDNL PPPAPDAWPV LIREAVRYTG EQDTLPLCPL WIARQFKEAS PLCEGDTCGA EALSLMLARR EWREGFLAER MQDEILQEQI LIETEGERVG QINALSVIEF PGHPRAFGEP SRISCVVHIG DGEFNDIERK AELGGNIHAK GMMIMQAFLM SELQLEQQIP FSASLTFEQS YSEVDGDSAS MAELCALISA LANVPVNQNI AITGSVDQFG RAQPVGGLNE KIEGFFAICE QRELNGKQGV IIPAANVRHL SLKSELLQAV KEEKFTIWAV DDVTDALPLL LNLVWDGEGQ TTLMQTIQER IAQATQQEGR HRFPWPLRWL NAFIPN
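The relationship between the two sequences above can be illustrated by translating the start and end of the gene sequence. This is a hedged sketch, not the database's own pipeline. It assumes the gene sequence is listed in coding orientation (it begins with TTG and ends with TGA), uses the standard codon assignments that translation table 11 shares with the standard code, and treats only ATG, GTG and TTG as initiators read as methionine, which is a simplified subset of the table 11 start codons. The names `translate`, `CODON_TABLE` and `ALT_STARTS` are hypothetical.

```python
from itertools import product

# Codon table in the usual TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: simplified subset of start codons that are read as Met
# when they occupy the first codon position.
ALT_STARTS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in ALT_STARTS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("TTGACCATTA CGAAACTTGC ATGGCGTGAT"))  # MTITKLAWRD

# Last 15 bases of the gene sequence, in frame, including the TGA stop.
print(translate("TTTATTCCGAACTGA"))  # FIPN
```

The first call reproduces the first ten residues of the protein sequence, and the second reproduces its last four residues before the stop codon.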