Gene SeD_A1143 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A1143 
Symbol 
ID6873456 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp1136974 
End bp1138734 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content54% 
IMG OID642784327 
Productlon protease (S16) proteolytic domain-containing protein 
Protein accessionYP_002215001 
Protein GI198245826 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1067] Predicted ATP-dependent protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0143883 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones72 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACCATTA CGAAACTTGC ATGGCGTGAT CTGGTTCCGG ATAGCGAAAG CTATCAGGAG 
ATATTTGCAC AGCCACACGC GACTGACGAA AACGACACCT TACTCAGTGA TACTCAGCCA
CGACTGCAAT TTGCGCTTGA GCAACTTATA CAGCCGTGGG CATCATCCTC TTTTATGCTG
ACTAAAGCGC CTGAAGAGCA AGAGTATCTC ACTTTACTTT CAGATGCCGT CCGCGCTCTG
CAAACCGATG CCGGACAATT AACCGGCGGA CATTATGACG TTTCCGGGCA TACTGTTCAT
TACCGCGCCG CGCAGAATGC GCAAGACAAC TTTGCCACCG TCACACAAGT CGTCAGCGCG
GACTGGGTCG AAGCCGAACA GCTCTTTGGT TGCCTGCGGC AGTATAACGG CGACATTATC
CTGCAGCCGG GACTGGTTCA TCAGGCGAAC GGCGGCGTGC TGATTATTTC CTTACGAACC
CTTCTGGCGC AGCCGTTACT GTGGATGCGT CTGAAAGCCA TCGTTAGCCG CGAGCGTTTT
GACTGGGTGG CCTTTGACGA GTCGCGTCCA TTACCGGTCT CCGTGCCATC AATGCCGCTC
AAACTGAAGG TGATTCTGGT TGGCGAACGT GAATCACTGG CTGATTTTCA GGAGATGGAA
CCGGAGCTCG CGGAACAGGC TATCTACAGT GAATTTGAAG ACAATTTACA GATAGCGGAC
GCAGAAGCTA TGACCCTGTG GTGTCAATGG GTGACGCGTA TCGCTTTACG CGATAATTTG
CCCCCTCCGG CACCGGACGC CTGGCCCGTC CTGATACGCG AGGCTGTGCG CTATACCGGC
GAACAGGATA CGCTGCCTCT TTGCCCACTG TGGATAGCCC GCCAGTTTAA GGAGGCGTCG
CCTTTATGCG AAGGCGATAC CTGCGGCGCA GAAGCGCTCA GCCTGATGCT TGCCCGACGC
GAATGGCGAG AAGGCTTTCT GGCGGAGCGG ATGCAGGATG AGATTCTGCA AGAGCAGATC
CTGATTGAAA CCGAAGGCGA ACGCGTTGGA CAAATCAATG CGCTTTCCGT CATTGAGTTT
CCCGGGCATC CGCGCGCCTT TGGCGAACCG TCGCGAATTA GCTGTGTTGT GCATATCGGC
GATGGCGAAT TTAACGATAT TGAGCGCAAG GCCGAACTTG GCGGGAATAT CCACGCTAAG
GGAATGATGA TTATGCAGGC CTTCCTGATG TCGGAGTTGC AGCTGGAGCA ACAAATTCCC
TTCTCTGCCT CGTTAACCTT TGAGCAGTCC TACAGCGAAG TGGATGGCGA TAGCGCCTCA
ATGGCGGAAT TATGTGCGCT CATCAGCGCG CTGGCCAATG TGCCGGTGAA TCAAAACATT
GCGATTACCG GCTCGGTCGA TCAGTTTGGT CGCGCGCAAC CGGTGGGTGG GCTAAACGAA
AAAATTGAAG GTTTCTTCGC CATCTGCGAG CAGCGGGAAT TAAACGGTAA ACAGGGCGTG
ATTATCCCTG CAGCCAATGT CCGCCATCTC AGTCTTAAAT CTGAACTGCT GCAAGCGGTT
AAAGAAGAGA AGTTCACTAT CTGGGCGGTA GACGACGTGA CCGACGCCTT ACCGCTACTG
TTAAATCTGG TGTGGGATGG CGAAGGTCAA ACGACGTTGA TGCAGACTAT CCAGGAGCGT
ATCGCGCAGG CGACGCAACA GGAAGGCCGT CATCGTTTCC CGTGGCCATT ACGTTGGCTG
AACGCTTTTA TTCCGAACTG A
 
Protein sequence
MTITKLAWRD LVPDSESYQE IFAQPHATDE NDTLLSDTQP RLQFALEQLI QPWASSSFML 
TKAPEEQEYL TLLSDAVRAL QTDAGQLTGG HYDVSGHTVH YRAAQNAQDN FATVTQVVSA
DWVEAEQLFG CLRQYNGDII LQPGLVHQAN GGVLIISLRT LLAQPLLWMR LKAIVSRERF
DWVAFDESRP LPVSVPSMPL KLKVILVGER ESLADFQEME PELAEQAIYS EFEDNLQIAD
AEAMTLWCQW VTRIALRDNL PPPAPDAWPV LIREAVRYTG EQDTLPLCPL WIARQFKEAS
PLCEGDTCGA EALSLMLARR EWREGFLAER MQDEILQEQI LIETEGERVG QINALSVIEF
PGHPRAFGEP SRISCVVHIG DGEFNDIERK AELGGNIHAK GMMIMQAFLM SELQLEQQIP
FSASLTFEQS YSEVDGDSAS MAELCALISA LANVPVNQNI AITGSVDQFG RAQPVGGLNE
KIEGFFAICE QRELNGKQGV IIPAANVRHL SLKSELLQAV KEEKFTIWAV DDVTDALPLL
LNLVWDGEGQ TTLMQTIQER IAQATQQEGR HRFPWPLRWL NAFIPN