Gene SeD_A1013 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A1013 
SymbolclpA 
ID6872570 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp1004109 
End bp1006385 
Gene Length2277 bp 
Protein Length758 aa 
Translation table11 
GC content54% 
IMG OID642784198 
ProductATP-dependent Clp protease ATP-binding subunit 
Protein accessionYP_002214873 
Protein GI198245341 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0542] ATPases with chaperone activity, ATP-binding subunit 
TIGRFAM ID[TIGR02639] ATP-dependent Clp protease ATP-binding subunit clpA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones91 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCAATC AAGAACTGGA ACTCAGTTTA AACATGGCTT TCGCCAGAGC GCGCGAGCAC 
CGTCATGAGT TTATGACCGT CGAGCATCTG TTGCTGGCGC TGCTCAGCAA CCCATCGGCT
CGCGAAGCGC TGGAAGCATG CTCCGTGGAT CTGGTGGCGC TCCGTCAGGA ACTCGAAGCC
TTCATTGAAC AAACCACACC CGTACTGCCT GCCAGTGAAG AAGAGCGTGA TACGCAGCCG
ACGTTAAGTT TCCAGCGTGT CCTGCAGCGT GCCGTCTTCC ATGTTCAGTC TTCCGGGCGT
AGTGAAGTGA CTGGCGCGAA TGTGCTGGTG GCTATCTTTA GCGAACAGGA ATCACAGGCG
GCTTATCTGC TGCGCAAGCA TGAAGTGAGC CGTCTGGATA TCGTGAACTT TATTTCTCAC
GGGACGCGAA AAGACGAACC GAGCCAATCT TCCGATCTCG GCAATCAGCC AACTGGCGAC
GAACAAGCTG GCGGGGAGGA ACGTATGGAA AACTTCACGA CGAATCTTAA CCAACTTGCT
CGCGTGGGCG GCATCGATCC GCTGATTGGT CGTGAAAAAG AACTTGAACG CGCGATCCAG
GTCTTGTGTC GTCGCCGTAA AAATAACCCG TTGCTGGTAG GGGAATCCGG CGTCGGCAAA
ACGGCGATTG CCGAAGGGCT GGCCTGGCGT ATCGTGCAGG GCGATGTGCC GGAAGTGATG
GCCGATTGCA CCATTTACTC TCTGGATATC GGTTCGCTGC TGGCGGGCAC CAAATACCGC
GGCGATTTTG AAAAACGGTT TAAGGCGTTG CTGAAACAGC TTGAGCAGGA TACCAACAGC
ATCCTGTTTA TCGATGAAAT CCATACCATT ATCGGCGCTG GCGCGGCGTC GGGCGGACAG
GTGGATGCGG CAAATCTGAT TAAACCGCTG CTTTCCAGCG GCAAGATCCG GGTGATCGGC
TCAACGACCT ATCAGGAATT CAGCAATATT TTTGAGAAAG ACCGTGCATT AGCGCGCCGT
TTCCAGAAAA TTGATATTAC CGAGCCTTCG GTGGAAGAGA CGGTGCAAAT TATCAACGGC
TTGAAACCTA AGTACGAAGC GCACCACGAC GTGCGTTATA CCGCGAAAGC GGTGCGTGCG
GCGGTCGAGT TGGCGGTAAA ATATATCAAT GACCGCCATC TGCCGGATAA AGCCATTGAC
GTGATTGACG AAGCGGGCGC TCGGGCGCGT CTGATGCCGG TGAGCAAACG TAAGAAAACG
GTCAACGTGG CGGATATTGA GTCCGTAGTG GCGCGAATTG CGCGAATTCC TGAAAAGAGC
GTCTCGCAGA GCGATCGCGA TACGCTGAAG AACCTGGGCG ATCGTCTGAA AATGCTGGTC
TTCGGCCAGG ATAACGCGAT TGAGGCGCTG ACCGAAGCTA TTAAGATGAG TCGTGCCGGT
CTGGGCCATG AGCATAAACC TGTCGGCTCA TTCTTGTTCG CCGGGCCAAC TGGCGTAGGG
AAAACTGAAG TTACGGTACA GCTTTCAAAA GCGCTGGGTA TTGAGCTGTT GCGCTTCGAT
ATGTCCGAAT ATATGGAGCG TCATACGGTG AGCCGTTTGA TCGGCGCGCC TCCGGGATAC
GTCGGTTTCG ACCAGGGCGG GCTGCTGACG GATGCGGTGA TTAAGCATCC TCATGCGGTG
CTGTTGCTGG ATGAGATCGA AAAAGCGCAC CCGGATGTCT TTAACCTGCT GCTGCAGGTG
ATGGATAACG GTACGCTGAC CGATAACAAT GGCCGTAAGG CGGATTTCCG CAACGTGGTG
CTGGTGATGA CCACCAACGC CGGCGTGCGA GAAACCGAAC GTAAATCTAT TGGTCTTATT
CATCAGGACA ACAGTACCGA TGCGATGGGC GAGATCAAGA AAGTGTTTAC GCCGGAGTTC
CGTAACCGTC TCGACAACAT TATTTGGTTC GATCATCTGT CTGGCGAGGT GATTCATCAG
GTTGTCGATA AGTTTATCGT CGAGTTGCAG GCTCAGTTGG ATCAGAAAGG CGTCTCTCTG
GAAGTCAGTC AGGAAGCGCG CGACTGGCTG GCGGAAAAGG GCTATGACCG GGCGATGGGC
GCACGACCGA TGGCGCGTGT GATTCAGGAT AACCTGAAAA AACCGCTGGC CAATGAGTTG
CTGTTTGGAT CGCTGGTTGA TGGCGGACAG GTCACCGTCG CGCTGGATAA AGAGAAAAAT
GCGTTGACGT ATGGCTTCCA GAGCGCGCAA AAGCACAAGC CGGAAGCCGC GCATTAA
 
Protein sequence
MLNQELELSL NMAFARAREH RHEFMTVEHL LLALLSNPSA REALEACSVD LVALRQELEA 
FIEQTTPVLP ASEEERDTQP TLSFQRVLQR AVFHVQSSGR SEVTGANVLV AIFSEQESQA
AYLLRKHEVS RLDIVNFISH GTRKDEPSQS SDLGNQPTGD EQAGGEERME NFTTNLNQLA
RVGGIDPLIG REKELERAIQ VLCRRRKNNP LLVGESGVGK TAIAEGLAWR IVQGDVPEVM
ADCTIYSLDI GSLLAGTKYR GDFEKRFKAL LKQLEQDTNS ILFIDEIHTI IGAGAASGGQ
VDAANLIKPL LSSGKIRVIG STTYQEFSNI FEKDRALARR FQKIDITEPS VEETVQIING
LKPKYEAHHD VRYTAKAVRA AVELAVKYIN DRHLPDKAID VIDEAGARAR LMPVSKRKKT
VNVADIESVV ARIARIPEKS VSQSDRDTLK NLGDRLKMLV FGQDNAIEAL TEAIKMSRAG
LGHEHKPVGS FLFAGPTGVG KTEVTVQLSK ALGIELLRFD MSEYMERHTV SRLIGAPPGY
VGFDQGGLLT DAVIKHPHAV LLLDEIEKAH PDVFNLLLQV MDNGTLTDNN GRKADFRNVV
LVMTTNAGVR ETERKSIGLI HQDNSTDAMG EIKKVFTPEF RNRLDNIIWF DHLSGEVIHQ
VVDKFIVELQ AQLDQKGVSL EVSQEARDWL AEKGYDRAMG ARPMARVIQD NLKKPLANEL
LFGSLVDGGQ VTVALDKEKN ALTYGFQSAQ KHKPEAAH