Gene EcSMS35_2278 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2278 
SymbolclpA 
ID6144626 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2303366 
End bp2305642 
Gene Length2277 bp 
Protein Length758 aa 
Translation table11 
GC content53% 
IMG OID641617152 
ProductATP-dependent Clp protease ATP-binding subunit 
Protein accessionYP_001744325 
Protein GI170679955 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0542] ATPases with chaperone activity, ATP-binding subunit 
TIGRFAM ID[TIGR02639] ATP-dependent Clp protease ATP-binding subunit clpA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0258572 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.0153878 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCAATC AAGAACTGGA ACTCAGTTTA AATATGGCTT TCGCCAGAGC GCGCGAGCAC 
CGTCATGAGT TTATGACCGT CGAGCACTTG TTACTGGCGC TGCTCAGTAA CCCATCTGCC
CGGGAGGCGC TGGAAGCGTG TTCTGTGGAT TTGGTTGCGC TCCGTCAGGA ACTGGAAGCC
TTTATTGAAC AAACCACACC CGTTCTGCCT GCCAGTGAAG AGGAGCGCGA CACACAGCCG
ACGCTGAGTT TTCAGCGTGT ACTGCAACGT GCGGTCTTCC ATGTCCAGTC CTCCGGTCGC
AATGAGGTTA CCGGTGCAAA CGTTCTGGTC GCTATTTTTA GCGAACAGGA GTCGCAGGCG
GCATATCTGT TGCGTAAACA TGAAGTCAGC CGTCTCGATG TGGTGAACTT TATCTCTCAT
GGCACGCGTA AAGACGAGCC GACACAGTCT TCTGATCCTG GCAGCCAGCC AAACAGCGAA
GAACAAGCTG GTGGGGAGGA ACGTATGGAG AATTTCACGA CGAACCTGAA TCAGCTTGCG
CGAGTGGGCG GAATCGACCC ACTGATTGGT CGTGAGAAGG AGCTGGAGCG TGCTATTCAG
GTTCTCTGCC GTCGCCGTAA AAACAACCCG CTGCTGGTGG GGGAATCTGG TGTCGGTAAA
ACCGCGATTG CGGAAGGTCT TGCCTGGCGA ATTGTTCAGG GCGATGTGCC GGAAGTGATG
GCTGACTGTA CGATTTACTC TCTCGATATC GGTTCTCTGT TAGCGGGCAC TAAATATCGC
GGCGACTTTG AAAAACGTTT TAAAGCGTTG CTCAAGCAGC TGGAGCAGGA CACTAACAGC
ATCCTGTTTA TTGATGAGAT CCACACCATT ATCGGTGCGG GTGCAGCGTC TGGTGGTCAG
GTCGATGCGG CTAACCTGAT CAAACCGTTG CTCTCCAGCG GTAAAATTCG CGTAATTGGT
TCGACAACCT ATCAGGAGTT CAGCAACATT TTCGAGAAAG ACCGTGCTCT GGCGCGTCGC
TTCCAGAAAA TTGATATTAC TGAACCGTCG ATCGAAGAAA CTGTTCAAAT CATCAATGGC
CTGAAACCGA AGTACGAAGC GCACCATGAC GTGCGTTATA CCGCAAAAGC GGTGCGTGCA
GCGGTAGAGC TGGCGGTGAA ATACATTAAC GATCGTCATC TGCCGGATAA AGCCATTGAC
GTTATCGACG AAGCGGGCGC TCGCGCACGC CTGATGCCGG TAAGCAAACG CAAGAAAACC
GTTAATGTGG CGGATATTGA GTCTGTGGTG GCCCGTATTG CGCGCATTCC AGAGAAGAGT
GTTTCGCAGA GTGACCGCGA TACCCTGAAA AACCTCGGCG ATCGCCTGAA AATGCTGGTC
TTCGGTCAGG ATAAAGCCAT TGAGGCGCTG ACTGAAGCCA TTAAGATGGC GCGTGCAGGT
TTAGGTCACG AACATAAACC GGTCGGTTCG TTCCTGTTTG CCGGTCCTAC CGGGGTCGGG
AAAACAGAGG TGACGGTACA GCTTTCGAAA GCTTTGGGCA TTGAGCTTCT GCGCTTTGAT
ATGTCCGAGT ATATGGAACG CCATACCGTC AGCCGTCTGA TTGGTGCGCC TCCGGGATAC
GTTGGTTTTG ATCAGGGGGG GCTGCTGACC GATGCGGTCA TCAAGCATCC GCACGCGGTA
CTGCTGCTGG ACGAAATCGA GAAAGCGCAT CCGGACGTGT TCAATATTCT GTTGCAGGTG
ATGGACAACG GTACGCTGAC CGATAACAAC GGACGCAAAG CGGACTTCCG TAACGTGGTA
TTGGTGATGA CCACCAACGC TGGGGTACGA GAAACTGAGC GTAAATCGAT TGGTCTTATC
CATCAGGACA ACAGCACCGA TGCGATGGAA GAGATCAAGA AGATCTTTAC GCCGGAGTTT
CGTAACCGTC TCGACAACAT TATCTGGTTC GATCATCTCT CCACCGACGT GATCCATCAG
GTAGTGGATA AATTCATCGT CGAGTTGCAG GTTCAGCTGG ATCAGAAAGG TGTTTCTCTG
GAAGTGAGCC AGGAAGCGCG TAACTGGCTG GCCGAGAAAG GTTACGACCG GGCAATGGGC
GCACGTCCGA TGGCGCGTGT CATCCAGGAC AACCTGAAAA AACCGCTCGC CAACGAACTG
TTGTTTGGTT CGCTGGTGGA CGGCGGTCAG GTGACGGTTG CGCTGGATAA AGAGAAAAAT
GAGCTGACTT ACGGATTCCA GAGTGCACAA AAGCACAAGG CGGAAGCAGC GCATTAA
 
Protein sequence
MLNQELELSL NMAFARAREH RHEFMTVEHL LLALLSNPSA REALEACSVD LVALRQELEA 
FIEQTTPVLP ASEEERDTQP TLSFQRVLQR AVFHVQSSGR NEVTGANVLV AIFSEQESQA
AYLLRKHEVS RLDVVNFISH GTRKDEPTQS SDPGSQPNSE EQAGGEERME NFTTNLNQLA
RVGGIDPLIG REKELERAIQ VLCRRRKNNP LLVGESGVGK TAIAEGLAWR IVQGDVPEVM
ADCTIYSLDI GSLLAGTKYR GDFEKRFKAL LKQLEQDTNS ILFIDEIHTI IGAGAASGGQ
VDAANLIKPL LSSGKIRVIG STTYQEFSNI FEKDRALARR FQKIDITEPS IEETVQIING
LKPKYEAHHD VRYTAKAVRA AVELAVKYIN DRHLPDKAID VIDEAGARAR LMPVSKRKKT
VNVADIESVV ARIARIPEKS VSQSDRDTLK NLGDRLKMLV FGQDKAIEAL TEAIKMARAG
LGHEHKPVGS FLFAGPTGVG KTEVTVQLSK ALGIELLRFD MSEYMERHTV SRLIGAPPGY
VGFDQGGLLT DAVIKHPHAV LLLDEIEKAH PDVFNILLQV MDNGTLTDNN GRKADFRNVV
LVMTTNAGVR ETERKSIGLI HQDNSTDAME EIKKIFTPEF RNRLDNIIWF DHLSTDVIHQ
VVDKFIVELQ VQLDQKGVSL EVSQEARNWL AEKGYDRAMG ARPMARVIQD NLKKPLANEL
LFGSLVDGGQ VTVALDKEKN ELTYGFQSAQ KHKAEAAH