Gene B21_00893 details

Gene Information

Locus tag: B21_00893
Symbol: clpA
ID: 8116091
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 927378
End bp: 929654
Gene Length: 2277 bp
Protein Length: 758 aa
Translation table: 11
GC content: 53%
IMG OID: 644847155
Product: hypothetical protein
Protein accession: YP_002998728
Protein GI: 251784424
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0542] ATPases with chaperone activity, ATP-binding subunit
TIGRFAM ID: [TIGR02639] ATP-dependent Clp protease ATP-binding subunit clpA
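
The length fields in this record are consistent with the coordinates if Start bp and End bp are read as 1-based, inclusive positions (an assumption; the record does not state the convention). A minimal Python sketch of that arithmetic follows. The function names are illustrative only and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS that ends in a single stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length(927378, 929654)
print(length, protein_length(length))  # 2277 758
```

Both values match the Gene Length and Protein Length fields reported for this gene.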


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.1419
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTCAATC AAGAACTGGA ACTCAGTTTA AATATGGCTT TCGCCAGAGC GCGCGAGCAC 
CGTCATGAGT TTATGACCGT CGAGCACTTG TTACTGGCGC TGCTCAGTAA CCCATCTGCC
CGGGAGGCGC TGGAAGCGTG TTCTGTGGAT TTGGTTGCGC TCCGTCAGGA ACTGGAAGCC
TTTATTGAAC AAACCACACC CGTTCTGCCT GCCAGTGAAG AGGAGCGCGA CACACAGCCG
ACGCTGAGTT TTCAGCGTGT ACTGCAACGT GCGGTCTTCC ATGTCCAGTC CTCCGGTCGC
AATGAGGTTA CCGGTGCAAA CGTTCTGGTC GCTATCTTTA GCGAACAGGA GTCGCAGGCG
GCATATCTGT TGCGTAAACA TGAAGTCAGC CGTCTCGATG TGGTGAATTT TATCTCTCAT
GGCACGCGTA AAGACGAGCC GACACAGTCT TCTGATCCTG GCAGCCAGCC AAACAGCGAA
GAACAAGCTG GTGGGGAGGA ACGTATGGAG AATTTCACGA CGAACCTGAA TCAGCTTGCG
CGCGTGGGCG GAATCGACCC ACTGATTGGT CGTGAGAAGG AGCTTGAGCG TGCTATTCAG
GTTCTCTGCC GTCGCCGTAA AAACAACCCG CTGCTGGTGG GGGAATCTGG TGTCGGTAAA
ACCGCGATTG CAGAAGGTCT TGCCTGGCGA ATTGTTCAGG GCGATGTGCC GGAAGTGATG
GCTGACTGTA CAATTTACTC TCTCGATATC GGTTCTCTGT TAGCGGGCAC TAAATATCGC
GGCGACTTTG AAAAACGTTT TAAAGCGTTG CTCAAGCAGC TGGAGCAGGA CACTAACAGC
ATCCTGTTTA TTGATGAGAT CCACACCATT ATCGGTGCGG GTGCAGCGTC TGGTGGCCAG
GTCGATGCGG CTAACCTGAT CAAACCGTTG CTCTCCAGCG GTAAAATTCG CGTAATTGGT
TCGACAACCT ATCAGGAGTT CAGCAACATT TTCGAGAAAG ACCGTGCTCT GGCGCGTCGC
TTCCAGAAAA TTGATATTAC TGAACCGTCG ATCGAAGAAA CTGTTCAAAT CATCAATGGC
CTGAAACCGA AGTATGAAGC GCACCACGAC GTGCGTTATA CCGCAAAAGC GGTGCGTGCG
GCGGTAGAGC TGGCGGTGAA ATACATTAAC GATCGTCATC TGCCGGATAA AGCCATTGAT
GTTATCGACG AAGCGGGCGC TCGCGCACGT CTGATGCCGG TAAGCAAACG CAAGAAAACC
GTTAATGTGG CGGATATTGA GTCCGTGGTG GCCCGTATTG CACGCATTCC AGAGAAGAGT
GTTTCTCAGA GTGACCGCGA TACCCTGAAA AACCTCGGCG ATCGCCTGAA AATGCTGGTC
TTCGGCCAGG ATAAAGCCAT TGAGGCGCTG ACTGAAGCCA TTAAGATGGC GCGTGCAGGT
TTAGGTCACG AACATAAACC GGTCGGTTCG TTCCTGTTTG CCGGTCCTAC CGGGGTCGGG
AAAACAGAGG TGACGGTACA GCTTTCGAAA GCGTTGGGCA TTGAGCTTCT GCGCTTTGAT
ATGTCCGAGT ATATGGAACG CCATACCGTC AGCCGTCTGA TTGGTGCGCC TCCGGGATAC
GTTGGTTTTG ATCAGGGCGG TTTGCTGACT GATGCGGTCA TCAAGCATCC ACATGCGGTT
CTGCTGCTGG ACGAAATCGA GAAAGCGCAC CCTGACGTGT TCAATATTCT GTTGCAGGTG
ATGGACAACG GTACGCTGAC CGATAACAAC GGACGCAAAG CGGACTTCCG TAACGTGGTG
CTGGTGATGA CCACCAACGC CGGGGTACGT GAAACTGAGC GTAAATCCAT TGGTCTTATC
CACCAGGATA ACAGCACCGA TGCGATGGAA GAGATTAAGA AGATCTTTAC ACCGGAATTC
CGTAACCGTC TCGACAACAT TATCTGGTTT GATCATCTGT CAACCGACGT GATCCATCAG
GTGGTGGATA AATTCATCGT CGAGTTGCAG GTTCAGCTGG ATCAGAAAGG TGTTTCTCTG
GAAGTGAGCC AGGAAGCGCG TAACTGGCTG GCCGAGAAAG GTTACGACCG GGCAATGGGC
GCACGTCCGA TGGCGCGTGT CATCCAGGAC AACCTGAAAA AAACGCTCGC CAACGAACTG
CTGTTTGGTT CGCTGGTGGA CGGCGGTCAG GTCACCGTCG CGCTGGATAA AGAGAAAAAT
GAGCTGACTT ACGGATTCCA GAGTGCACAA AAGCACAAGG CGGAAGCAGC GCATTAA
 
Protein sequence
MLNQELELSL NMAFARAREH RHEFMTVEHL LLALLSNPSA REALEACSVD LVALRQELEA 
FIEQTTPVLP ASEEERDTQP TLSFQRVLQR AVFHVQSSGR NEVTGANVLV AIFSEQESQA
AYLLRKHEVS RLDVVNFISH GTRKDEPTQS SDPGSQPNSE EQAGGEERME NFTTNLNQLA
RVGGIDPLIG REKELERAIQ VLCRRRKNNP LLVGESGVGK TAIAEGLAWR IVQGDVPEVM
ADCTIYSLDI GSLLAGTKYR GDFEKRFKAL LKQLEQDTNS ILFIDEIHTI IGAGAASGGQ
VDAANLIKPL LSSGKIRVIG STTYQEFSNI FEKDRALARR FQKIDITEPS IEETVQIING
LKPKYEAHHD VRYTAKAVRA AVELAVKYIN DRHLPDKAID VIDEAGARAR LMPVSKRKKT
VNVADIESVV ARIARIPEKS VSQSDRDTLK NLGDRLKMLV FGQDKAIEAL TEAIKMARAG
LGHEHKPVGS FLFAGPTGVG KTEVTVQLSK ALGIELLRFD MSEYMERHTV SRLIGAPPGY
VGFDQGGLLT DAVIKHPHAV LLLDEIEKAH PDVFNILLQV MDNGTLTDNN GRKADFRNVV
LVMTTNAGVR ETERKSIGLI HQDNSTDAME EIKKIFTPEF RNRLDNIIWF DHLSTDVIHQ
VVDKFIVELQ VQLDQKGVSL EVSQEARNWL AEKGYDRAMG ARPMARVIQD NLKKTLANEL
LFGSLVDGGQ VTVALDKEKN ELTYGFQSAQ KHKAEAAH
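
The gene and protein sequences above are printed in space-separated blocks of ten. As a rough sketch of how they relate, the Python below strips that layout, computes a GC percentage, and translates codons until the first stop. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical and not from the database. The codon assignments are those of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons enumerated in TCAG order (TTT, TTC, TTA, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an ungapped nucleotide sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence as displayed in this record.
first_line = "ATGCTCAATC AAGAACTGGA ACTCAGTTTA AATATGGCTT TCGCCAGAGC GCGCGAGCAC"
print(translate(clean(first_line)))  # MLNQELELSLNMAFARAREH
```

The printed residues correspond to the first two blocks of the protein sequence. Passing the full cleaned gene sequence to `gc_percent` would give a value to compare against the GC content field.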