Gene SbBS512_E2447 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E2447 
SymbolclpA 
ID6270181 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp2248203 
End bp2250479 
Gene Length2277 bp 
Protein Length758 aa 
Translation table11 
GC content53% 
IMG OID641726439 
ProductATP-dependent Clp protease ATP-binding subunit 
Protein accessionYP_001880920 
Protein GI187730565 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0542] ATPases with chaperone activity, ATP-binding subunit 
TIGRFAM ID[TIGR02639] ATP-dependent Clp protease ATP-binding subunit clpA 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.0676288 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCAATC AAGAACTGGA ACTCAGTTTA AATATGGCTT TCGCCAGAGC GCGCGAGCAC 
CGTCATGAGT TTATGACCGT CGAGCACTTG TTACTGGCGC TGCTCAGTAA CCCATCTGCC
CGGGAGGCGC TGGAAGCGTG TTCTGTGGAT TTGGTTGCGC TCTGTCAGGA ACTGGAAGCC
TTTATTGAAC AAACCACACC CGTTCTGCCT GCCAGTGAAG AGGAGCGCGA CACACAGCCG
ACGCTGAGTT TTCAGCGTGT ACTGCAACGT GCGGTCTTCC ATGTCCAGTC CTCCGGTCGC
AATGAGGTAA CCGGTGCAAA CGTTCTGGTC GCTATCTTTA GCGAACAGGA GTCGCAGGCG
GCATATCTGT TGCGTAAACA TGAAGTCAGC CGTCTCGATG TGGTGAACTT TATCTCTCAT
GGCACGCGTA AAGACGAGCC GACACAGTCT TCTGATCCTG GCAGCCAGCC AAACAGCGAA
GAACAAGCTG GTGGGGAGGA ACGTATGGAG AATTTCACGA CGAACCTGAA TCAGCTTGCG
CGCGTGGGCG GAATCGACCC ACTGATTGGT CGTGAGAAGG AGCTGGAGCG TGCTATTCAG
GTTCTCTGCC GTCGCCGTAA AAACAACCCG CTGCTGGTGG GGGAATCTGG TGTCGGTAAA
ACCGCGATTG CGGAAGGTCT TGCCTGGCGA ATTGTTCAGG GCGATGTGCC GGAAGTGATG
GCTGACTGTA CGATTTACTC TCTCGATATC GGTTCTCTGT TAGCGGGCAC TAAATATCGC
GGCGACTTTG AAAAACGTTT TAAAGCGTTG CTCAAGCAGC TGGAGCAGGA CACTAACAGC
ATCCTGTTTA TTGATGAGAT CCACACCATT ATCGGTGCGG GTGCAGCGTC TGGTGGCCAG
GTCGATGCGG CTAACCTGAT CAAACCGTTG CTCTCCAGCG GTAAAATTCG CGTAATTGGT
TCGACAACCT ATCAGGAGTT CAGCAACATT TTCGAGAAAG ACCGTGCTCT GGCGCGTCGC
TTCCAGAAAA TTGATATTAC TGAACCGTCG ATCGAAGAAA CTGTTCAAAT CATCAATGGC
CTGAAACCGA AGTATGAAGC GCACCACGAC GTGCGTTATA CTGCAAAAGC GGTGCGTGCA
GCGGTAGAGC TGGCGGTGAA ATACATTAAC GATCGTCATC TGCCGGATAA AGCCATTGAC
GTTATCGACG AAGCGGGCGC TCGCGCACGC CTGATGCCGG TAAGCAAACG CAAGAAAACC
GTTAATGTGG CGGATATTGA GTCCGTGGTG GCCCGTATTG CGCGCATTCC AGAGAAGAGT
GTTTCTCAGA GTGACCGCGA TACCCTGAAA AACCTCGGCG ATCGCCTGAA AATGCTGGTC
TTCGGTCAGG ATAAAGCCAT TGAGGCGCTG ACTGAAGCCA TTAAGATGGC GCGTGCAGGT
TTAGGTCACG AACATAAACC GGTTGGTTCG TTCCTGTTTG CCGGCCCTAC CGGGGTCGGG
AAAACAGAGG TGACGGTACA GCTTTCGAAA GCGTTGGGCA TTGAGCTGCT GCGCTTTGAT
ATGTCCGAGT ATATGGAACG CCATACCGTC AGCCGTCTGA TTGGTGCGCC TCCGGGATAC
GTTGGTTTTG ATCAGGGAGG TTTGCTGACT GATGCGGTCA TCAAGTATCC ACATGCGGTG
CTGTTGCTGG ACGAAATCGA GAAAGCGCAT CCGGACGTGT TCAATATTCT GTTGCAGGTG
ATGGACAACG GTACGCTGAC CGATAACAAC GGACGCAAAG CGGACTTCCG TAACGTGGTG
CTGGTGATGA CCACCAACGC CGGGGTACGT GAAACTGAGC GTAAATCCAT TGGTCTTATC
CACCAGGATA ACAGCACCGA TGCGATGGAA GAGATCAAGA AGATCTTTAC ACCGGAATTC
CGTAACCGTC TCGACAACAT TATCTGGTTC GATCATCTGT CAACTGACGT GATCCATCAG
GTGGTGGATA AATTCATCGT CGAGTTGCAG GTTCAGCTGG ATCAGAAAGG TGTTTCTCTG
GAAGTGAGCC AGGAAGCGCG TAACTGGCTG GCCGAGAAAG GTTACGACCG GGCAATGGGC
GCACGTCCGA TGGCGCGTGT CATCCAGGAC AACCTGAAAA AACCGCTCGC CAACGAACTG
CTGTTTGGTT CGCTGGTGGA CGGCGGTCAG GTCACCGTCG AGCTGGATAA AGAGAAAAAT
GAGCTGACTT ACGGATTCCA GAGTGCACAA AAGCACAAGG CGGAAGCAGC GCATTAA
 
Protein sequence
MLNQELELSL NMAFARAREH RHEFMTVEHL LLALLSNPSA REALEACSVD LVALCQELEA 
FIEQTTPVLP ASEEERDTQP TLSFQRVLQR AVFHVQSSGR NEVTGANVLV AIFSEQESQA
AYLLRKHEVS RLDVVNFISH GTRKDEPTQS SDPGSQPNSE EQAGGEERME NFTTNLNQLA
RVGGIDPLIG REKELERAIQ VLCRRRKNNP LLVGESGVGK TAIAEGLAWR IVQGDVPEVM
ADCTIYSLDI GSLLAGTKYR GDFEKRFKAL LKQLEQDTNS ILFIDEIHTI IGAGAASGGQ
VDAANLIKPL LSSGKIRVIG STTYQEFSNI FEKDRALARR FQKIDITEPS IEETVQIING
LKPKYEAHHD VRYTAKAVRA AVELAVKYIN DRHLPDKAID VIDEAGARAR LMPVSKRKKT
VNVADIESVV ARIARIPEKS VSQSDRDTLK NLGDRLKMLV FGQDKAIEAL TEAIKMARAG
LGHEHKPVGS FLFAGPTGVG KTEVTVQLSK ALGIELLRFD MSEYMERHTV SRLIGAPPGY
VGFDQGGLLT DAVIKYPHAV LLLDEIEKAH PDVFNILLQV MDNGTLTDNN GRKADFRNVV
LVMTTNAGVR ETERKSIGLI HQDNSTDAME EIKKIFTPEF RNRLDNIIWF DHLSTDVIHQ
VVDKFIVELQ VQLDQKGVSL EVSQEARNWL AEKGYDRAMG ARPMARVIQD NLKKPLANEL
LFGSLVDGGQ VTVELDKEKN ELTYGFQSAQ KHKAEAAH