Gene Information |
Locus tag | SbBS512_E2447 |
Symbol | clpA |
ID | 6270181 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 2248203 |
End bp | 2250479 |
Gene Length | 2277 bp |
Protein Length | 758 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641726439 |
Product | ATP-dependent Clp protease ATP-binding subunit |
Protein accession | YP_001880920 |
Protein GI | 187730565 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0542] ATPases with chaperone activity, ATP-binding subunit |
TIGRFAM ID | [TIGR02639] ATP-dependent Clp protease ATP-binding subunit clpA |
|
|
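The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. Neither convention is stated in the record, but both reproduce the listed values. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length = gene_length_bp(2248203, 2250479)
print(length)                     # 2277
print(protein_length_aa(length))  # 758
```

Both results agree with the Gene Length (2277 bp) and Protein Length (758 aa) fields.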
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.0676288 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCAATC AAGAACTGGA ACTCAGTTTA AATATGGCTT TCGCCAGAGC GCGCGAGCAC CGTCATGAGT TTATGACCGT CGAGCACTTG TTACTGGCGC TGCTCAGTAA CCCATCTGCC CGGGAGGCGC TGGAAGCGTG TTCTGTGGAT TTGGTTGCGC TCTGTCAGGA ACTGGAAGCC TTTATTGAAC AAACCACACC CGTTCTGCCT GCCAGTGAAG AGGAGCGCGA CACACAGCCG ACGCTGAGTT TTCAGCGTGT ACTGCAACGT GCGGTCTTCC ATGTCCAGTC CTCCGGTCGC AATGAGGTAA CCGGTGCAAA CGTTCTGGTC GCTATCTTTA GCGAACAGGA GTCGCAGGCG GCATATCTGT TGCGTAAACA TGAAGTCAGC CGTCTCGATG TGGTGAACTT TATCTCTCAT GGCACGCGTA AAGACGAGCC GACACAGTCT TCTGATCCTG GCAGCCAGCC AAACAGCGAA GAACAAGCTG GTGGGGAGGA ACGTATGGAG AATTTCACGA CGAACCTGAA TCAGCTTGCG CGCGTGGGCG GAATCGACCC ACTGATTGGT CGTGAGAAGG AGCTGGAGCG TGCTATTCAG GTTCTCTGCC GTCGCCGTAA AAACAACCCG CTGCTGGTGG GGGAATCTGG TGTCGGTAAA ACCGCGATTG CGGAAGGTCT TGCCTGGCGA ATTGTTCAGG GCGATGTGCC GGAAGTGATG GCTGACTGTA CGATTTACTC TCTCGATATC GGTTCTCTGT TAGCGGGCAC TAAATATCGC GGCGACTTTG AAAAACGTTT TAAAGCGTTG CTCAAGCAGC TGGAGCAGGA CACTAACAGC ATCCTGTTTA TTGATGAGAT CCACACCATT ATCGGTGCGG GTGCAGCGTC TGGTGGCCAG GTCGATGCGG CTAACCTGAT CAAACCGTTG CTCTCCAGCG GTAAAATTCG CGTAATTGGT TCGACAACCT ATCAGGAGTT CAGCAACATT TTCGAGAAAG ACCGTGCTCT GGCGCGTCGC TTCCAGAAAA TTGATATTAC TGAACCGTCG ATCGAAGAAA CTGTTCAAAT CATCAATGGC CTGAAACCGA AGTATGAAGC GCACCACGAC GTGCGTTATA CTGCAAAAGC GGTGCGTGCA GCGGTAGAGC TGGCGGTGAA ATACATTAAC GATCGTCATC TGCCGGATAA AGCCATTGAC GTTATCGACG AAGCGGGCGC TCGCGCACGC CTGATGCCGG TAAGCAAACG CAAGAAAACC GTTAATGTGG CGGATATTGA GTCCGTGGTG GCCCGTATTG CGCGCATTCC AGAGAAGAGT GTTTCTCAGA GTGACCGCGA TACCCTGAAA AACCTCGGCG ATCGCCTGAA AATGCTGGTC TTCGGTCAGG ATAAAGCCAT TGAGGCGCTG ACTGAAGCCA TTAAGATGGC GCGTGCAGGT TTAGGTCACG AACATAAACC GGTTGGTTCG TTCCTGTTTG CCGGCCCTAC CGGGGTCGGG AAAACAGAGG TGACGGTACA GCTTTCGAAA GCGTTGGGCA TTGAGCTGCT GCGCTTTGAT ATGTCCGAGT ATATGGAACG CCATACCGTC AGCCGTCTGA TTGGTGCGCC TCCGGGATAC GTTGGTTTTG ATCAGGGAGG TTTGCTGACT GATGCGGTCA TCAAGTATCC ACATGCGGTG CTGTTGCTGG ACGAAATCGA GAAAGCGCAT CCGGACGTGT TCAATATTCT GTTGCAGGTG ATGGACAACG GTACGCTGAC CGATAACAAC GGACGCAAAG CGGACTTCCG TAACGTGGTG 
CTGGTGATGA CCACCAACGC CGGGGTACGT GAAACTGAGC GTAAATCCAT TGGTCTTATC CACCAGGATA ACAGCACCGA TGCGATGGAA GAGATCAAGA AGATCTTTAC ACCGGAATTC CGTAACCGTC TCGACAACAT TATCTGGTTC GATCATCTGT CAACTGACGT GATCCATCAG GTGGTGGATA AATTCATCGT CGAGTTGCAG GTTCAGCTGG ATCAGAAAGG TGTTTCTCTG GAAGTGAGCC AGGAAGCGCG TAACTGGCTG GCCGAGAAAG GTTACGACCG GGCAATGGGC GCACGTCCGA TGGCGCGTGT CATCCAGGAC AACCTGAAAA AACCGCTCGC CAACGAACTG CTGTTTGGTT CGCTGGTGGA CGGCGGTCAG GTCACCGTCG AGCTGGATAA AGAGAAAAAT GAGCTGACTT ACGGATTCCA GAGTGCACAA AAGCACAAGG CGGAAGCAGC GCATTAA
|
Protein sequence | MLNQELELSL NMAFARAREH RHEFMTVEHL LLALLSNPSA REALEACSVD LVALCQELEA FIEQTTPVLP ASEEERDTQP TLSFQRVLQR AVFHVQSSGR NEVTGANVLV AIFSEQESQA AYLLRKHEVS RLDVVNFISH GTRKDEPTQS SDPGSQPNSE EQAGGEERME NFTTNLNQLA RVGGIDPLIG REKELERAIQ VLCRRRKNNP LLVGESGVGK TAIAEGLAWR IVQGDVPEVM ADCTIYSLDI GSLLAGTKYR GDFEKRFKAL LKQLEQDTNS ILFIDEIHTI IGAGAASGGQ VDAANLIKPL LSSGKIRVIG STTYQEFSNI FEKDRALARR FQKIDITEPS IEETVQIING LKPKYEAHHD VRYTAKAVRA AVELAVKYIN DRHLPDKAID VIDEAGARAR LMPVSKRKKT VNVADIESVV ARIARIPEKS VSQSDRDTLK NLGDRLKMLV FGQDKAIEAL TEAIKMARAG LGHEHKPVGS FLFAGPTGVG KTEVTVQLSK ALGIELLRFD MSEYMERHTV SRLIGAPPGY VGFDQGGLLT DAVIKYPHAV LLLDEIEKAH PDVFNILLQV MDNGTLTDNN GRKADFRNVV LVMTTNAGVR ETERKSIGLI HQDNSTDAME EIKKIFTPEF RNRLDNIIWF DHLSTDVIHQ VVDKFIVELQ VQLDQKGVSL EVSQEARNWL AEKGYDRAMG ARPMARVIQD NLKKPLANEL LFGSLVDGGQ VTVELDKEKN ELTYGFQSAQ KHKAEAAH
|
| |
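The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. The example assumes the listed gene sequence is already in coding orientation (it begins with ATG and ends with TAA, even though the gene is on the minus strand) and uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code. Alternative start codons, which table 11 also permits, are not handled here. The `translate` helper is illustrative and is not part of the database.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace is ignored so that the 10-base blocks used in the record
    can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGCTCAATC AAGAACTGGA ACTCAGTTTA"))  # MLNQELELSL
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence should yield the 758-residue protein in the same way.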