Gene Dshi_2939 details


Gene Information

Locus tag: Dshi_2939
Symbol: clpA
ID: 5710790
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 3096331
End bp: 3098658
Gene Length: 2328 bp
Protein Length: 775 aa
Translation table: 11
GC content: 64%
IMG OID: 641268865
Product: ATP-dependent Clp protease ATP-binding subunit
Protein accession: YP_001534273
Protein GI: 159045479
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0542] ATPases with chaperone activity, ATP-binding subunit
TIGRFAM ID: [TIGR02639] ATP-dependent Clp protease ATP-binding subunit clpA
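
The start and end coordinates, gene length, and protein length in this record agree with one another, and a little arithmetic confirms it. The following is a minimal Python sketch, assuming that the coordinates are 1-based and inclusive and that the gene length includes the stop codon. Neither convention is stated on this page, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming its length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length(3096331, 3098658)
print(length, protein_length(length))
```

Under these assumptions the computed values match the 2328 bp and 775 aa listed in the record.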


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.407521
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.118399
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCTTCGT TTTCGAATAC GCTAGAACAG GCCATCCACA CCGCGCTCGC CCAGGCCAAT 
GCGCGTCGTC ATGAACTGGC GACGCTTGAG CATCTGCTTT TGGCGCTGAT CGACGAGCCG
GACGCGGCCA AGGTCATGAA AGCCTGCAAT GTCGACCTGG ACGCCCTGCG CCAGACCCTC
GAGGCTTTCA TCGAGGAAGA TCTCGCAACG CTCGCCACGG ATGTTGAGGG GTCCGAAGCC
GTGCCCACCG CTGCGTTCCA GCGGGTGATC CAGCGCGCCG CGATCCATGT GCAGTCCTCG
GGCCGCCAGG AAGTGACCGG CGCCAATGTG CTGGTCGCGA TCTTCGCGGA ACGCGAGTCC
AATGCCGCGT ATTTCCTGCA GGAACAGGAC ATGACCCGGT ATGACGCGGT GAATTTCATC
GCCCATGGCG TGGCCAAGAA CCCCAGTTTC GGCGAATCCC GGCCCGTGTC GGGCGCGTCG
GACATGGAAG AAGAGGCCAG CGCCAGCACC GATCAGGGCG GGGACGAGAA GGAGTCCGCG
CTTGCAAAGT ACTGCGTGGA CCTGAACGCC AAATCGCGCA AGGGTGACGT CGACCCGCTT
ATCGGGCGCG ACAGCGAGGT GGAGCGCTGT ATCCAGGTGT TGTGCCGTCG GCGCAAGAAT
AACCCGCTTC TGGTGGGCGA TCCCGGCGTG GGCAAGACCG CCATCGCCGA GGGCCTGGCG
CGCAAGATCG TGCAGGGCGA AACGCCCGAG GTGCTGCGCG GTGCCACCAT CTACAGCCTG
GACATGGGGG CGCTGCTGGC GGGCACCCGC TATCGCGGCG ATTTCGAAGA GCGGTTGAAG
GCCGTTGTTA CCGAACTTGA AGACCACCCG GATGCGGTAC TCTTCATCGA CGAGATCCAC
ACGGTGATCG GGGCCGGGGC AACATCCGGC GGGGCGATGG ACGCCTCCAA CCTGCTCAAG
CCCGCGTTGC AGGGCGGCAA GCTGCGCTGC ATGGGCTCGA CCACCTACAA GGAGTTCCGT
CAGCATTTCG AGAAGGATCG CGCGCTGAGC CGCCGGTTCC AGAAGATCGA CGTGACCGAG
CCCTCGGTCG AGGACACGGT GAAGATCCTC AAGGGCCTCA AGCCCTATTT CGAGGATCAC
CATGCGATCA AGTACACCTC CGACGCGATC AAGACCGCGG TGGAGCTGTC GGCGCGCTAT
ATCAACGACC GGAAACTGCC GGACAAGGCC ATCGACGTGA TCGACGAGGC CGGGGCCGCG
CAGCACCTGG TGGCCGAATC CAAGCGCCGC AAGACCATCG GCGCCAAGGA AATCGAGGCC
GTGGTGGCCA AGATCGCCCG CATCCCGCCG AAAAACGTCT CCAAGGACGA TGCGGAGGTG
CTGAAGGATC TCGAGGTCTC GCTCAAGCGC GTGGTCTTCG GCCAGGACAA TGCGATCGAG
GCGCTCAGCT CCGCGATCAA GCTGGCCCGC GCGGGCCTGC GCGAGCCGGA AAAGCCCATC
GGCAACTACC TCTTCGCGGG CCCCACGGGT GTCGGCAAGA CCGAGGTGGC CAAGCAGCTC
TCCAGTACGC TGGGGGTGGA GCTTTTGCGG TTCGACATGT CGGAATACAT GGAGAAACAC
GCGGTCTCGC GCCTGATCGG TGCACCTCCG GGCTATGTCG GGTTCGACCA GGGCGGTCTT
CTGACCGACG GGGTCGACCA GCACCCGCAT TGCGTGCTCC TGCTCGACGA GATCGAGAAG
GCGCACCCGG ATGTCTACAA CATCCTGCTG CAGGTGATGG ATCACGGCAC GCTCACCGAC
CATAACGGGC GGACCGTGGA TTTCCGTAAC GTGATCCTGA TCATGACCTC CAACGCGGGG
GCCGCGGAAC AGGCCAAGGC CGCCATCGGC TTCGGGCGCG ACCGGCGCGA GGGCGAGGAT
ACCGCCGCGA TCGAGCGGAC CTTCACGCCC GAGTTCCGCA ACCGCCTGGA CGCAGTGATC
AGCTTCGCGC CGCTCGGCAA GGAGATCATC ATGCAGGTCG TCGAAAAGTT CGTGCTCCAG
CTCGAAGCAC AACTGCTGGA CCGCAACGTC ACCATCGAGC TGAGCGAGGA AGCGGCGACG
CTGCTCGGCG ACATGGGTTA CGACGACAAG ATGGGCGCCC GCCCCCTGGG CCGCGTGATC
CAAGAGCAGA TCAAGAAGCC GCTGGCGGAA GAGTTGCTCT TCGGCAAGCT GGCCAAGGGC
GGCATCGTCA AGGTCGGCGT GAAGGACGGC AAGATCGATC TGCAGATCGC GCCGATCGGC
GCACCCCGGA TCTCCAGCAA GAAGCCCCCG CTTCTGACCG CGGACTGA
 
Protein sequence
MPSFSNTLEQ AIHTALAQAN ARRHELATLE HLLLALIDEP DAAKVMKACN VDLDALRQTL 
EAFIEEDLAT LATDVEGSEA VPTAAFQRVI QRAAIHVQSS GRQEVTGANV LVAIFAERES
NAAYFLQEQD MTRYDAVNFI AHGVAKNPSF GESRPVSGAS DMEEEASAST DQGGDEKESA
LAKYCVDLNA KSRKGDVDPL IGRDSEVERC IQVLCRRRKN NPLLVGDPGV GKTAIAEGLA
RKIVQGETPE VLRGATIYSL DMGALLAGTR YRGDFEERLK AVVTELEDHP DAVLFIDEIH
TVIGAGATSG GAMDASNLLK PALQGGKLRC MGSTTYKEFR QHFEKDRALS RRFQKIDVTE
PSVEDTVKIL KGLKPYFEDH HAIKYTSDAI KTAVELSARY INDRKLPDKA IDVIDEAGAA
QHLVAESKRR KTIGAKEIEA VVAKIARIPP KNVSKDDAEV LKDLEVSLKR VVFGQDNAIE
ALSSAIKLAR AGLREPEKPI GNYLFAGPTG VGKTEVAKQL SSTLGVELLR FDMSEYMEKH
AVSRLIGAPP GYVGFDQGGL LTDGVDQHPH CVLLLDEIEK AHPDVYNILL QVMDHGTLTD
HNGRTVDFRN VILIMTSNAG AAEQAKAAIG FGRDRREGED TAAIERTFTP EFRNRLDAVI
SFAPLGKEII MQVVEKFVLQ LEAQLLDRNV TIELSEEAAT LLGDMGYDDK MGARPLGRVI
QEQIKKPLAE ELLFGKLAKG GIVKVGVKDG KIDLQIAPIG APRISSKKPP LLTAD
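
Both sequences are printed in space-separated blocks of ten characters, so the whitespace has to be removed before any analysis. The following is a minimal Python sketch for cleaning a blocked sequence and computing its GC percentage, which could be used to check the GC content reported in the record. The helper names are hypothetical and not part of any database tool.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA sequence (whitespace ignored)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence, used here as a small example.
example = "GTGCCTTCGT TTTCGAATAC"
print(len(clean_sequence(example)), gc_percent(example))
```

Passing the full gene sequence to `clean_sequence` and taking its length gives a quick comparison against the listed gene length as well.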