Gene Acid345_1559 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1559 
SymbolclpX 
ID4068668 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1907332 
End bp1908603 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content57% 
IMG OID637983568 
ProductATP-dependent protease ATP-binding subunit ClpX 
Protein accessionYP_590635 
Protein GI94968587 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1219] ATP-dependent protease Clp, ATPase subunit 
TIGRFAM ID[TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX) 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.173836 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00342669 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGAGAACCA GGACTGGTAA CGAAGAAGTA CTTCGCTGTT CTTTCTGCCA TAAATCCCAG 
GATGCCGTCG CGAAGCTAAT CTCTTCGCCG AGCGATTATC CGCGGGCTTA CATCTGCGAC
GAATGTGTCG CGGTCTGCAA TTCGATTCTC GAAGATGATC GCGCCGAAGC CAGTCCGTCG
GCCGCTCCGA ACCATCTTCC CAAGCCGCTG GAAGTCAAAA CCTTCCTCGA CGAATACGTC
ATCGGCCAGG ACCAGACCAA GAAGAAGCTC TCCGTCGCTG TTTACAACCA CTACAAGCGC
GTTTTCATGA ATCGCCAGCG TAGCGGCGAA GTCGAACTCC AGAAATCGAA CATCCTCTTA
GTCGGTCCCA CCGGAACCGG CAAAACGCTT TTGGCCCAAA CCTTGGCCAA GATGCTCGAT
GTTCCCTTCG CGATTGTGGA CGCGACAACT CTCACCGAAG CCGGCTACGT CGGCGAAGAT
GTCGAGAACA TCATCCTCAA GTTGCTCCAG GCTGCCGACG GCGATGTCGC CCGCGCCCAG
ACCGGCATCA TCTACATCGA CGAAATCGAC AAGATCGGCC GCAAAGACGA GAACCCGTCC
ATCACCCGCG ATGTCTCCGG TGAAGGCGTG CAGCAAGCGC TGCTCAAGAT TCTCGAAGGC
ACCGTCGCAA ACGTTCCGCC TCAAGGCGGT CGTAAGCACC CGCATCAGGA GTTCACACCC
GTCGATACGA CGAACATTCT CTTCATCGTC GGCGGCGCGT TCGTCGGCTT GGAGAAGATC
GTTGGCCGTC GCCTCGGCAA GCGCGCCCTC GGCTTCCGCG ACGAGGAAAA AGAAAAGAAC
TTCACCAGCG CCAACAACCG CGACTCCGAG ATGCTCCAGA GATCCGAACC GCAGGACCTC
ATCAAGTTCG GTCTCATTCC CGAATTTGTC GGACGCCTCC CCGTCATGGG AGTCCTCAAC
GACCTCGACG AAAACGCGCT TGTGGAAATC TTGACCAAGC CCAAGAACGC AATCGTGAAG
CAGTACCAGC GCCTCTTCGA ATTTGAAAAC GTAAAGCTGA AGTTCAGTGA CGAAGCCGTC
CGCGCCATCG CCCGCGAAGC CATGCAGCGA AAGGTCGGCG CCCGCGGTCT CCGCATGATC
CTCGAAGAGT TGATGCTCGA CTTGATGTAT AGCTTGCCAA GCCAAAAGCG CGTCAAGGAA
TTCGAGGTCA CTGCCGAAAT GGTCCAAAAA CGCGACGTCT CCATCAGCCT GATCGAGAAA
CAAGCCAGCT AA
 
Protein sequence
MRTRTGNEEV LRCSFCHKSQ DAVAKLISSP SDYPRAYICD ECVAVCNSIL EDDRAEASPS 
AAPNHLPKPL EVKTFLDEYV IGQDQTKKKL SVAVYNHYKR VFMNRQRSGE VELQKSNILL
VGPTGTGKTL LAQTLAKMLD VPFAIVDATT LTEAGYVGED VENIILKLLQ AADGDVARAQ
TGIIYIDEID KIGRKDENPS ITRDVSGEGV QQALLKILEG TVANVPPQGG RKHPHQEFTP
VDTTNILFIV GGAFVGLEKI VGRRLGKRAL GFRDEEKEKN FTSANNRDSE MLQRSEPQDL
IKFGLIPEFV GRLPVMGVLN DLDENALVEI LTKPKNAIVK QYQRLFEFEN VKLKFSDEAV
RAIAREAMQR KVGARGLRMI LEELMLDLMY SLPSQKRVKE FEVTAEMVQK RDVSISLIEK
QAS