Gene Sare_3876 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3876 
SymbolclpX 
ID5707473 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4418232 
End bp4419521 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content63% 
IMG OID641273300 
ProductATP-dependent protease ATP-binding subunit ClpX 
Protein accessionYP_001538659 
Protein GI159039406 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1219] ATP-dependent protease Clp, ATPase subunit 
TIGRFAM ID[TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX) 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.7639 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000675219 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGCACGGA TCGGTGACGG TGGCGACCTA CTGAAATGTT CCTTCTGCGG GAAGTCACAA 
AAGCAGGTCA AGAAGCTTAT CGCGGGCCCC GGGGTCTACA TCTGCGACGA GTGTATCGAT
CTCTGTAACG AGATCATTGA GGAAGAGCTG GCTGAATCTG GCGAAGTGAA GTGGGAAGAG
CTTCCCAAGC CGATGGAGAT CTGCCAGTTC CTCGACAATT ACGTCGTTGG TCAGGCTCAG
GCCAAGAAGG CCCTCGCCGT CGCGGTCTAC AACCACTACA AGCGGATTCA GGCCGAGGCG
GTCGGCGCTC CCGGCACCGA CAGTGTCGAA CTGGCCAAGT CCAACATTCT GCTACTCGGC
CCGACCGGAT GCGGCAAGAC CCACCTCGCG CAGACCCTGG CCCGGATGCT CAACGTGCCG
TTCGCGATCG CGGACGCCAC GGCCCTGACC GAGGCCGGGT ACGTCGGCGA GGATGTGGAG
AACATCCTCC TCAAGCTGAT CCAGGCTGCC GACTACGACA TCAAGCGCGC CGAGACCGGA
ATCATCTACA TCGACGAGGT CGACAAGATC GCGCGTAAGT CGGAGAACCC ATCGATAACC
CGTGACGTTT CTGGTGAGGG CGTGCAGCAG GCGCTGCTCA AGATGCTGGA GGGAACGGTC
GCGAACGTTC CGCCCCAGGG TGGCCGCAAG CATCCACATC AGGAGTTCAT CCAGATCGAC
ACCACCAACG TGCTCTTCAT CTGTGGCGGC GCCTTCGCGG GGCTGGACCA GATCATCGAG
GCGCGTACCG GCCATGGTGG CACCGGCTTC GGCGCCCGGT TGCGCGCGGT CTCGGAACGT
TCGACGGATG ACACCTTCAG CCAGGTCATG CCGGAGGACA TGCTGAAGTT CGGTCTGATC
CCTGAGTTCA TCGGCCGGCT TCCGGTGATC ACCAATGTCC GTAGCCTCGA CCGTTCGGCC
TTGGTGCGGA TCCTTACCGA GCCACGCAAC GCGCTCGTCC GGCAGTACCA GCGCCTTTTC
GAGTTGGACG GCGTCGAGTT GGAGTTCGAG CAACCGGCCC TCGAGGCGGT CGCCGACCAG
GCGATGCTCC GTGGCACCGG TGCCCGAGGT CTCCGCGCGA TCATGGAGGA GGTACTGCTC
TCCGTGATGT ACGAGGTGCC CAGCAATCCC GACGCTGCTC GAGTGTTGAT TACCCGGGAG
GTGGTTCTGG AGAACGTCAA CCCGACGATC GTTCCGCGTG AGTTCACCGG CCGCCGGGCC
CGGCGGGAGC GCGAGGAGAA GTCGGCCTGA
 
Protein sequence
MARIGDGGDL LKCSFCGKSQ KQVKKLIAGP GVYICDECID LCNEIIEEEL AESGEVKWEE 
LPKPMEICQF LDNYVVGQAQ AKKALAVAVY NHYKRIQAEA VGAPGTDSVE LAKSNILLLG
PTGCGKTHLA QTLARMLNVP FAIADATALT EAGYVGEDVE NILLKLIQAA DYDIKRAETG
IIYIDEVDKI ARKSENPSIT RDVSGEGVQQ ALLKMLEGTV ANVPPQGGRK HPHQEFIQID
TTNVLFICGG AFAGLDQIIE ARTGHGGTGF GARLRAVSER STDDTFSQVM PEDMLKFGLI
PEFIGRLPVI TNVRSLDRSA LVRILTEPRN ALVRQYQRLF ELDGVELEFE QPALEAVADQ
AMLRGTGARG LRAIMEEVLL SVMYEVPSNP DAARVLITRE VVLENVNPTI VPREFTGRRA
RREREEKSA