Gene Noca_3475 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3475 
SymbolclpX 
ID4595572 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp3685095 
End bp3686375 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content66% 
IMG OID639778081 
ProductATP-dependent protease ATP-binding subunit ClpX 
Protein accessionYP_924662 
Protein GI119717697 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1219] ATP-dependent protease Clp, ATPase subunit 
TIGRFAM ID[TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX) 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCACGTA TCGGTGACGG AGGCGACCTG CTCAAGTGCT CGTTCTGCGG GAAGAGCCAG 
AAGCAGGTCA AGAAGCTGAT CGCGGGCCCC GGCGTCTACA TCTGCGACGA GTGCATCGAC
CTGTGCAACG AGATCATCGA GGAGGAGCTC AGCGAGGGCG CCGAGGTCAG CCTCGACGAG
CTGCCGAAGC CCAAGGAGAT CTTCGAGTTC CTCAACTCCT ACGTCATCGG CCAGGAGCAG
GCCAAGAAGT CACTCGCCGT CGCGGTCTAC AACCACTACA AGCGGGTGCA GGCCGGCCTC
CAGCCCATGT CGGGCAAGCA CAGCAAGGAG GAGGTCGTCG AGGTCGCCAA GTCCAACATC
TTGGTGATCG GCCCCACCGG CTGCGGCAAG ACCTACCTCG CGCAGACCCT GGCCCGGATG
CTCAACGTGC CGTTCGCGAT CGCCGACGCC ACCGCGCTCA CCGAGGCCGG CTACGTCGGT
GAGGACGTCG AGAACATCCT GCTCAAGCTG ATCCAGGCCG CCGACTACGA CGTCAAGAAG
GCCGAGACCG GCATCATCTA CATCGACGAG ATCGACAAGG TGGCCCGCAA GGCGGAGAAC
CCCTCGATCA CCCGCGACGT CTCCGGCGAG GGCGTCCAGC AGGCGCTGCT CAAGATCATC
GAGGGCACCA CCGCCTCGGT CCCGCCGCAG GGCGGCCGCA AGCATCCCCA CCAGGAGTTC
ATCCAGATCG ACACCACGAA CATCCTGTTC GTCGTGGGTG GGGCGTTCGC CGGGCTGGAG
CACATCATCG AGCAGCGGGT CGGCAAGAAG ACCCTCGGCT TCACCGCCGA GGTCCGCGGC
AAGGCCGAGC GCGAGGCCGA GGACCTGCTC GCCCAGGTCC GGCCCGAGGA CCTCACGAAG
TTCGGCCTGA TCCCCGAGTT CATCGGCCGG CTGCCGCTGA TCGCGAGCGT GAGCAAGCTC
GACCAGGAGG CCCTCGTGCA GATCCTCACC GAGCCGCGCA ACGCCCTGGT CAAGCAGTAC
CAGAAGCTCT TCGAGCTCGA CGGTGTCGAG CTCGAGTTCA CCCCCGACGC CATCGAGGCG
ATCGCCGACA ACGCGCTCGA GCGCGGCACC GGTGCCCGTG GCCTGCGCGC GATCATCGAG
GAGGTCCTCC TCCACGTGAT GTACGACGTG CCCTCGCGTG GCGACATCGC GAAGGTGATC
GTCACCCGCG AGGTCGTCAT GGACGGGGTC TCGCCGACCC TGATCCCGCG CGAGTCGGAG
AAGAAGAAGA AGTCCGCGTA G
 
Protein sequence
MARIGDGGDL LKCSFCGKSQ KQVKKLIAGP GVYICDECID LCNEIIEEEL SEGAEVSLDE 
LPKPKEIFEF LNSYVIGQEQ AKKSLAVAVY NHYKRVQAGL QPMSGKHSKE EVVEVAKSNI
LVIGPTGCGK TYLAQTLARM LNVPFAIADA TALTEAGYVG EDVENILLKL IQAADYDVKK
AETGIIYIDE IDKVARKAEN PSITRDVSGE GVQQALLKII EGTTASVPPQ GGRKHPHQEF
IQIDTTNILF VVGGAFAGLE HIIEQRVGKK TLGFTAEVRG KAEREAEDLL AQVRPEDLTK
FGLIPEFIGR LPLIASVSKL DQEALVQILT EPRNALVKQY QKLFELDGVE LEFTPDAIEA
IADNALERGT GARGLRAIIE EVLLHVMYDV PSRGDIAKVI VTREVVMDGV SPTLIPRESE
KKKKSA