Gene Nmul_A2339 details

Gene Information

Locus tag: Nmul_A2339
Symbol: clpX
ID: 3784742
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2665698
End bp: 2666975
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 53%
IMG OID: 637812429
Product: ATP-dependent protease ATP-binding subunit ClpX
Protein accession: YP_413022
Protein GI: 82703456
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1219] ATP-dependent protease Clp, ATPase subunit
TIGRFAM ID: [TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX)
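
The coordinates and lengths in this record agree with each other, which a short sketch can confirm. It assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon. The function names are illustrative and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2665698, 2666975)
print(length_bp)                  # 1278
print(protein_length(length_bp))  # 425
```

Both values match the Gene Length and Protein Length fields above.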


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0299651
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAGAGA AAACTGGCGG AGAAAAACTG CTTTATTGTT CCTTTTGTGG TAAGAGCCAG 
CATGAGGTAA AGAAGCTGAT CGCCGGTCCT TCGGTATTCA TATGCGACGA ATGTATCGAG
TTGTGCAACG ATATCATTCG TGAGGAAATA CAGGGTGTGG AGGCGGCCAA GCTCGCAAAG
TCGGACCTGC CTGTCCCCCA TGAAATTCGA CAGATCCTGG ATCAATACGT GATCGGCCAG
GAGCAGGCGA AGAAAATTCT GTCGGTAGCG GTCTATAATC ACTACAAGCG TCTGAGAACG
CTTGCAAAAT CCGCTGATCC GGATGAGATC GAGCTGGCGA AGAGCAACAT CCTGCTGATT
GGCCCCACGG GTTCAGGCAA GACGCTGCTT GCCCAGACGC TGGCACGTTT GCTGGATGTG
CCTTTTGTCA TGGCGGATGC GACCACCCTG ACCGAGGCGG GTTATGTCGG CGAGGATGTT
GAAAACATCA TCCAGAAATT GCTGCAGAAA TGTAACTACG ACGCGGAAAA GGCGCAGCAA
GGCATTGTCT ACATAGACGA AATCGACAAG ATTTCGCGCA AGTCAGATAA TCCATCCATC
ACGCGCGATG TCTCGGGTGA GGGCGTACAG CAGGCACTGC TCAAGCTGAT CGAGGGCACG
GTTGCGTCCG TGCCGCCGCA GGGGGGGAGA AAGCATCCCA ACCAGGAGTT CGTGCAAGTC
GATACCACCA ATATTCTTTT CATTTGCGGG GGCGCTTTTG ATGGCCTGGA GAAAATCATA
CGCGCCCGCT CTGAGAAGGG CGGGATAGGC TTCAGCGCCA GTGTAAGAAG CCAGGATAAC
CGCAAGGATT TTGGCGCTGT ACTGCGCGGG GTCGAGCCCG AGGATCTGGT GAAGTATGGC
CTGATTCCGG AATTCGTCGG CCGATTACCT GTTGTTGCAA CCCTTGAGGA ACTTGACGAG
GCCGCGTTGA TTCAAATCCT GACCGAACCC AGGAATGCGC TCATCAAGCA GTATCAGAAA
ATGTTTCACA TGGAAGGGGG CATCGATCTC GAATTCCGGG AGCAGGCGCT TAAAGCGATT
GCCCGCAAGG CGCTGGTGCG AAAGACCGGT GCGCGAGGTT TACGTTCCAT TCTTGAAGCT
GCGCTGCTCG ATACGATGTT TGATCTGCCA TCACTGGAGA ATGTTGCCAA AGTGGTAATT
GACCACACTT CTGTCAATGG GGATATCAAA CCCATTCTGA TCTATTCGGA CAAACCGAAA
GTGGCGAAGA GCTGCTAG
 
Protein sequence
MSEKTGGEKL LYCSFCGKSQ HEVKKLIAGP SVFICDECIE LCNDIIREEI QGVEAAKLAK 
SDLPVPHEIR QILDQYVIGQ EQAKKILSVA VYNHYKRLRT LAKSADPDEI ELAKSNILLI
GPTGSGKTLL AQTLARLLDV PFVMADATTL TEAGYVGEDV ENIIQKLLQK CNYDAEKAQQ
GIVYIDEIDK ISRKSDNPSI TRDVSGEGVQ QALLKLIEGT VASVPPQGGR KHPNQEFVQV
DTTNILFICG GAFDGLEKII RARSEKGGIG FSASVRSQDN RKDFGAVLRG VEPEDLVKYG
LIPEFVGRLP VVATLEELDE AALIQILTEP RNALIKQYQK MFHMEGGIDL EFREQALKAI
ARKALVRKTG ARGLRSILEA ALLDTMFDLP SLENVAKVVI DHTSVNGDIK PILIYSDKPK
VAKSC
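
The sequences above are printed in blocks of ten characters, so the spacing has to be removed before any computation. The sketch below strips the whitespace, computes a GC percentage, and translates the gene sequence so the result can be compared with the protein sequence. It is an illustration, not the database's own method. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code, and it does not model alternative start codons, which is sufficient here because the gene begins with ATG. All names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1 and drop any trailing stop."""
    s = clean(seq)
    usable = len(s) - len(s) % 3
    protein = "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))
    return protein.rstrip("*")


# First and last lines of the gene sequence above.
print(translate("ATGTCAGAGA AAACTGGCGG AGAAAAACTG"))  # MSEKTGGEKL
print(translate("GTGGCGAAGA GCTGCTAG"))               # VAKSC
```

The two fragments reproduce the first ten and last five residues of the protein sequence. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field in the record.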