Gene RPD_2893 details


Gene Information

Locus tag: RPD_2893
Symbol: clpX
ID: 4023393
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 3222137
End bp: 3223411
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 62%
IMG OID: 637963093
Product: ATP-dependent protease ATP-binding subunit ClpX
Protein accession: YP_570022
Protein GI: 91977363
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1219] ATP-dependent protease Clp, ATPase subunit
TIGRFAM ID: [TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX)
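
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the record itself. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3222137
end_bp = 3223411

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of three bases each.
codon_count, leftover_bases = divmod(gene_length_bp, 3)

# Assumption: the last codon is a stop codon and encodes no amino acid.
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1275
print(leftover_bases)     # 0
print(protein_length_aa)  # 424
```

Under these assumptions the result agrees with the listed Gene Length (1275 bp) and Protein Length (424 aa).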


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.313162
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.205754
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTAAGG TCGGTACGAG CGACTCCAAG AACACCCTGT ACTGCTCGTT CTGCGGAAAG 
AGCCAGCACG AGGTCCGCAA GCTGATCGCC GGCCCCACGG TATTCATCTG TGATGAATGC
GTGGAACTGT GCATGGACAT CATCCGCGAA GAGAACAAAT CTTCGCTGGT GAAGTCGCGC
GACGGCATCC CGACCCCGAA GGAGATCTGC AAGGTCCTCG ACGATTATGT GATCGGCCAG
GGCCATGCGA AGAAGGTGCT CTCGGTCGCG GTGCACAACC ACTACAAGCG GCTGAATCAC
CAGACCAAGC ACAACGACGT CGAGCTCGCG AAGTCGAACA TCCTGCTGAT CGGTCCGACC
GGCTCGGGCA AGACGCTGCT GGCGCAGACG CTGGCGCGGA TTCTCGACGT GCCGTTCACG
ATGGCGGATG CGACGACGCT GACCGAAGCC GGCTATGTCG GCGAGGATGT CGAGAACATC
ATTCTGAAGC TGCTGCAGGC TGCCGACTAC AACGTCGAGC GGGCGCAGCG CGGTATCGTC
TATATCGACG AAATCGACAA GATTTCGCGC AAGTCGGACA ATCCCTCGAT CACCCGCGAC
GTGTCGGGCG AGGGCGTCCA GCAGGCGCTG CTGAAGATCA TGGAAGGCAC GGTGGCTTCG
GTCCCGCCGC AGGGCGGCCG CAAGCATCCG CAGCAGGAGT TCCTGCAGGT CGACACCACC
AACATTCTGT TCATCTGCGG TGGTGCGTTC GCGGGCCTTG AAAAGATCAT CTCGGCGCGC
GGCCGTTCGA CCTCGATCGG CTTCGCGGCG CAGGTGCTCG CGCCTGAAGA TCGCCGCACC
GGCGAAATCT TCCGTCACGT CGAGCCGGAA GACTTGCTGA AGTACGGCCT GATCCCGGAA
TTCGTCGGCC GTCTGCCCGT CGTGGCGACG CTCGAGGATC TCGATGAGGC CTCGCTGAAG
AAGATCCTGA CCGACCCGAA GAACGCGCTG GTCAAGCAGT ATCAGCGGCT GTTCGAAATG
GAGAACATCG AACTCACCTT CGCCGACGAG GCGCTTGGTG CGGTGGCGCG CAAGGCGATC
GAGCGCAAGA CCGGCGCCCG CGGCCTGCGG TCGATTCTGG AGAGCATTCT GCTCGAGACG
ATGTTCGATC TGCCGGGTCT CGAAGGCGTC GAGGAAGTTG TGATCTCGCG CGAAGTCGTC
GATGGAACAG CGCGTCCGCT CTACATTTAT GCCGACCGTT CAGACCGGGC GGCGGAGAGC
AGCGCCAGCG CGTAA
 
Protein sequence
MSKVGTSDSK NTLYCSFCGK SQHEVRKLIA GPTVFICDEC VELCMDIIRE ENKSSLVKSR 
DGIPTPKEIC KVLDDYVIGQ GHAKKVLSVA VHNHYKRLNH QTKHNDVELA KSNILLIGPT
GSGKTLLAQT LARILDVPFT MADATTLTEA GYVGEDVENI ILKLLQAADY NVERAQRGIV
YIDEIDKISR KSDNPSITRD VSGEGVQQAL LKIMEGTVAS VPPQGGRKHP QQEFLQVDTT
NILFICGGAF AGLEKIISAR GRSTSIGFAA QVLAPEDRRT GEIFRHVEPE DLLKYGLIPE
FVGRLPVVAT LEDLDEASLK KILTDPKNAL VKQYQRLFEM ENIELTFADE ALGAVARKAI
ERKTGARGLR SILESILLET MFDLPGLEGV EEVVISREVV DGTARPLYIY ADRSDRAAES
SASA
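
The relationship between the gene sequence and the protein sequence can be checked by translating the DNA codon by codon. The sketch below is illustrative only. It builds the codon table from the standard compact amino-acid string, whose codon-to-amino-acid assignments are the same for translation table 11 as for the standard code. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical and do not come from any database tool. It does not handle alternative start codons, which is sufficient here because this gene begins with ATG.

```python
# Codon table in the conventional TCAG ordering; translation table 11
# uses the same amino-acid assignments as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First two blocks and the last line of the gene sequence in this record.
first_blocks = "ATGAGTAAGG TCGGTACGAG"
last_line = "AGCGCCAGCG CGTAA"

print(translate(first_blocks))  # MSKVGT
print(translate(last_line))     # SASA
```

The two fragments translate to the first six and last four residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the whole protein and the listed GC content.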