Gene Rsph17029_1829 details


Gene Information

Locus tag: Rsph17029_1829
Symbol: clpX
ID: 4896905
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 1927275
End bp: 1928540
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 63%
IMG OID: 640112423
Product: ATP-dependent protease ATP-binding subunit ClpX
Protein accession: YP_001043708
Protein GI: 126462594
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1219] ATP-dependent protease Clp, ATPase subunit
TIGRFAM ID: [TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX)
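
The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the terminal stop codon


length = gene_length_bp(1927275, 1928540)
print(length)                     # 1266
print(protein_length_aa(length))  # 421
```

Both values agree with the Gene Length and Protein Length fields of the record.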


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.517377
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.735095
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGAACA ATACTGGCAG CGACAGCAAG AACACTCTCT ACTGCTCTTT CTGCGGCAAG 
AGCCAGCATG AGGTGCGCAA GCTGATCGCA GGGCCGACCG TGTTCATCTG CGATGAGTGC
GTCGAGCTCT GCATGGACAT CATCCGCGAA GAGACGAAAT CGACCGGCCT GAAGTCGGCC
GACGGCGTTC CGACCCCGCG CGAGATCTGC AAGGTGCTCG ATGATTACGT CATCGGGCAG
ATGCACGCCA AGCGCGTGCT GTCGGTCGCG GTGCACAACC ACTACAAGCG GCTCAATCAC
AGTTCCAAGA CCGACATCGA ACTCTCGAAG TCGAACATCC TGCTGATCGG CCCCACGGGC
TGCGGCAAGA CGCTGCTCGC GCAGACGCTT GCCCGCATCC TCGACGTGCC CTTCACCATG
GCGGATGCGA CCACGCTGAC CGAAGCGGGC TATGTCGGCG AGGATGTCGA GAACATCATC
CTGAAGCTGC TTCAGGCCTC GGAATACAAC GTCGAGCGGG CGCAGCGCGG CATCGTCTAT
ATCGACGAGG TCGACAAGAT CACCCGCAAG TCCGACAATC CCTCGATCAC GCGCGACGTG
TCCGGCGAGG GGGTGCAGCA GGCGCTGCTG AAGATCATGG AAGGCACCGT GGCGAGCGTT
CCCCCGCAGG GCGGGCGCAA GCATCCGCAA CAGGAATTCC TGCAGGTGGA CACGACGAAT
ATCCTGTTCA TCTGCGGCGG CGCCTTCGCG GGACTGGAAA AGATCATCGC GCAGCGCGGC
AAGGGCTCGG GCATCGGCTT CGGCGCCGAG GTCAAGGACC CGGATGCCCG CGGCGTGGGC
GAGCTCTTCA AGGAGCTCGA GCCCGAGGAT CTGCTGAAGT TCGGCCTGAT CCCCGAATTC
GTCGGCCGTC TGCCCGTGAT CGCGACCCTG ACCGACCTCG ACGAGGCGGC GCTGGTGACG
ATCCTGACCG AGCCGAAGAA TGCACTGGTG AAGCAGTATC AGCGGCTGTT CGAGATCGAG
GGCGTGAAGC TCACCTTCAC CGCCGACGCG CTGACCGCCA TCGCCAAGCG CGCGATCAAG
CGCAAGACCG GCGCCCGCGG CCTCCGCTCG ATCATGGAAG ACATCCTGCT CGACACGATG
TTCGAGCTTC CGGGCCTCGA GGGCGTGGAA GAGGTCGTCG TGAACGAGGA AGCGGTGAAC
TCCGGCGCGA AGCCGCTCCT GATCTACACC GAGGTCACGA AGAAGAAGGA CGCGACCGCC
AGCTGA
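
The gene sequence is displayed in space-separated blocks of 10 bases, so it needs cleaning before any analysis. The sketch below shows one way to do that and to compute a GC percentage that could be compared with the GC content field of the record. The helper names are hypothetical, and only the first displayed line is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence (six blocks of 10 bases).
first_line = "ATGGCGAACA ATACTGGCAG CGACAGCAAG AACACTCTCT ACTGCTCTTT CTGCGGCAAG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
```

Passing the full sequence text through `clean_sequence` and then `gc_percent` would give a figure to compare against the record.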
 
Protein sequence
MANNTGSDSK NTLYCSFCGK SQHEVRKLIA GPTVFICDEC VELCMDIIRE ETKSTGLKSA 
DGVPTPREIC KVLDDYVIGQ MHAKRVLSVA VHNHYKRLNH SSKTDIELSK SNILLIGPTG
CGKTLLAQTL ARILDVPFTM ADATTLTEAG YVGEDVENII LKLLQASEYN VERAQRGIVY
IDEVDKITRK SDNPSITRDV SGEGVQQALL KIMEGTVASV PPQGGRKHPQ QEFLQVDTTN
ILFICGGAFA GLEKIIAQRG KGSGIGFGAE VKDPDARGVG ELFKELEPED LLKFGLIPEF
VGRLPVIATL TDLDEAALVT ILTEPKNALV KQYQRLFEIE GVKLTFTADA LTAIAKRAIK
RKTGARGLRS IMEDILLDTM FELPGLEGVE EVVVNEEAVN SGAKPLLIYT EVTKKKDATA
S
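
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). The function name `translate` is hypothetical, and a library such as Biopython could be used instead.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a cleaned CDS, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, with block spacing removed.
print(translate("ATGGCGAACAATACTGGCAGCGACAGCAAG"))  # MANNTGSDSK
```

The output matches the first block of the protein sequence, and the final six bases `AGCTGA` translate to `S` followed by a stop codon, consistent with the last residue shown.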