Gene RoseRS_3663 details


Gene Information

Locus tag: RoseRS_3663
Symbol: clpX
ID: 5210641
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 4585065
End bp: 4586375
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 62%
IMG OID: 640597256
Product: ATP-dependent protease ATP-binding subunit ClpX
Protein accession: YP_001277968
Protein GI: 148657763
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1219] ATP-dependent protease Clp, ATPase subunit
TIGRFAM ID: [TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX)
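
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon. The variable and function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4585065
end_bp = 4586375
gene_length_bp = 1311
protein_length_aa = 436

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1


def record_is_consistent():
    """Return True if the coordinates, gene length and protein length agree."""
    return (
        span_bp == gene_length_bp
        and gene_length_bp % 3 == 0
        and expected_protein_length == protein_length_aa
    )


if __name__ == "__main__":
    print(span_bp, expected_protein_length, record_is_consistent())
```

Under those assumptions the three fields agree with one another: the span equals the stated gene length, and the codon count minus the stop codon equals the stated protein length.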


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.76714
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.134529
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTCGCA CACGCAGCGG TAACGCAAAT TCGTCGAATA ATCGCGGTGC ATACCTCTGT 
TCGTTCTGTG GACGGGGACA GGAAGAGGTG CAGCGCCTGA TCGCCGGTCC CGGCAATGTG
TTTATCTGCG ATGAGTGCGT CGCGCTGTGC AGCGCGATCA TCGCCGAAGA AACCGGGACA
CGCCCGTCGA CCCGACGTTC CTCCGCCAGC CTGCCGGCGC GCCTGCCCAC GCCGCGCCGC
CTGCGCGAAT GGCTCGATCA GTATGTCATC GGGCAGGATC GCGCAAAAGT GGTGCTATCG
GTGGCGGTCT ATAACCACTA CAAGCGCCTC CGCGCCGGGC AGAATGCTGA TGATGTCGAG
ATCGGCAAGA GCAATATTTT GCTGATCGGT CCGACCGGCA GCGGAAAGAC GTTGCTGGCG
CAGACGCTGG CGCGAGTGCT CGATGTTCCC TTCGCTATCG CCGATGCCAC CGCGCTGACC
GAGGCAGGGT ACGTCGGCGA GGATGTCGAA AACATTCTCC TGCGGTTGAT CCAGGCTGCC
GAAGGTGATA TCGAACGCGC GCAGACCGGG ATCATCTACA TCGATGAGAT CGATAAAATT
GCGCGCAAGA GTGATAATCC GTCGATTACG CGCGATGTGT CGGGCGAAGG GGTGCAACAG
GCGTTGCTGA AGATTCTCGA AGGGTGCGTG GCGCATGTGC CGCCGGTTCC CGGTCGCAAA
CATCCGCAGC AGGAGTATAT TTCGTTCGAT ACGACCCACG TGCTCTTCAT CTGCGGCGGC
GCCTTCGAGG GTCTCGACAA AATCATCAGC CAGCGCATCG GCGGCAAGCG CAGCATCGGC
TTCCACGCTG GCGAGTCTTC CGATGCTCCG GCATCGTTGC TGTCGCAGGT CACGCCGGAT
GACCTGCTGC GCTACGGTTT CATCCCCGAA TTCGTCGGGC GGCTTCCGGT TGTCGCGGCG
CTCGATCCGC TCGATAAGCA GGCAATGATC CGCATTCTGA CCGAGCCGCG CAATGCGATC
ATCAAGCAGT ACCAGAAGAT GCTCGCCCTC GACCACGTTG AACTCGAGGT CACGCCCGAC
GCGCTTGAAG CGATTGCGGA GCGGGCGCTC AGATCAAAGA CGGGCGCGCG CGCGCTGCGC
ACGATCGTTG AGGAGATCCT GCTCGACGTG ATGTACGAAG TGCCTTCGCA GGAGCACATC
GGGCGTTGCA TCATCAACGC CGAAGTGGTC GAAGGGCGCG GGCACCCGAT CCTGGTGCCG
CGCTCCGCTG AACGGCAGGA GTACCGCCGA CGCATGGACG AGGCTGTGTA A
 
Protein sequence
MSRTRSGNAN SSNNRGAYLC SFCGRGQEEV QRLIAGPGNV FICDECVALC SAIIAEETGT 
RPSTRRSSAS LPARLPTPRR LREWLDQYVI GQDRAKVVLS VAVYNHYKRL RAGQNADDVE
IGKSNILLIG PTGSGKTLLA QTLARVLDVP FAIADATALT EAGYVGEDVE NILLRLIQAA
EGDIERAQTG IIYIDEIDKI ARKSDNPSIT RDVSGEGVQQ ALLKILEGCV AHVPPVPGRK
HPQQEYISFD TTHVLFICGG AFEGLDKIIS QRIGGKRSIG FHAGESSDAP ASLLSQVTPD
DLLRYGFIPE FVGRLPVVAA LDPLDKQAMI RILTEPRNAI IKQYQKMLAL DHVELEVTPD
ALEAIAERAL RSKTGARALR TIVEEILLDV MYEVPSQEHI GRCIINAEVV EGRGHPILVP
RSAERQEYRR RMDEAV
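
Both sequences above are displayed in space-separated blocks of ten characters, so the whitespace has to be removed before the sequences are used elsewhere. The following is a minimal sketch of that clean-up plus a GC-fraction helper; the function names are hypothetical, and the sample string is only the first displayed line of the gene sequence, not the full gene, so it is not expected to reproduce the GC content reported for the whole record.

```python
def join_blocks(text):
    """Remove all whitespace (spaces, newlines) from a block-formatted sequence."""
    return "".join(text.split())


def gc_fraction(sequence):
    """Return the fraction of G and C bases in a nucleotide sequence."""
    cleaned = join_blocks(sequence).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return gc_count / len(cleaned)


# First displayed line of the gene sequence, copied from this record.
first_line = (
    "ATGTCTCGCA CACGCAGCGG TAACGCAAAT TCGTCGAATA ATCGCGGTGC ATACCTCTGT"
)

if __name__ == "__main__":
    contiguous = join_blocks(first_line)
    print(len(contiguous), contiguous[:3])
    print(round(gc_fraction(first_line), 3))
```

Applying `join_blocks` to the complete gene and protein listings yields contiguous strings whose lengths can be compared with the Gene Length and Protein Length fields.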