Gene TM1040_1526 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1526 
SymbolclpX 
ID4075824 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1629344 
End bp1630609 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content58% 
IMG OID638006839 
ProductATP-dependent protease ATP-binding subunit ClpX 
Protein accessionYP_613521 
Protein GI99081367 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1219] ATP-dependent protease Clp, ATPase subunit 
TIGRFAM ID[TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX) 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.86122 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.530308 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGACGA ACTCGAGCGG CGACAGCAAG AACACGCTTT ACTGTAGCTT CTGCGGCAAG 
AGCCAGCATG AGGTGAGAAA GCTGATTGCA GGCCCGACCG TGTTCATCTG TGATGAATGC
GTTGAGCTGT GCATGGACAT CATCCGCGAG GAGACGAAGG CCTCCGGGAT GAAAGCAACC
GACGGCGTGC CGACGCCCAA GGATATTTGC GAGGTCCTCG ATGATTACGT GATCGGTCAG
GCGACGGCAA AGCGTGTGCT GTCGGTGGCG GTTCACAACC ACTACAAGCG TCTGAACCAC
GCCCAGAAGG CCGGCAATGA TATTGAACTT TCAAAGTCCA ACATCCTGCT GATCGGCCCC
ACCGGCTGCG GTAAGACCCT TCTGGCGCAA ACTCTTGCAC GGATTCTGGA CGTGCCGTTT
ACCATGGCAG ATGCCACTAC GCTTACAGAG GCAGGCTATG TTGGTGAGGA TGTCGAGAAC
ATCATTCTGA AACTGCTGCA GGCGTCTGAA TACAATGTCG AACGCGCGCA GCGCGGTATC
GTCTACATCG ACGAGGTCGA CAAGATCACC CGCAAGTCTG AAAACCCCTC CATCACCCGT
GATGTGTCGG GCGAGGGCGT GCAGCAGGCT CTGCTGAAAC TGATGGAAGG CACTGTGGCC
TCCGTGCCGC CGCAGGGTGG GCGCAAGCAT CCCCAGCAGG AGTTCCTGCA GGTGGATACC
ACGAACATCC TCTTCATCTG CGGCGGTGCC TTTGCGGGCC TGGACAAGAT CATCAAGCAG
CGCGGCAAAG GCTCTGCGAT GGGCTTTGGT GCCGATGTGC GCGAAGAGAG CGATGCGGGC
GTGGGCGAGA CCTTCCGCGA TCTCGAGCCC GAAGATCTGC TGAAATTCGG CCTGATCCCG
GAATTCGTGG GCCGTTTGCC GGTTCTCGCG ACGCTTGAGG ATCTGGATGA GGATGCGCTG
ATCACCATCT TGACCAAGCC CAAGAATGCT TTGGTCAAAC AATACCAGCG CCTCTTTGAA
CTTGAAGACA CCGAGCTGGA CTTCACCGAT GAGGCGCTCT CGGCCATTGC CAAGAAAGCC
ATTGAGCGCA AGACCGGCGC GCGGGGTCTG CGCTCCATCC TCGAGGATAT CCTGCTCGAT
ACCATGTTCG AGCTGCCCGG AATGGAGAGC GTGACAAAAG TGGTCGTCAA TGAGGAGGCC
GTCTGCTCTG AGGCGCAGCC GCTGATGATC CACGCCGATG AAAAGGAATC GGCCACCGCA
GGTTGA
 
Protein sequence
MATNSSGDSK NTLYCSFCGK SQHEVRKLIA GPTVFICDEC VELCMDIIRE ETKASGMKAT 
DGVPTPKDIC EVLDDYVIGQ ATAKRVLSVA VHNHYKRLNH AQKAGNDIEL SKSNILLIGP
TGCGKTLLAQ TLARILDVPF TMADATTLTE AGYVGEDVEN IILKLLQASE YNVERAQRGI
VYIDEVDKIT RKSENPSITR DVSGEGVQQA LLKLMEGTVA SVPPQGGRKH PQQEFLQVDT
TNILFICGGA FAGLDKIIKQ RGKGSAMGFG ADVREESDAG VGETFRDLEP EDLLKFGLIP
EFVGRLPVLA TLEDLDEDAL ITILTKPKNA LVKQYQRLFE LEDTELDFTD EALSAIAKKA
IERKTGARGL RSILEDILLD TMFELPGMES VTKVVVNEEA VCSEAQPLMI HADEKESATA
G