Gene Information |
Locus tag | TM1040_1526 |
Symbol | clpX |
ID | 4075824 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 1629344 |
End bp | 1630609 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 638006839 |
Product | ATP-dependent protease ATP-binding subunit ClpX |
Protein accession | YP_613521 |
Protein GI | 99081367 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1219] ATP-dependent protease Clp, ATPase subunit |
TIGRFAM ID | [TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX) |
|
|
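The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon; both assumptions agree with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the terminating stop codon


length = gene_length(1629344, 1630609)
print(length)                           # 1266
print(expected_protein_length(length))  # 421
```

Both results match the Gene Length (1266 bp) and Protein Length (421 aa) fields of the record.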
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.86122 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.530308 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGACGA ACTCGAGCGG CGACAGCAAG AACACGCTTT ACTGTAGCTT CTGCGGCAAG AGCCAGCATG AGGTGAGAAA GCTGATTGCA GGCCCGACCG TGTTCATCTG TGATGAATGC GTTGAGCTGT GCATGGACAT CATCCGCGAG GAGACGAAGG CCTCCGGGAT GAAAGCAACC GACGGCGTGC CGACGCCCAA GGATATTTGC GAGGTCCTCG ATGATTACGT GATCGGTCAG GCGACGGCAA AGCGTGTGCT GTCGGTGGCG GTTCACAACC ACTACAAGCG TCTGAACCAC GCCCAGAAGG CCGGCAATGA TATTGAACTT TCAAAGTCCA ACATCCTGCT GATCGGCCCC ACCGGCTGCG GTAAGACCCT TCTGGCGCAA ACTCTTGCAC GGATTCTGGA CGTGCCGTTT ACCATGGCAG ATGCCACTAC GCTTACAGAG GCAGGCTATG TTGGTGAGGA TGTCGAGAAC ATCATTCTGA AACTGCTGCA GGCGTCTGAA TACAATGTCG AACGCGCGCA GCGCGGTATC GTCTACATCG ACGAGGTCGA CAAGATCACC CGCAAGTCTG AAAACCCCTC CATCACCCGT GATGTGTCGG GCGAGGGCGT GCAGCAGGCT CTGCTGAAAC TGATGGAAGG CACTGTGGCC TCCGTGCCGC CGCAGGGTGG GCGCAAGCAT CCCCAGCAGG AGTTCCTGCA GGTGGATACC ACGAACATCC TCTTCATCTG CGGCGGTGCC TTTGCGGGCC TGGACAAGAT CATCAAGCAG CGCGGCAAAG GCTCTGCGAT GGGCTTTGGT GCCGATGTGC GCGAAGAGAG CGATGCGGGC GTGGGCGAGA CCTTCCGCGA TCTCGAGCCC GAAGATCTGC TGAAATTCGG CCTGATCCCG GAATTCGTGG GCCGTTTGCC GGTTCTCGCG ACGCTTGAGG ATCTGGATGA GGATGCGCTG ATCACCATCT TGACCAAGCC CAAGAATGCT TTGGTCAAAC AATACCAGCG CCTCTTTGAA CTTGAAGACA CCGAGCTGGA CTTCACCGAT GAGGCGCTCT CGGCCATTGC CAAGAAAGCC ATTGAGCGCA AGACCGGCGC GCGGGGTCTG CGCTCCATCC TCGAGGATAT CCTGCTCGAT ACCATGTTCG AGCTGCCCGG AATGGAGAGC GTGACAAAAG TGGTCGTCAA TGAGGAGGCC GTCTGCTCTG AGGCGCAGCC GCTGATGATC CACGCCGATG AAAAGGAATC GGCCACCGCA GGTTGA
|
Protein sequence | MATNSSGDSK NTLYCSFCGK SQHEVRKLIA GPTVFICDEC VELCMDIIRE ETKASGMKAT DGVPTPKDIC EVLDDYVIGQ ATAKRVLSVA VHNHYKRLNH AQKAGNDIEL SKSNILLIGP TGCGKTLLAQ TLARILDVPF TMADATTLTE AGYVGEDVEN IILKLLQASE YNVERAQRGI VYIDEVDKIT RKSENPSITR DVSGEGVQQA LLKLMEGTVA SVPPQGGRKH PQQEFLQVDT TNILFICGGA FAGLDKIIKQ RGKGSAMGFG ADVREESDAG VGETFRDLEP EDLLKFGLIP EFVGRLPVLA TLEDLDEDAL ITILTKPKNA LVKQYQRLFE LEDTELDFTD EALSAIAKKA IERKTGARGL RSILEDILLD TMFELPGMES VTKVVVNEEA VCSEAQPLMI HADEKESATA G
|
| |
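The gene sequence is listed in coding orientation (it begins with ATG and ends with TGA), in space-separated blocks of ten bases. A minimal sketch of how such a field might be cleaned, translated, and summarised is shown below. It assumes the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code, and it does not handle the alternative start codons that table 11 permits, since this gene starts with ATG. The helper names are hypothetical.

```python
from itertools import product

# Codon -> amino acid map in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
print(translate("ATGGCGACGA ACTCGAGCGG CGACAGCAAG"))  # MATNSSGDSK
```

The ten residues printed match the first block of the protein sequence field. Passing the full gene sequence to `gc_fraction` gives a value that can be compared against the GC content field of the record.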