Gene Information |
Locus tag | Hhal_0602 |
Symbol | clpX |
ID | 4709296 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 681128 |
End bp | 682408 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639855062 |
Product | ATP-dependent protease ATP-binding subunit ClpX |
Protein accession | YP_001002190 |
Protein GI | 121997403 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1219] ATP-dependent protease Clp, ATPase subunit |
TIGRFAM ID | [TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX) |
|
|
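The Gene Length and Protein Length fields follow from the Start bp and End bp values. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive at both ends (the record does not state this, but it is the only reading consistent with the listed length). The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 681128
end_bp = 682408

# Assumption: both coordinates are inclusive, so the span needs a +1.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is the stop codon,
# which is not counted in the protein length.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1281 426
```

Both results agree with the Gene Length (1281 bp) and Protein Length (426 aa) fields.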
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGATC GAACTCAGAA CAAGGGCGAT GACAGCGGCA AGCTGCTGTA CTGCTCCTTC TGCGGGAAGA GCCAGCACGA GGTCCGCAAG CTCATCGCCG GGCCATCGGT GTTCATCTGT GACGAGTGCG TCGAGCTCTG CAACGACATC ATCCGCGAGG AGCTCCAGGA AGGTGCGGCG ACGGAAGGCG GCGGGCTGCC CCGGCCCCAC GAGATCAACC GCGAGCTCGA CCAGTACGTC ATTGGCCAGG AGCACGCCAA GAAGGTGCTC TCGGTGGCGG TGTACAACCA CTACAAGCGC CTCGAGAGCC GGACCAGCCA GGACGATGTG GAGCTGACCA AGAGCAACAT CCTGCTCATC GGCCCCACCG GCTCGGGCAA GACGTTGCTC GCCGAGACCC TGGCGCGGCT GCTCAACGTG CCGTTCACCA TCGCCGACGC CACGACTCTG ACCGAGGCCG GGTATGTCGG TGAGGATGTC GAGAACATCA TCCAGAAGTT GCTGCAGAAG TGCGATTACG ACGTCGAGAA GGCGCAGCAC GGCATCGTCT ACATCGACGA GATCGACAAG GTCTCGCGCA AGGCGGACAA CCCGTCGATC ACCCGGGACG TCTCCGGCGA GGGGGTGCAG CAGGCGCTGC TCAAGCTGAT CGAGGGGACC ACGGCCTCGG TGCCCCCGCA GGGTGGGCGC AAGCACCCTC AGCAGGAGTT CGTGCAGGTC GATACCAGCA ACATGTTGTT CATCTGCGGC GGCGCTTTCG CCGGGCTCGA CAAGGTCATC CAGGAGCGCT CCGAGCGGGG CGGGATCGGC TTCTCGGCCG AGATCAAGGG CGAGGGCGAG CGAGCTAGCG TCGGCGAGAC GCTGCAGACC GTTGAACCCA GCGATCTGGT CCGCTACGGC CTGATCCCGG AGTTTGTCGG CCGGCTGCCG GTCATCGCGA CCCTCAACGA GCTCGATCAG GAGGCCCTGG TGCAGATCCT CCGCGAGCCG AAGAACGCGC TGGTTAAGCA GTACCAGAAG CTCTTTGAGA TGGAGGGTGT GGAGCTCGAC CTGCGCGATG ACGCCCTGCG GGCGGTGGCC GACAAGGCGA TGGAGCGCAA GACCGGTGCC CGCGGGCTGC GCACCATCAT CGAGCAGGTG CTGCTCGAGA CCATGTACGA GCTCCCGTCC ATGGAGAATG TCAGCAAAGT GGTGGTCGAC GAGTCGGTCA TCGCCGGCGA CAGCGACCCG TACATCGTCT ACGCGGGCCC CGAGCATTCC AAGGCGGCGT CCGACGAGTA G
|
Protein sequence | MTDRTQNKGD DSGKLLYCSF CGKSQHEVRK LIAGPSVFIC DECVELCNDI IREELQEGAA TEGGGLPRPH EINRELDQYV IGQEHAKKVL SVAVYNHYKR LESRTSQDDV ELTKSNILLI GPTGSGKTLL AETLARLLNV PFTIADATTL TEAGYVGEDV ENIIQKLLQK CDYDVEKAQH GIVYIDEIDK VSRKADNPSI TRDVSGEGVQ QALLKLIEGT TASVPPQGGR KHPQQEFVQV DTSNMLFICG GAFAGLDKVI QERSERGGIG FSAEIKGEGE RASVGETLQT VEPSDLVRYG LIPEFVGRLP VIATLNELDQ EALVQILREP KNALVKQYQK LFEMEGVELD LRDDALRAVA DKAMERKTGA RGLRTIIEQV LLETMYELPS MENVSKVVVD ESVIAGDSDP YIVYAGPEHS KAASDE
|
| |
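The protein sequence can be derived from the gene sequence by reading it in codons until the stop codon. The following is a minimal sketch, not the tool used to produce this record. It assumes the gene sequence is already given in coding orientation (it begins with ATG even though the gene lies on the minus strand), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in which start codons are permitted. The names `translate` and `CODON_TABLE` are hypothetical helpers introduced for illustration.

```python
# Codon table built from the conventional TCAG ordering.
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    # The record prints the sequence in space-separated blocks of 10 bases.
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
print(translate("ATGACTGATC GAACTCAGAA CAAGGGCGAT"))  # MTDRTQNKGD
```

The output matches the first block of the Protein sequence field. Passing the full Gene sequence value to `translate` should yield the full protein under the same assumptions.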