Gene Hhal_0602 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0602 
SymbolclpX 
ID4709296 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp681128 
End bp682408 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content65% 
IMG OID639855062 
ProductATP-dependent protease ATP-binding subunit ClpX 
Protein accessionYP_001002190 
Protein GI121997403 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1219] ATP-dependent protease Clp, ATPase subunit 
TIGRFAM ID[TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX) 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGATC GAACTCAGAA CAAGGGCGAT GACAGCGGCA AGCTGCTGTA CTGCTCCTTC 
TGCGGGAAGA GCCAGCACGA GGTCCGCAAG CTCATCGCCG GGCCATCGGT GTTCATCTGT
GACGAGTGCG TCGAGCTCTG CAACGACATC ATCCGCGAGG AGCTCCAGGA AGGTGCGGCG
ACGGAAGGCG GCGGGCTGCC CCGGCCCCAC GAGATCAACC GCGAGCTCGA CCAGTACGTC
ATTGGCCAGG AGCACGCCAA GAAGGTGCTC TCGGTGGCGG TGTACAACCA CTACAAGCGC
CTCGAGAGCC GGACCAGCCA GGACGATGTG GAGCTGACCA AGAGCAACAT CCTGCTCATC
GGCCCCACCG GCTCGGGCAA GACGTTGCTC GCCGAGACCC TGGCGCGGCT GCTCAACGTG
CCGTTCACCA TCGCCGACGC CACGACTCTG ACCGAGGCCG GGTATGTCGG TGAGGATGTC
GAGAACATCA TCCAGAAGTT GCTGCAGAAG TGCGATTACG ACGTCGAGAA GGCGCAGCAC
GGCATCGTCT ACATCGACGA GATCGACAAG GTCTCGCGCA AGGCGGACAA CCCGTCGATC
ACCCGGGACG TCTCCGGCGA GGGGGTGCAG CAGGCGCTGC TCAAGCTGAT CGAGGGGACC
ACGGCCTCGG TGCCCCCGCA GGGTGGGCGC AAGCACCCTC AGCAGGAGTT CGTGCAGGTC
GATACCAGCA ACATGTTGTT CATCTGCGGC GGCGCTTTCG CCGGGCTCGA CAAGGTCATC
CAGGAGCGCT CCGAGCGGGG CGGGATCGGC TTCTCGGCCG AGATCAAGGG CGAGGGCGAG
CGAGCTAGCG TCGGCGAGAC GCTGCAGACC GTTGAACCCA GCGATCTGGT CCGCTACGGC
CTGATCCCGG AGTTTGTCGG CCGGCTGCCG GTCATCGCGA CCCTCAACGA GCTCGATCAG
GAGGCCCTGG TGCAGATCCT CCGCGAGCCG AAGAACGCGC TGGTTAAGCA GTACCAGAAG
CTCTTTGAGA TGGAGGGTGT GGAGCTCGAC CTGCGCGATG ACGCCCTGCG GGCGGTGGCC
GACAAGGCGA TGGAGCGCAA GACCGGTGCC CGCGGGCTGC GCACCATCAT CGAGCAGGTG
CTGCTCGAGA CCATGTACGA GCTCCCGTCC ATGGAGAATG TCAGCAAAGT GGTGGTCGAC
GAGTCGGTCA TCGCCGGCGA CAGCGACCCG TACATCGTCT ACGCGGGCCC CGAGCATTCC
AAGGCGGCGT CCGACGAGTA G
 
Protein sequence
MTDRTQNKGD DSGKLLYCSF CGKSQHEVRK LIAGPSVFIC DECVELCNDI IREELQEGAA 
TEGGGLPRPH EINRELDQYV IGQEHAKKVL SVAVYNHYKR LESRTSQDDV ELTKSNILLI
GPTGSGKTLL AETLARLLNV PFTIADATTL TEAGYVGEDV ENIIQKLLQK CDYDVEKAQH
GIVYIDEIDK VSRKADNPSI TRDVSGEGVQ QALLKLIEGT TASVPPQGGR KHPQQEFVQV
DTSNMLFICG GAFAGLDKVI QERSERGGIG FSAEIKGEGE RASVGETLQT VEPSDLVRYG
LIPEFVGRLP VIATLNELDQ EALVQILREP KNALVKQYQK LFEMEGVELD LRDDALRAVA
DKAMERKTGA RGLRTIIEQV LLETMYELPS MENVSKVVVD ESVIAGDSDP YIVYAGPEHS
KAASDE