Gene Haur_3829 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3829 
SymbolclpX 
ID5735693 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4807778 
End bp4809076 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content51% 
IMG OID641280981 
ProductATP-dependent protease ATP-binding subunit ClpX 
Protein accessionYP_001546593 
Protein GI159900346 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1219] ATP-dependent protease Clp, ATPase subunit 
TIGRFAM ID[TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00448797 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCAAT CCGCCTTTAT GCCACCGACC TACTACTGCT CGTTTTGCGG GCGGAACCAA 
GATGAAGTCG ATCGCTTGGT CACAGGTCCG GGGGCATTAT TTATTTGCAA CGAGTGCATT
GAGCTGTTAA GCGCAATTAT TGCCAACGAG GAGCGTAAAG AAGCTCCCCA TGCCCCAATT
CTCCCACCAA CACTGCCGAT TCCCCATGCC ATTCGTGATC ATCTCGATGA ATATGTAATT
GGCCAAGATC GGGCGAAAAA GGTGATGGCG GTGGCGGTGT ACAACCACTA TAAGCGGTTG
CGTGCCCAAG CTCAAGGCGA TACCGATGTC GAAATTCAAA AGAGCAATAT TTTGTTAGTT
GGGCCAACTG GCTCAGGCAA AACGCTGCTT GCCCAAACCC TAGCTCGTAT GCTCGATGTG
CCATTTGCCA TCGCCGATGC CACCGCATTA ACCGAGGCTG GGTATGTTGG CGAAGATGTC
GAGACGATTC TGCTGCGCTT GATTCAAGCC GCCGATGGCG ATGTTGATCG CGCTCAAATG
GGCATTTTGT ATATCGACGA AATCGATAAA ATTGCCCGCA AAGCCGATAA TCCATCGATC
ACGCGTGATG TTTCGGGCGA GGGTGTGCAA CAAGCTTTGC TGAAAATCCT CGAAGGCGGG
GTGGTCAATG TGCCGCCAAT GCCAGGCCGC AAACATCCGC AGCAAGAATT TATTCCCTTT
GATACCACCA ATGTGCTGTT TATTTGTGGT GGGGCGTTCG AAGGCTTGGA GCATCATATT
GCTGAACGGA TGGGTAGTGG CGGAACCTTA GGCTTTGGCA AGACGATCGT CAAAGAAGAA
CGGCTCGAAC GTTCTAAGAA ATTGCTATCG TTGGTCAACC CCGACGATTT GCTGCACTTT
GGTTTTATTC CTGAGTTTAT CGGACGTATG CCGGTTGTCG CCGCGCTCAC GCCGCTCGAT
AAAGATGCCA TGATGCGGAT TTTGACCGAG CCACGCAACG CAATCATCAA GCAATATCAA
AAAATGTTGG CGCTTGATCA TGTGCAACTC GAAGTCAGCG GCGATGCCAT GGAAGCAATC
GTTGAGCGAG CGTTGGCTGG CAAAACAGGC GCTCGTGGTT TGCGCACTGC CGTTGAAGAA
ATTTTGCTCG ATGTGATGTT TGATTTGCCG TCGGAAACCG ATGTAGTGCG CTGTGTAATT
ACCGCTGAAA CGGTGCGTGA TGGTGCAATG CCAACCCTGA TTCGGCGAAC GAGCAGCCGC
AGTCGAGCTG GTAAACAACC AACCACCAAA GCTAGTTAA
 
Protein sequence
MAQSAFMPPT YYCSFCGRNQ DEVDRLVTGP GALFICNECI ELLSAIIANE ERKEAPHAPI 
LPPTLPIPHA IRDHLDEYVI GQDRAKKVMA VAVYNHYKRL RAQAQGDTDV EIQKSNILLV
GPTGSGKTLL AQTLARMLDV PFAIADATAL TEAGYVGEDV ETILLRLIQA ADGDVDRAQM
GILYIDEIDK IARKADNPSI TRDVSGEGVQ QALLKILEGG VVNVPPMPGR KHPQQEFIPF
DTTNVLFICG GAFEGLEHHI AERMGSGGTL GFGKTIVKEE RLERSKKLLS LVNPDDLLHF
GFIPEFIGRM PVVAALTPLD KDAMMRILTE PRNAIIKQYQ KMLALDHVQL EVSGDAMEAI
VERALAGKTG ARGLRTAVEE ILLDVMFDLP SETDVVRCVI TAETVRDGAM PTLIRRTSSR
SRAGKQPTTK AS