Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3829 |
Symbol | clpX |
ID | 5735693 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4807778 |
End bp | 4809076 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641280981 |
Product | ATP-dependent protease ATP-binding subunit ClpX |
Protein accession | YP_001546593 |
Protein GI | 159900346 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1219] ATP-dependent protease Clp, ATPase subunit |
TIGRFAM ID | [TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00448797 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCAAT CCGCCTTTAT GCCACCGACC TACTACTGCT CGTTTTGCGG GCGGAACCAA GATGAAGTCG ATCGCTTGGT CACAGGTCCG GGGGCATTAT TTATTTGCAA CGAGTGCATT GAGCTGTTAA GCGCAATTAT TGCCAACGAG GAGCGTAAAG AAGCTCCCCA TGCCCCAATT CTCCCACCAA CACTGCCGAT TCCCCATGCC ATTCGTGATC ATCTCGATGA ATATGTAATT GGCCAAGATC GGGCGAAAAA GGTGATGGCG GTGGCGGTGT ACAACCACTA TAAGCGGTTG CGTGCCCAAG CTCAAGGCGA TACCGATGTC GAAATTCAAA AGAGCAATAT TTTGTTAGTT GGGCCAACTG GCTCAGGCAA AACGCTGCTT GCCCAAACCC TAGCTCGTAT GCTCGATGTG CCATTTGCCA TCGCCGATGC CACCGCATTA ACCGAGGCTG GGTATGTTGG CGAAGATGTC GAGACGATTC TGCTGCGCTT GATTCAAGCC GCCGATGGCG ATGTTGATCG CGCTCAAATG GGCATTTTGT ATATCGACGA AATCGATAAA ATTGCCCGCA AAGCCGATAA TCCATCGATC ACGCGTGATG TTTCGGGCGA GGGTGTGCAA CAAGCTTTGC TGAAAATCCT CGAAGGCGGG GTGGTCAATG TGCCGCCAAT GCCAGGCCGC AAACATCCGC AGCAAGAATT TATTCCCTTT GATACCACCA ATGTGCTGTT TATTTGTGGT GGGGCGTTCG AAGGCTTGGA GCATCATATT GCTGAACGGA TGGGTAGTGG CGGAACCTTA GGCTTTGGCA AGACGATCGT CAAAGAAGAA CGGCTCGAAC GTTCTAAGAA ATTGCTATCG TTGGTCAACC CCGACGATTT GCTGCACTTT GGTTTTATTC CTGAGTTTAT CGGACGTATG CCGGTTGTCG CCGCGCTCAC GCCGCTCGAT AAAGATGCCA TGATGCGGAT TTTGACCGAG CCACGCAACG CAATCATCAA GCAATATCAA AAAATGTTGG CGCTTGATCA TGTGCAACTC GAAGTCAGCG GCGATGCCAT GGAAGCAATC GTTGAGCGAG CGTTGGCTGG CAAAACAGGC GCTCGTGGTT TGCGCACTGC CGTTGAAGAA ATTTTGCTCG ATGTGATGTT TGATTTGCCG TCGGAAACCG ATGTAGTGCG CTGTGTAATT ACCGCTGAAA CGGTGCGTGA TGGTGCAATG CCAACCCTGA TTCGGCGAAC GAGCAGCCGC AGTCGAGCTG GTAAACAACC AACCACCAAA GCTAGTTAA
|
Protein sequence | MAQSAFMPPT YYCSFCGRNQ DEVDRLVTGP GALFICNECI ELLSAIIANE ERKEAPHAPI LPPTLPIPHA IRDHLDEYVI GQDRAKKVMA VAVYNHYKRL RAQAQGDTDV EIQKSNILLV GPTGSGKTLL AQTLARMLDV PFAIADATAL TEAGYVGEDV ETILLRLIQA ADGDVDRAQM GILYIDEIDK IARKADNPSI TRDVSGEGVQ QALLKILEGG VVNVPPMPGR KHPQQEFIPF DTTNVLFICG GAFEGLEHHI AERMGSGGTL GFGKTIVKEE RLERSKKLLS LVNPDDLLHF GFIPEFIGRM PVVAALTPLD KDAMMRILTE PRNAIIKQYQ KMLALDHVQL EVSGDAMEAI VERALAGKTG ARGLRTAVEE ILLDVMFDLP SETDVVRCVI TAETVRDGAM PTLIRRTSSR SRAGKQPTTK AS
|
| |