Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_4635 |
Symbol | |
ID | 3912452 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 5238884 |
End bp | 5239834 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637886539 |
Product | heat shock protein HtpX |
Protein accession | YP_488229 |
Protein GI | 86751733 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0501] Zn-dependent protease with chaperone function |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTACT TCAAAACCGC GATGTTGCTC GCCGGTATGA CCGCGCTGTT CATGGGCGTC GGCTACCTGA TCGGCGGCGC CAGCGGCGCG ATGATCGCGC TGGTCGTCGC TGCGGCGATG AACATCTTCA CCTACTGGAA TTCCGACCGG ATGGTGCTGT CGATGTACGG CGCGCAGCAG GTCGACGAAC GCTCCGCGCC GGACCTGGTG CGGATGGTCG CCGAGCTGGC GGGTCGCGCC GGCCTGCCGA TGCCGCGCGT GTTCATCATG GACAACCCGC AGCCGAACGC GTTCGCGACC GGCCGCAATC CGGAGAACGC CGCGGTCGCC GTCACCACCG GCCTGATGCA GTCGCTCAGC CGCGAGGAAC TCGCCGGCGT GGTGGCGCAC GAGCTCGCGC ACATCAAGAA CCACGACACG CTGCTGATGA CCATCACCGC GACGATCGCC GGTGCGATCT CGATGGTGGC GCAGTTCGGC ATGTTCTTCG GCGGCAACCG CGAGAACAAC AACGGCCCGG GCCTGATCGG CTCGATTGCA CTGATGATCC TGGCACCGCT GGGCGCCATG TTGGTGCAGA TGGCGATCAG CCGGACCCGT GAATACGCCG CGGACGAAAT GGGCGCGCGG ATCTGCGGCC AGCCGATGTG GCTCGCTTCC GCGCTCGGCC GGATTGAAAA CGCTGCGCAT CAGGTGCCGA ACTACGAGGC CGAGCGGGCG CCGGCGACCG CGCATATGTT CATCATCAAC CCGCTGTCGG GTCAGGGCAT GGACAATCTG TTCTCGACCC ATCCAGCGAC CGCAAACCGG GTCGCTGCAT TGCAGCGGCT GGCCGGTGAG ATCGGCGGCG GTGGCGCGAG CTTCGGCCGG CCGACCCCGT CACCGCGCGG GCCGTGGAGC GGTGCGCCGC GCGGCAGCGG CGAGCCCCGC GCGCGCGGCC CGTGGGGTTG A
|
Protein sequence | MSYFKTAMLL AGMTALFMGV GYLIGGASGA MIALVVAAAM NIFTYWNSDR MVLSMYGAQQ VDERSAPDLV RMVAELAGRA GLPMPRVFIM DNPQPNAFAT GRNPENAAVA VTTGLMQSLS REELAGVVAH ELAHIKNHDT LLMTITATIA GAISMVAQFG MFFGGNRENN NGPGLIGSIA LMILAPLGAM LVQMAISRTR EYAADEMGAR ICGQPMWLAS ALGRIENAAH QVPNYEAERA PATAHMFIIN PLSGQGMDNL FSTHPATANR VAALQRLAGE IGGGGASFGR PTPSPRGPWS GAPRGSGEPR ARGPWG
|
| |