Gene RPB_0403 details


Gene Information

Locus tag: RPB_0403
Symbol: hslU
ID: 3908841
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 447864
End bp: 449165
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 66%
IMG OID: 637882289
Product: ATP-dependent protease ATP-binding subunit HslU
Protein accession: YP_484025
Protein GI: 86747529
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1220] ATP-dependent protease HslVU (ClpYQ), ATPase subunit
TIGRFAM ID: [TIGR00390] ATP-dependent protease HslVU, ATPase subunit
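
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the coding sequence ends with a single stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Number of amino acids, assuming one terminal stop codon that is not translated."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length = gene_length(447864, 449165)
print(length)                  # 1302
print(protein_length(length))  # 433
```

Both values agree with the Gene Length and Protein Length fields of this record.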


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.236378
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGACT TCTCACCCCG CGAAATCGTT TCCGAACTCG ACCGCTTCAT CGTCGGCCAG 
GCCGACGCCA AGCGCGCGGT CGCCATCGCG CTGCGCAACC GCTGGCGGCG GCTGCAGCTC
GAAGGCGCGC TGCGCGAAGA AGTGCTGCCG AAGAACATCC TGATGATCGG GCCGACCGGC
GTCGGCAAGA CCGAGATCGC GCGCCGGCTG GCCAAGCTCG CCAACGCACC GTTCCTCAAA
GTCGAGGCCA CCAAATTCAC CGAGGTCGGC TATGTCGGCC GCGACGTCGA GCAGATCATC
CGCGATCTGG TCGAAGTGGC GATCGCGCAG GTGCGCGAGA AGAAGCGCAA GGACGTCCAG
GCCCGCGCCC AGATCGCCGC CGAGGAACGC GTGCTCGATG CGCTGGTCGG CCCGGGATCG
AGCCCGGCGA CGCGGGACTC GTTTCGCCGC AAGCTGCGGA CCGGCGAGCT CAACGACAAG
GAAATCGAAA TCGAGACCCA GGCCGGCGGC GGCTCGCCGA TGTTCGAAAT TCCGGGCATG
CCGGGCGGCC AGATCGGCGC GATTTCGATC GGCGACATCT TCGGCAAGAT GGGTGGCCGC
ACCAAGACGC GCAGGCTCAC CGTCGTCGAT TCGCACGACA TCCTCGTCAA CGAGGAAGCC
GACAAGCTGC TCGACAATGA CCAGCTGGTG CAGGAAGCCA TCAACGCCGT CGAGAACAAC
GGCATCGTGT TTCTCGACGA GATCGACAAG ATCTGCGTGC GCGACGGCCG CAGCGGCGGC
GAGGTCTCGC GCGAGGGCGT GCAGCGCGAT CTGCTGCCGC TGATCGAAGG CACCACGGTC
GCCACCAAGC ACGGCGCGGT GAAGACCGAT CACATCCTGT TCATCGCCTC CGGCGCGTTC
CACATCGCCA AGCCGTCCGA CCTGCTGCCG GAGCTGCAGG GCCGGCTGCC GATCCGGGTC
GAGCTCAACG CACTCTCCCG CGACGACATG CGCCGGATTC TGACCGAGCC GGAAGCCTCG
CTGATCAAGC AATATGTGGC GCTGCTGCAG ACCGAAGGCG TGACGCTGGA ATTCGGCGAC
GACGCCATCG ACGCGCTCGC CGACGTCGCG GTCGCGGTCA ACTCCACCGT CGAGAACATC
GGCGCGCGGC GGCTGCAGAC GGTGATGGAG CGCGTGCTCG ACGACATCTC CTTCGGCGCG
CCGGACCGAG GCGGCGAGAC CATCCGGATC GACGCCGACT ACGTCCAGAA GAACGTCGGC
GATCTGGCGA AGAACACGGA TTTGAGCCGG TTCATCTTGT AG
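
The record reports a GC content of 66%. A minimal sketch of how such a figure could be recomputed from a sequence printed in blocks of ten bases, as above, follows. The function name `gc_content` is hypothetical, and the sketch assumes the value is simply the fraction of G and C among all bases.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G or C bases in a DNA sequence; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# Small illustrative input: 4 of the 6 bases are G or C.
print(round(gc_content("ATGC GC"), 3))  # 0.667
```

Passing the full gene sequence to `gc_content` gives a fraction that can be compared with the GC content field.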
 
Protein sequence
MTDFSPREIV SELDRFIVGQ ADAKRAVAIA LRNRWRRLQL EGALREEVLP KNILMIGPTG 
VGKTEIARRL AKLANAPFLK VEATKFTEVG YVGRDVEQII RDLVEVAIAQ VREKKRKDVQ
ARAQIAAEER VLDALVGPGS SPATRDSFRR KLRTGELNDK EIEIETQAGG GSPMFEIPGM
PGGQIGAISI GDIFGKMGGR TKTRRLTVVD SHDILVNEEA DKLLDNDQLV QEAINAVENN
GIVFLDEIDK ICVRDGRSGG EVSREGVQRD LLPLIEGTTV ATKHGAVKTD HILFIASGAF
HIAKPSDLLP ELQGRLPIRV ELNALSRDDM RRILTEPEAS LIKQYVALLQ TEGVTLEFGD
DAIDALADVA VAVNSTVENI GARRLQTVME RVLDDISFGA PDRGGETIRI DADYVQKNVG
DLAKNTDLSR FIL
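
The protein sequence can be related to the gene sequence by translating it codon by codon. The sketch below is a plain-Python illustration and not the tool used by the source database. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in their permitted start codons), and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGACCGACT TCTCACCCCG CGAAATCGTT"))  # MTDFSPREIV
# Last 12 bases, ending in the TAG stop codon.
print(translate("TTCATCTTGT AG"))  # FIL
```

The two outputs match the first ten and the last three residues of the protein sequence shown above.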