Gene TM1040_2851 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2851 
SymbolhslU 
ID4076385 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp3021606 
End bp3022916 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content57% 
IMG OID638008180 
ProductATP-dependent protease ATP-binding subunit HslU 
Protein accessionYP_614845 
Protein GI99082691 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1220] ATP-dependent protease HslVU (ClpYQ), ATPase subunit 
TIGRFAM ID[TIGR00390] ATP-dependent protease HslVU, ATPase subunit 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.192992 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.735765 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGATC TGACCCCCCG CGAAATCGTT TCTGAACTCG ACCGGTTCAT CATTGGCCAA 
AAGGATGCCA AACGCGCTGT CGCTGTAGCG TTGCGCAATC GCTGGCGCCG CAAACAGCTA
CCGGATGACC TGCGCGACGA AGTGCATCCA AAGAACATCC TGATGATCGG CCCCACTGGC
GTCGGCAAGA CCGAAATTTC GCGCCGCTTG GCGAAGCTGG CGCGCGCGCC TTTCATCAAG
GTCGAAGCTA CCAAATTCAC CGAGGTTGGC TATGTCGGCC GCGACGTGGA ACAAATCGTG
CGTGATCTTG TGGATACCGC AATCGTGCAA ACCCGCGAGC ACATGCGAGA GGACGTCAAA
GCCAAGGCGC ATAAAGCCGC CGAGGACCGT GTGCTCGAAG CGATCGCCGG AACCGATGCC
CGCGAGAGCA CGCTCGAGAT GTTCCGCAAA AAGCTCAAGG CAGGTGAGCT TGATGACACG
GTGATCGAGT TGGACATCGC CGATACCTCC AACCCCATGG GCGGTATGTT TGAAATTCCG
GGTCAGCCAG GTGCAAACAT GGGGATGATG AACCTCGGTG ATCTCTTCGG AAAAGCCATG
GGCGGGCGTA CCACACGCAA AAAGCTCACC GTTGCAGAGA GCTATGACGT GTTGATCGGG
GAAGAAGCGG ACAAGCTTCT GGATGATGAA ACCGTAAACA AGGCTGCATT GGAAGCAGTA
GAGCAGAACG GGATCGTGTT CCTTGATGAG ATCGACAAGG TCTGCGCCCG TTCCGATGCG
CGTGGTGGCG ACGTCAGCCG TGAGGGCGTG CAGCGGGACT TGCTGCCGCT GATCGAAGGC
ACCACTGTCA GCACCAAACA TGGCCCAGTC AAAACCGACC ATATCCTGTT CATAGCGTCC
GGTGCGTTCC ACATCGCCAA GCCTTCTGAT CTCCTGCCCG AGCTGCAAGG ACGTTTGCCG
ATCCGCGTAA ACCTGCGCGC CCTCAGTGAA GAGGATTTTG TGCGCATCCT GACCGAAACC
GACAATGCGC TGACACGCCA GTACGAGGCG CTCTTGGGCA CAGAAAAAGT CAAAGTGACC
TTCACCAAGG ACGGGATCCA CGCCCTTGCG CAGATTGCCG CCGAAGTGAA CCACACGGTG
GAGAACATCG GCGCGCGGCG TCTCTACACG GTAATGGAGC GGGTCTTTGA GGAGATGTCC
TTTGCTGCGC CGGATCGATC CGGTGAAGAG ATCATCGTAG ATGAGCCCTT TGTGACCAAG
AATTTGGGCG AATTGACCAA ATCCACCGAT CTCAGCCGCT ACGTGCTCTG A
 
Protein sequence
MTDLTPREIV SELDRFIIGQ KDAKRAVAVA LRNRWRRKQL PDDLRDEVHP KNILMIGPTG 
VGKTEISRRL AKLARAPFIK VEATKFTEVG YVGRDVEQIV RDLVDTAIVQ TREHMREDVK
AKAHKAAEDR VLEAIAGTDA RESTLEMFRK KLKAGELDDT VIELDIADTS NPMGGMFEIP
GQPGANMGMM NLGDLFGKAM GGRTTRKKLT VAESYDVLIG EEADKLLDDE TVNKAALEAV
EQNGIVFLDE IDKVCARSDA RGGDVSREGV QRDLLPLIEG TTVSTKHGPV KTDHILFIAS
GAFHIAKPSD LLPELQGRLP IRVNLRALSE EDFVRILTET DNALTRQYEA LLGTEKVKVT
FTKDGIHALA QIAAEVNHTV ENIGARRLYT VMERVFEEMS FAAPDRSGEE IIVDEPFVTK
NLGELTKSTD LSRYVL