Gene Tmz1t_0789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_0789 
SymbolhslU 
ID7084181 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp872230 
End bp873567 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content67% 
IMG OID643697813 
ProductATP-dependent protease ATP-binding subunit HslU 
Protein accessionYP_002354454 
Protein GI217969220 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1220] ATP-dependent protease HslVU (ClpYQ), ATPase subunit 
TIGRFAM ID[TIGR00390] ATP-dependent protease HslVU, ATPase subunit 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.312061 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCAGA TGACCCCGCC GGAGATCGTC TCCGAACTCG ACAAGCACAT CGTCGGCCAG 
GACAAGGCCA AGAAGGCCGT GGCGATCGCG CTGCGCAACC GCTGGCGGCG CGCTCAGGTG
GAAGAGCCGC TGCGCAGCGA GATCACCCCC AAGAACATCC TCATGATCGG CCCCACCGGC
GTCGGCAAGA CCGAGATCGC GCGCCGCCTG GCGCGCCTGG CCAACGCGCC CTTCATCAAG
ATCGAGGCGA CCAAGTTCAC CGAGGTCGGC TATGTCGGCC GCGACGTCGA CACCATCATC
CGCGACCTCA TGGAGATCGC GGTCAAGGAC GGGCGCGAGC GTGCGATGAA GTCGGTGCGC
GACCGCGCGC TGGATGCCGC CGAGGACCGC GTGCTCGACG CCCTGCTGCC GCCGGCGCGC
CCGGTCGGCT TCAACGCCGA GCCCGAGCCG GCGCAGGATT CGTCCACCCG GCAGAAATTC
CGCAAGAAGC TGCGCGAGGG GGAACTCGAC GACAAGGAGA TCGAGATCGA GGTCGCCGCG
CCCTCGATGC AGGCCGAGAT CTTCGCCCCG CCGGGTATGG AGGAACTCAC CCAGCAGATA
CAGGGCATGT TCCAGAACCT CGGCGGCGGC AAGAAGAAGC AGCGCAAGCT GCAGATCCGC
GAGGCCATGA AGCTGCTCGC CGACGAGGAG GCCGCGCGCC TGATCAACGA CGAGGAGGTC
AAGCTCGAGG CCGTGCGCGC GGTCGAGCAG AACGGCATCG TGTTCCTCGA CGAGGTGGAC
AAGATCGCCG CGCGCAGCGA CGTGCAGGGC GCAGATGTCT CCCGTCAGGG CGTGCAGCGC
GACCTGCTGC CGCTGGTCGA GGGCACGACG ATCTCCACCA AGTACGGCAT GATCAAGACC
GATCACATCC TGTTCATCGC CAGCGGCGCC TTCCACCTGT CCAAGCCCTC GGATCTGATC
CCCGAGCTGC AGGGGCGTTT CCCGATCCGC GTCGAGCTGG AGTCGCTGTC TGTGGAGGAC
TTCGCCCGCA TCCTCACCAG CACCGACGCC TGCCTCACGC GCCAGTACGA GGCGCTGCTC
GCCACCGACG GGGTGAAGCT GGAGTTCGCC GACGACGGCA TCCGCCGCCT GGCCGAGATC
GCCTACCAGG TGAACGAGAA GACCGAGAAC ATCGGCGCGC GCCGGCTGTA CACCGTCATG
GAGAAGCTGC TCGAAGAGGT TTCCTTCGAG GCCGGGCGCA GCAGTGCGGA GCAGACCGTG
GTAGTCGACG CCGCCTATGT CGACAGCCGG CTCGTCATGC TCGCCCAGCG CGAGGATCTG
GCGCGTTACG TGCTTTGA
 
Protein sequence
MTQMTPPEIV SELDKHIVGQ DKAKKAVAIA LRNRWRRAQV EEPLRSEITP KNILMIGPTG 
VGKTEIARRL ARLANAPFIK IEATKFTEVG YVGRDVDTII RDLMEIAVKD GRERAMKSVR
DRALDAAEDR VLDALLPPAR PVGFNAEPEP AQDSSTRQKF RKKLREGELD DKEIEIEVAA
PSMQAEIFAP PGMEELTQQI QGMFQNLGGG KKKQRKLQIR EAMKLLADEE AARLINDEEV
KLEAVRAVEQ NGIVFLDEVD KIAARSDVQG ADVSRQGVQR DLLPLVEGTT ISTKYGMIKT
DHILFIASGA FHLSKPSDLI PELQGRFPIR VELESLSVED FARILTSTDA CLTRQYEALL
ATDGVKLEFA DDGIRRLAEI AYQVNEKTEN IGARRLYTVM EKLLEEVSFE AGRSSAEQTV
VVDAAYVDSR LVMLAQREDL ARYVL