Gene Moth_1030 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1030 
SymbolhslU 
ID3832650 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1059970 
End bp1061355 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content53% 
IMG OID637828958 
ProductATP-dependent protease ATP-binding subunit HslU 
Protein accessionYP_429887 
Protein GI83589878 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1220] ATP-dependent protease HslVU (ClpYQ), ATPase subunit 
TIGRFAM ID[TIGR00390] ATP-dependent protease HslVU, ATPase subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000000255655 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000000959668 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
TTGGATTTTA CGCCCCGGCA GATAGTAGCC GAACTGGATC GCTACATTAT TGGCCAGGAA 
GAGGCGAAGA AGTGTGTGGC GGTGGCCCTG CGCAACCGTT ACCGCCGCCA GAAGCTGAAC
CCGGAGCTGC GAGATGAGGT TTTACCCAAG AATATCATTA TGATTGGGCC GACGGGCGTT
GGTAAAACGG AGATCGCCCG CCGGCTCGCC AAACTGGTGG GGGCACCCTT TTTAAAGGTG
GAAGCCACCC GCTTTACCGA AGTAGGCTAT GTGGGCCGCG ATGTTGAATC GATGATCCGG
GAACTGGTAG AAAATGCCGT GAGAATGGTT AAAATAGAGA AAAGGGCTGA AGTAGAGGCT
AAGGCCGCCA AGATGGCCGA AAAACGGTTA CTCGACCTCC TTGTTCCCAG GCAGGGTAAA
GAGAAGGGTA CCCATAACCC CTGGGAGGTT CTTTTCGGCG GTGCCCAGGT GAACAGTGAG
GGTACGGTCC TGGAGGAAGA GAGTCTGAGG GAGAAAAGGG CCATCCTCAG GGAAAAACTG
CGACGCCAGG AACTGGAAGA CATGATGGTA GAGGTCGAAG TAGAAGACAC CACGGGCCCC
GGGGGCGTCA TCCTGGGCGG GCTGGGACTG GAAGAGCTGG GTATAAACCT CCAGGATATG
CTGGGCAATA TGCTCCCCCG GCGTAAGCGC AAGCGCCTGG TAACCGTAGC CGAAGCGCGG
CGGATCCTCA CCCAGCAGGA AGCCGACAAG CTCATCGACA TGGACGATGT CGCTGCCATA
GCCGTCCAGC GGGTGGAACA GGAGGGCATT ATCTTCCTGG ACGAAATTGA TAAAATAGCT
GGCCGGGAGA GCAGCCACGG CCCTGATGTT TCCCGGGAAG GAGTCCAGCG GGATATCCTA
CCCATTGTTG AAGGAACTAC GGTGCAAACC AAATACGGCC CGGTAAAAAC AGACCATATT
CTCTTTATTG CCGCCGGCGC CTTCCACGTG GCCAAACCGG CGGACCTAAT CCCTGAACTC
CAGGGGCGTT TCCCCTTGCG GGTCGAACTT AAGAGTTTGG GTCGTGAAGA TTTTCAGCGT
ATTTTAACTG AACCCAAAAA TTCCCTGTTA AAGCAATATA CAGCATTACT TGCAGTAGAT
GGTATAGAAT TACAATTTTC AGCCGATGCT ATTGCGGAAA TTGCCGATAT TGCTTATACT
GTAAATACTC AAGGCGAAGA CATCGGTGCC CGGCGCCTGC ACACCATTCT AGAAAAAATA
CTGCAGGATC TCCTCTTTGA AGCGCCAGAG GTGCAGGAGC GTAAAGTAGT AATCGATCGC
ACCTATGTAC GTAAACAATT AGGTGACATC ATGCAGCGTA CCGATGTGCA AGCATATATA
CTCTAA
 
Protein sequence
MDFTPRQIVA ELDRYIIGQE EAKKCVAVAL RNRYRRQKLN PELRDEVLPK NIIMIGPTGV 
GKTEIARRLA KLVGAPFLKV EATRFTEVGY VGRDVESMIR ELVENAVRMV KIEKRAEVEA
KAAKMAEKRL LDLLVPRQGK EKGTHNPWEV LFGGAQVNSE GTVLEEESLR EKRAILREKL
RRQELEDMMV EVEVEDTTGP GGVILGGLGL EELGINLQDM LGNMLPRRKR KRLVTVAEAR
RILTQQEADK LIDMDDVAAI AVQRVEQEGI IFLDEIDKIA GRESSHGPDV SREGVQRDIL
PIVEGTTVQT KYGPVKTDHI LFIAAGAFHV AKPADLIPEL QGRFPLRVEL KSLGREDFQR
ILTEPKNSLL KQYTALLAVD GIELQFSADA IAEIADIAYT VNTQGEDIGA RRLHTILEKI
LQDLLFEAPE VQERKVVIDR TYVRKQLGDI MQRTDVQAYI L