Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1030 |
Symbol | hslU |
ID | 3832650 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1059970 |
End bp | 1061355 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637828958 |
Product | ATP-dependent protease ATP-binding subunit HslU |
Protein accession | YP_429887 |
Protein GI | 83589878 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1220] ATP-dependent protease HslVU (ClpYQ), ATPase subunit |
TIGRFAM ID | [TIGR00390] ATP-dependent protease HslVU, ATPase subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000000255655 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.00000000959668 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | TTGGATTTTA CGCCCCGGCA GATAGTAGCC GAACTGGATC GCTACATTAT TGGCCAGGAA GAGGCGAAGA AGTGTGTGGC GGTGGCCCTG CGCAACCGTT ACCGCCGCCA GAAGCTGAAC CCGGAGCTGC GAGATGAGGT TTTACCCAAG AATATCATTA TGATTGGGCC GACGGGCGTT GGTAAAACGG AGATCGCCCG CCGGCTCGCC AAACTGGTGG GGGCACCCTT TTTAAAGGTG GAAGCCACCC GCTTTACCGA AGTAGGCTAT GTGGGCCGCG ATGTTGAATC GATGATCCGG GAACTGGTAG AAAATGCCGT GAGAATGGTT AAAATAGAGA AAAGGGCTGA AGTAGAGGCT AAGGCCGCCA AGATGGCCGA AAAACGGTTA CTCGACCTCC TTGTTCCCAG GCAGGGTAAA GAGAAGGGTA CCCATAACCC CTGGGAGGTT CTTTTCGGCG GTGCCCAGGT GAACAGTGAG GGTACGGTCC TGGAGGAAGA GAGTCTGAGG GAGAAAAGGG CCATCCTCAG GGAAAAACTG CGACGCCAGG AACTGGAAGA CATGATGGTA GAGGTCGAAG TAGAAGACAC CACGGGCCCC GGGGGCGTCA TCCTGGGCGG GCTGGGACTG GAAGAGCTGG GTATAAACCT CCAGGATATG CTGGGCAATA TGCTCCCCCG GCGTAAGCGC AAGCGCCTGG TAACCGTAGC CGAAGCGCGG CGGATCCTCA CCCAGCAGGA AGCCGACAAG CTCATCGACA TGGACGATGT CGCTGCCATA GCCGTCCAGC GGGTGGAACA GGAGGGCATT ATCTTCCTGG ACGAAATTGA TAAAATAGCT GGCCGGGAGA GCAGCCACGG CCCTGATGTT TCCCGGGAAG GAGTCCAGCG GGATATCCTA CCCATTGTTG AAGGAACTAC GGTGCAAACC AAATACGGCC CGGTAAAAAC AGACCATATT CTCTTTATTG CCGCCGGCGC CTTCCACGTG GCCAAACCGG CGGACCTAAT CCCTGAACTC CAGGGGCGTT TCCCCTTGCG GGTCGAACTT AAGAGTTTGG GTCGTGAAGA TTTTCAGCGT ATTTTAACTG AACCCAAAAA TTCCCTGTTA AAGCAATATA CAGCATTACT TGCAGTAGAT GGTATAGAAT TACAATTTTC AGCCGATGCT ATTGCGGAAA TTGCCGATAT TGCTTATACT GTAAATACTC AAGGCGAAGA CATCGGTGCC CGGCGCCTGC ACACCATTCT AGAAAAAATA CTGCAGGATC TCCTCTTTGA AGCGCCAGAG GTGCAGGAGC GTAAAGTAGT AATCGATCGC ACCTATGTAC GTAAACAATT AGGTGACATC ATGCAGCGTA CCGATGTGCA AGCATATATA CTCTAA
|
Protein sequence | MDFTPRQIVA ELDRYIIGQE EAKKCVAVAL RNRYRRQKLN PELRDEVLPK NIIMIGPTGV GKTEIARRLA KLVGAPFLKV EATRFTEVGY VGRDVESMIR ELVENAVRMV KIEKRAEVEA KAAKMAEKRL LDLLVPRQGK EKGTHNPWEV LFGGAQVNSE GTVLEEESLR EKRAILREKL RRQELEDMMV EVEVEDTTGP GGVILGGLGL EELGINLQDM LGNMLPRRKR KRLVTVAEAR RILTQQEADK LIDMDDVAAI AVQRVEQEGI IFLDEIDKIA GRESSHGPDV SREGVQRDIL PIVEGTTVQT KYGPVKTDHI LFIAAGAFHV AKPADLIPEL QGRFPLRVEL KSLGREDFQR ILTEPKNSLL KQYTALLAVD GIELQFSADA IAEIADIAYT VNTQGEDIGA RRLHTILEKI LQDLLFEAPE VQERKVVIDR TYVRKQLGDI MQRTDVQAYI L
|
| |