Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_1106 |
Symbol | hslU |
ID | 7977593 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 1158208 |
End bp | 1159605 |
Gene Length | 1398 bp |
Protein Length | 465 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644798059 |
Product | ATP-dependent protease ATP-binding subunit HslU |
Protein accession | YP_002949232 |
Protein GI | 239826608 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1220] ATP-dependent protease HslVU (ClpYQ), ATPase subunit |
TIGRFAM ID | [TIGR00390] ATP-dependent protease HslVU, ATPase subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000000196691 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCAGAAG CATTAACGCC TCGGCAAATT GTCGAAAAGC TTGATCAATT TATCGTTGGG CAAAAGGAAG CGAAAAAAGC AGTTGCGATT GCGCTAAGAA ATCGTTATCG CCGCAGTTTG CTTGATGAAA AATTACGGGA TGAAGTTGTC CCGAAAAATA TTTTAATGAT CGGTCCGACA GGGGTCGGAA AAACGGAAAT CGCCAGACGG TTGGCGAAGT TGGTTGGCGC TCCGTTTGTC AAAGTGGAGG CAACAAAGTT TACGGAAGTT GGATATGTCG GCCGCGATGT CGAATCGATG GTGCGCGATC TTGTGGAAAC GTCCGTTCGA TTAGTAAAAG AACGGAAAAT GAACGAAGTG AAAGATCGCG CTGAACAGCA AGCGAATAAA CGGCTTGTTG AGTTATTAGT GCCGGGAAAG CAAAAACAAA CGATGAAAAA CCCTCTTGAA CTATTATTCG GTGGAGCGCA AACATCGCAG CAAGATACAT ATCAAACGTA TGAAGACGAT CACATCGAAC AAAAACGGAG ACAAGTCGCT TGGCAGCTGG CGAACGGTCA GCTGGAAGAT GAAATGGTGA CAATCGAAGT AGAAGAACAA CAGCCGATGT TTTTTGACTT TTTGCAAGGG GCCGGCATTG AGCAAATGGG AATGAATATG CAAGACGCGC TAAGCAGCCT CATTCCGAAG CGCCGCAAAA AACGCAAATT AAAAGTAAGA GAAGCGCGTA AAGTGCTTAC GAATGAGGAA GCGCAGAAGT TAATCGACAT GGATGAAGTA ACACAGGAAG CAATACGTCT TGCAGAACAG TCTGGCATCA TTTTTATCGA TGAAATCGAT AAAATTGCGC GCAGCGGGCA AGCCGCTTCT TCAGCGGATG TATCGAGAGA AGGAGTGCAA CGCGACATTT TGCCAATTGT TGAAGGCTCC ACCGTTATGA CGAAATACGG CCCTGTTAAG ACAGATCATA TATTGTTTAT CGCTGCCGGA GCATTTCATA TGGCAAAACC GTCCGACTTA ATCCCGGAGT TGCAAGGCCG GTTTCCGATT CGTGTCGAGC TAACGAAACT TTCAGTTGAC GATTTCGTAA AAATATTAGT AGAGCCTGAT AATGCTCTTA TTAAACAATA TAAGGCGCTT CTTGCGACGG AAGGTATAAA TCTTGAATTT TCTGACGATG CTATTCGTAA GATTGCCGAA GTCGCCTTTG AAGTGAATCA GACAACCGAT AATATCGGAG CAAGACGCCT TCATACGATC ATGGAAAAGC TGCTTGAAGA TTTATTGTTT GAAGCTCCGG ATATTACGCT AGATGAAGTA GTCATTACAC CTCAGTATGT CGAACAAAAA CTAGGCAACA TTGTCAAAAA CAAAGATTTA AGTGAATTCA TTTTGTAA
|
Protein sequence | MSEALTPRQI VEKLDQFIVG QKEAKKAVAI ALRNRYRRSL LDEKLRDEVV PKNILMIGPT GVGKTEIARR LAKLVGAPFV KVEATKFTEV GYVGRDVESM VRDLVETSVR LVKERKMNEV KDRAEQQANK RLVELLVPGK QKQTMKNPLE LLFGGAQTSQ QDTYQTYEDD HIEQKRRQVA WQLANGQLED EMVTIEVEEQ QPMFFDFLQG AGIEQMGMNM QDALSSLIPK RRKKRKLKVR EARKVLTNEE AQKLIDMDEV TQEAIRLAEQ SGIIFIDEID KIARSGQAAS SADVSREGVQ RDILPIVEGS TVMTKYGPVK TDHILFIAAG AFHMAKPSDL IPELQGRFPI RVELTKLSVD DFVKILVEPD NALIKQYKAL LATEGINLEF SDDAIRKIAE VAFEVNQTTD NIGARRLHTI MEKLLEDLLF EAPDITLDEV VITPQYVEQK LGNIVKNKDL SEFIL
|
| |