Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GYMC61_1998 |
Symbol | hslU |
ID | 8525862 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. Y412MC61 |
Kingdom | Bacteria |
Replicon accession | NC_013411 |
Strand | + |
Start bp | 2012187 |
End bp | 2013584 |
Gene Length | 1398 bp |
Protein Length | 465 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | |
Product | ATP-dependent protease ATP-binding subunit HslU |
Protein accession | YP_003253096 |
Protein GI | 261419414 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGGCGG AAACGTTGAC TCCACGGCAG ATTGTCGAAA AGCTCGATCA GTTCATCGTC GGGCAAAAAG AGGCAAAAAA GGCGGTGGCG ATCGCGCTTC GCAACCGGTA CCGCCGCAGT TTGCTCGATG AAAAATTGCG CGATGAAGTG ATGCCGAAAA ACATTTTAAT GATTGGCCCG ACCGGGGTCG GGAAGACGGA GATCGCCCGC CGGCTCGCCA AACTCGTCGG CGCCCCGTTC ATCAAAGTCG AAGCGACGAA ATTCACCGAA GTCGGTTATG TCGGGCGCGA CGTCGAATCG ATGGTGCGCG ATTTAGTGGA AACGTCAGTT AGGCTCGTGA AAGAACGAAA AATGAACGAA GTGAAAGACA GGGCTGAACA GCAGGCGAAC AAGCGGCTTG TTGAACTGCT CGTTCCGGGC AAGCCAAAGC AGACAATCAA AAATCCGCTT GAGCTGCTGT TTGGCGGCCA AGGAGCCCAA GCGGACAACA GCTACAGCCA TGAGGATGAA CAGGTGGAAC AAAAGCGGCG CCAAGTTGCT TGGCAGTTGG CAAACGGGCA GTTGGAAAAC GAGATGGTGA CGATTGAAAT CGAGGAACAG ACGCCGTTAT GGTTTGACTT TTTGCAAGGG GCAGGCATTG AGCAGATGGG GATGAACATG CAGGACGCCT TGAGCAGCCT CATGCCGAAG CGGCGCAAAA AGCGGCGTCT CAAAGTGAGT GAAGCGCGCA AAGTGCTCAT CAACGAGGAA GCGCAAAAGC TGATCGACAT GGATGAAGTG ACGCAAGAGG CCGTCCGCCT GGCTGAGCAG TCCGGCATCA TTTTTATCGA TGAAATCGAC AAAATCGCCC GCAGCGGAGC GGTGTCCGGC TCGGCCGACG TCTCGCGCGA AGGGGTGCAG CGCGACATTT TGCCAATTGT CGAAGGGTCG ACCGTCATGA CGAAGTACGG ACCGGTGAAA ACAGACCATA TTTTATTCAT CGCCGCCGGC GCGTTCCATA TGGCGAAGCC GTCGGATTTG ATCCCTGAGC TGCAAGGCCG TTTCCCGATC CGCGTCGAGC TTGCGAAACT TTCTGTCGAC GATTTCGTAA GAATATTAGT CGAGCCGAAT AACGCGCTCA TTAAACAATA TCAAGCTCTT TTGGCAACAG AAGGTATAAG TCTTGAATTT TCTGACGATG CTATTCGTAA GATTGCCGAG GTGGCGTTTG AAGTAAACCA GACGACCGAC AACATCGGTG CGCGCCGGTT GCACACGATT TTGGAAAAAC TGCTGGAAGA CTTGCTGTTT GAGGCGCCGG ACATCGGAAT CGACAAGGTC GTCATCACAC CGCAATATGT CGAGCAAAAA CTCGGCAGCA TCGTCAAAAA CAAAGATTTA AGCGAGTTTA TTTTATGA
|
Protein sequence | MMAETLTPRQ IVEKLDQFIV GQKEAKKAVA IALRNRYRRS LLDEKLRDEV MPKNILMIGP TGVGKTEIAR RLAKLVGAPF IKVEATKFTE VGYVGRDVES MVRDLVETSV RLVKERKMNE VKDRAEQQAN KRLVELLVPG KPKQTIKNPL ELLFGGQGAQ ADNSYSHEDE QVEQKRRQVA WQLANGQLEN EMVTIEIEEQ TPLWFDFLQG AGIEQMGMNM QDALSSLMPK RRKKRRLKVS EARKVLINEE AQKLIDMDEV TQEAVRLAEQ SGIIFIDEID KIARSGAVSG SADVSREGVQ RDILPIVEGS TVMTKYGPVK TDHILFIAAG AFHMAKPSDL IPELQGRFPI RVELAKLSVD DFVRILVEPN NALIKQYQAL LATEGISLEF SDDAIRKIAE VAFEVNQTTD NIGARRLHTI LEKLLEDLLF EAPDIGIDKV VITPQYVEQK LGSIVKNKDL SEFIL
|
| |