Gene GYMC61_1998 details

Gene Information

Locus tag: GYMC61_1998
Symbol: hslU
ID: 8525862
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. Y412MC61
Kingdom: Bacteria
Replicon accession: NC_013411
Strand:
Start bp: 2012187
End bp: 2013584
Gene Length: 1398 bp
Protein Length: 465 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: ATP-dependent protease ATP-binding subunit HslU
Protein accession: YP_003253096
Protein GI: 261419414
COG category:
COG ID:
TIGRFAM ID:
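
The coordinate and length fields above are related by simple arithmetic, and a short check can make that explicit. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2012187
end_bp = 2013584

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons (3 bases each).
codon_count, leftover_bases = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no amino acid.
protein_length = codon_count - 1

print(gene_length, codon_count, leftover_bases, protein_length)
```

Under these assumptions the computed values agree with the Gene Length (1398 bp) and Protein Length (465 aa) fields.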


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGGCGG AAACGTTGAC TCCACGGCAG ATTGTCGAAA AGCTCGATCA GTTCATCGTC 
GGGCAAAAAG AGGCAAAAAA GGCGGTGGCG ATCGCGCTTC GCAACCGGTA CCGCCGCAGT
TTGCTCGATG AAAAATTGCG CGATGAAGTG ATGCCGAAAA ACATTTTAAT GATTGGCCCG
ACCGGGGTCG GGAAGACGGA GATCGCCCGC CGGCTCGCCA AACTCGTCGG CGCCCCGTTC
ATCAAAGTCG AAGCGACGAA ATTCACCGAA GTCGGTTATG TCGGGCGCGA CGTCGAATCG
ATGGTGCGCG ATTTAGTGGA AACGTCAGTT AGGCTCGTGA AAGAACGAAA AATGAACGAA
GTGAAAGACA GGGCTGAACA GCAGGCGAAC AAGCGGCTTG TTGAACTGCT CGTTCCGGGC
AAGCCAAAGC AGACAATCAA AAATCCGCTT GAGCTGCTGT TTGGCGGCCA AGGAGCCCAA
GCGGACAACA GCTACAGCCA TGAGGATGAA CAGGTGGAAC AAAAGCGGCG CCAAGTTGCT
TGGCAGTTGG CAAACGGGCA GTTGGAAAAC GAGATGGTGA CGATTGAAAT CGAGGAACAG
ACGCCGTTAT GGTTTGACTT TTTGCAAGGG GCAGGCATTG AGCAGATGGG GATGAACATG
CAGGACGCCT TGAGCAGCCT CATGCCGAAG CGGCGCAAAA AGCGGCGTCT CAAAGTGAGT
GAAGCGCGCA AAGTGCTCAT CAACGAGGAA GCGCAAAAGC TGATCGACAT GGATGAAGTG
ACGCAAGAGG CCGTCCGCCT GGCTGAGCAG TCCGGCATCA TTTTTATCGA TGAAATCGAC
AAAATCGCCC GCAGCGGAGC GGTGTCCGGC TCGGCCGACG TCTCGCGCGA AGGGGTGCAG
CGCGACATTT TGCCAATTGT CGAAGGGTCG ACCGTCATGA CGAAGTACGG ACCGGTGAAA
ACAGACCATA TTTTATTCAT CGCCGCCGGC GCGTTCCATA TGGCGAAGCC GTCGGATTTG
ATCCCTGAGC TGCAAGGCCG TTTCCCGATC CGCGTCGAGC TTGCGAAACT TTCTGTCGAC
GATTTCGTAA GAATATTAGT CGAGCCGAAT AACGCGCTCA TTAAACAATA TCAAGCTCTT
TTGGCAACAG AAGGTATAAG TCTTGAATTT TCTGACGATG CTATTCGTAA GATTGCCGAG
GTGGCGTTTG AAGTAAACCA GACGACCGAC AACATCGGTG CGCGCCGGTT GCACACGATT
TTGGAAAAAC TGCTGGAAGA CTTGCTGTTT GAGGCGCCGG ACATCGGAAT CGACAAGGTC
GTCATCACAC CGCAATATGT CGAGCAAAAA CTCGGCAGCA TCGTCAAAAA CAAAGATTTA
AGCGAGTTTA TTTTATGA
 
Protein sequence
MMAETLTPRQ IVEKLDQFIV GQKEAKKAVA IALRNRYRRS LLDEKLRDEV MPKNILMIGP 
TGVGKTEIAR RLAKLVGAPF IKVEATKFTE VGYVGRDVES MVRDLVETSV RLVKERKMNE
VKDRAEQQAN KRLVELLVPG KPKQTIKNPL ELLFGGQGAQ ADNSYSHEDE QVEQKRRQVA
WQLANGQLEN EMVTIEIEEQ TPLWFDFLQG AGIEQMGMNM QDALSSLMPK RRKKRRLKVS
EARKVLINEE AQKLIDMDEV TQEAVRLAEQ SGIIFIDEID KIARSGAVSG SADVSREGVQ
RDILPIVEGS TVMTKYGPVK TDHILFIAAG AFHMAKPSDL IPELQGRFPI RVELAKLSVD
DFVRILVEPN NALIKQYQAL LATEGISLEF SDDAIRKIAE VAFEVNQTTD NIGARRLHTI
LEKLLEDLLF EAPDIGIDKV VITPQYVEQK LGSIVKNKDL SEFIL
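
The gene and protein sequences above are printed in space-separated blocks of ten, so they need light cleaning before any computation. The following is a minimal sketch of how the GC content and the translation might be recomputed from the gene sequence. The function names (`clean`, `gc_content`, `translate`) are hypothetical helpers, not part of any database API. The codon table is the standard one, which translation table 11 shares for amino acid assignments; table 11 differs mainly in which codons may act as start codons, and this sketch does not handle alternative start codons.

```python
# Standard codon table built from the conventional TCAG ordering.
# Assumption: amino acid assignments of table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the percentage of G and C bases in a DNA sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate DNA codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and the last line of the gene sequence above.
first_blocks = "ATGATGGCGG AAACGTTGAC TCCACGGCAG"
last_line = "AGCGAGTTTA TTTTATGA"

print(translate(first_blocks))  # MMAETLTPRQ
print(translate(last_line))     # SEFIL
```

The two printed fragments match the start and end of the protein sequence above. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field.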