Gene GYMC61_2026 details

Gene Information

Locus tag: GYMC61_2026
Symbol:
ID: 8525890
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. Y412MC61
Kingdom: Bacteria
Replicon accession: NC_013411
Strand:
Start bp: 2035991
End bp: 2037991
Gene Length: 2001 bp
Protein Length: 666 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: CheA signal transduction histidine kinase
Protein accession: YP_003253124
Protein GI: 261419442
COG category:
COG ID:
TIGRFAM ID:
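
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a single stop codon at the end of the CDS; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count, assuming the final codon is a stop codon."""
    return gene_len_bp // 3 - 1


length = gene_length(2035991, 2037991)
print(length)                  # 2001
print(protein_length(length))  # 666
```

Both values agree with the Gene Length and Protein Length fields of the record.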


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACATGA GCCAATATTT GGATTTATTT ATTGATGAAA GCAAAGAGCA TTTGCAGGCG 
ATCAACGAGC GGCTGTTGGA ACTTGAAAAA ACTCCGGAGG ACATGTCTGT GGTGAATGAT
ATTTTCCGTT CGGCCCATAC GTTGAAAGGC ATGTCCGCCA CGATGGGGTT TGAAGATTTG
GCCAATTTGA CGCACCAAAT GGAAAACGTG CTCGATGGCA TCCGCAATCG GCGGCTTTCC
GTTACCCCGG AATTGCTTGA CGTCATTTTT GAGGCCGTCG ACCATTTGGA GGCGATGATC
AGTTCCATCG CTGCAGGCGG CGACGGAACG CGCGATGTAA GGAGAACAGT CGAACAGCTG
AAACGAATCG AGCAAGGGGA GATGCCGAAC AAGCAGGCAG CAAGGGAAGA ACCGCCCCTT
GAACATGCGT ATGGGGAATT TGAATACCAT GTGCTCGAAC AGGCGAAGGA GCAGGGATTT
TCCGTCTACG AAATCCGCGT TCGGCTTCGC GATGATTGCT TGTTGAAAGC AGCGCGCGTC
TATATGGTGT TTGAACAGCT GAATGAAGTC GGAGAAATTG TGAAAGCAAC GCCGCCGGTC
GAGATGCTGG AGGAAGAACA GTTTGACCGG GAGTTTCTCG TTACGGTCGT ATCCAAAGCG
CCAGCCGATG AGTTGCAAAA GCGGCTGATG GGCATTTCGG AAATTGATGA CGTCAAGGTG
TCTATGCTAT CGAGCGATGA ACCGTCGGCA GAAAGCGAGA AAGCGGCTGC GCCCCAACAA
CCGGCCGCTA TGGAGCAGGC GGCGGCCGTT CAGGCCGAAG CGGAGGCGCC GGAAAAACAA
ACAGCGAAAC AGGCGACGAA AACGATCCGC GTCAACATTG AACGGCTCGA TCGGTTGATG
AACTTATTTG AAGAATTGGT TGTCGACCGC GGTCGGCTTG AGCAAATTTC CCGCGAGCTG
AACCATGCCG AATTGACGGA AACGGTCGAG CGGATGTCTC GCATCTCGAG CGATTTGCAG
ACGATCATTT TAAATATGCG CATGGTGCCG GTCGAAACGG TGTTCAACCG CTTCCCGCGC
ATGGTGCGCC AGCTAGCCCG CGAGCTCGGC AAAAAGGTGC GCCTTGACAT CATCGGCGCG
GATACCGAGC TTGACCGGAC GGTGATCGAT GAAATCGGCG ACCCGCTTGT CCATTTGATC
CGCAATGCGC TCGACCACGG CATCGAAGCG CCGGACGTCC GGGCGGCGCG CGGAAAACCG
GAAGAAGGGA CCGTTCAATT GCGAGCGTAC CATAGCGGCA ACCATGTCTT TATTGAAATC
GAGGATGACG GCGCCGGCAT TTCCCGGGAG AAGGTGCTGC AAAAGGCGAA AAGCCGCGGC
ATTGTCTCGC CGCAGGCGGC GGAGCATTTG AACGATCAGC AAATTTACGA GCTTATTTTC
GCTCCCGGCT TTTCGACCGC TGAGCAAGTT TCTGACATTT CCGGCCGCGG CGTCGGTTTG
GATGTCGTCA AAAGCACGAT TGAGTCGCTC GGCGGCACCG TTTCGGTCGA TTCGCAGCCT
GGAAAAGGGT CGCTCTTTTC GATTCAGCTG CCGCTCACAT TGTCGATCAT TTCTGTGTTG
CTCGTTCAAA TCGCCGAGGA AACGTACGCG ATTCCGCTGT CATCGATCAT TGAGACGGCG
CTGGTGAAAA AGGAAGAGAT TTTTTCCGCC CACAACCAGC CGGTCATCGA TTTTCGCGGC
AAAATCGTGC CGCTCGTCCG CCTGAAAGAC GTCTTCGCTG TTCCTGGAGC GGCCGATGAC
GGAGATGCGG TGGCGGTCGT GATCGTCCGG AAAGGGGAAA AACTGGCGGC GCTGGCGGTC
GACTCGTTTA TCGGGCAGCA AGAAGTCGTG TTGAAATCGC TAGGAAACTA TTTATCTTCG
GTTTTTGCCA TCTCGGGGGC GACGATTTTG GGAGACGGCC GAGTGGCGCT GATTATCGAC
TGCAACGCGC TCGTGAAGTA G
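
The gene sequence is printed in blocks of 10 bases, 60 per line, so the whitespace has to be stripped before any calculation. A minimal sketch of how a GC percentage such as the one in the record could be computed follows; the helper names are hypothetical, and the record does not say how its rounded 55% figure was derived.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence as printed in the record.
first_line = "ATGGACATGA GCCAATATTT GGATTTATTT ATTGATGAAA GCAAAGAGCA TTTGCAGGCG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full 2001 bp sequence through `clean_sequence` and then `gc_percent` gives the value to compare against the GC content field.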
 
Protein sequence
MDMSQYLDLF IDESKEHLQA INERLLELEK TPEDMSVVND IFRSAHTLKG MSATMGFEDL 
ANLTHQMENV LDGIRNRRLS VTPELLDVIF EAVDHLEAMI SSIAAGGDGT RDVRRTVEQL
KRIEQGEMPN KQAAREEPPL EHAYGEFEYH VLEQAKEQGF SVYEIRVRLR DDCLLKAARV
YMVFEQLNEV GEIVKATPPV EMLEEEQFDR EFLVTVVSKA PADELQKRLM GISEIDDVKV
SMLSSDEPSA ESEKAAAPQQ PAAMEQAAAV QAEAEAPEKQ TAKQATKTIR VNIERLDRLM
NLFEELVVDR GRLEQISREL NHAELTETVE RMSRISSDLQ TIILNMRMVP VETVFNRFPR
MVRQLARELG KKVRLDIIGA DTELDRTVID EIGDPLVHLI RNALDHGIEA PDVRAARGKP
EEGTVQLRAY HSGNHVFIEI EDDGAGISRE KVLQKAKSRG IVSPQAAEHL NDQQIYELIF
APGFSTAEQV SDISGRGVGL DVVKSTIESL GGTVSVDSQP GKGSLFSIQL PLTLSIISVL
LVQIAEETYA IPLSSIIETA LVKKEEIFSA HNQPVIDFRG KIVPLVRLKD VFAVPGAADD
GDAVAVVIVR KGEKLAALAV DSFIGQQEVV LKSLGNYLSS VFAISGATIL GDGRVALIID
CNALVK
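
The protein sequence corresponds to the gene sequence translated codon by codon. The sketch below uses the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard code (table 11 differs in the start codons it permits, which does not matter here because the gene starts with ATG). The `translate` helper is illustrative and expects only A, C, G and T.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGGACATGA GCCAATATTT GGATTTATTT"))  # MDMSQYLDLF
print(translate("TGCAACGCGC TCGTGAAGTA G"))           # CNALVK
```

The two calls use the first 30 bases and the last line of the gene sequence, and reproduce the first and last residues of the protein sequence.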