Gene GYMC61_2820 details

Gene Information

Locus tag: GYMC61_2820
Symbol:
ID: 8526697
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. Y412MC61
Kingdom: Bacteria
Replicon accession: NC_013411
Strand:
Start bp: 2879756
End bp: 2881612
Gene Length: 1857 bp
Protein Length: 618 aa
Translation table: 11
GC content: 58%
IMG OID:
Product: squalene/oxidosqualene cyclase
Protein accession: YP_003253881
Protein GI: 261420199
COG category:
COG ID:
TIGRFAM ID:
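
The coordinates and lengths in the table above are tied together by simple arithmetic: the gene length is the inclusive span from start to end, and the protein length is that span divided by three, minus the stop codon. A minimal sketch of this consistency check follows. It assumes 1-based inclusive coordinates and a complete CDS ending in a single stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
START_BP = 2879756
END_BP = 2881612


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of an inclusive, 1-based coordinate span (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming a complete CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1857
print(protein_length(length_bp))  # 618
```

Both results agree with the Gene Length and Protein Length fields listed above.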


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTGGCTG ATGAACGAAG TGCGCTCATC GATGCGCTCA AACGGTCGCA AAGCGTCGAC 
GGATCGTGGC GGTTTCCGTT TGAAACCGGC ATTTCCACCG ATGCCTATAT GATCATTTTA
CTGCGGACGC TCGGAATACA TGATGAACCG TTGATCCAGG CGCTCGTCGA GCGGATCGAA
AGCCGGCAGG ACGCGAACGG GGCGTGGAAG CTGTTTGCCG ATGAAGGCGA TGGCAATGTG
ACAGCGACAG TTGAGGCGTA TTATGCCTTG CTTTACTCTG GATATCGAAA AAAAACCGAT
TCGCATATGC AAAAGGCGAA AGCGCGCATT TTGGAAGTGG GCGGTTTAGA ACGCGTCCAC
TTGTTTACGA AAGTGATGCT CGCATTGACC GGACAGCACT CGTGGCCAAG ACGGTTTCCG
CTGCCGCTTG TCTTTTTCCT TCTCCCCCCG TCGTTTCCGC TCAATATGTA TGACCTATCT
GTATACGGAA GGGCGAACAT GGTCCCGCTT CTTGTCGTCG CGGAGCGCCG CTACAGCCGG
AAAACGGACA ACAGTCCGGA TCTTTCCGAT TTGGCCGCTT CCCGCAATGA TTGGCGGCTG
CCGGACACCG AGGCGCTATG GTCGTACGTG AAGCGGTCGC TCACCGGACT TCCCGCTTGG
CTGCATCGTG CCGCCGAACA GCGCGCCGTC CGCTATATGT TGGAGCATAT CGAGCCGGAC
GGAACGCTGT ACAGCTATTT CAGCTCGACG TTTTTGTTGA TTTTTGCGCT GCTGGCGCTT
GGTTATCCAA AAGACGACCC GCATATCGCC CGGGCTGTTC GCGGTTTGCG CTCACTGCGA
ACCGAAATCG ATGGGCATAC GCATATGCAA TATACAACCG CTTCCGTCTG GAATACGGCG
TTGGCGAGCT ATGCGCTGCA GGAAGCGGGC GTGCCGCCGA CCGACCGGAC GATTGAGAAA
GCGAACCGCT ATTTGTTGTC GCGCCAGCAC ATTCGCTACG GCGACTGGGC GGTGCACAAC
CCGTACGGCG TACCGGGCGG CTGGGGATTT TCCGATGTGA ATACGATGAA TCCGGACGTC
GACGATACAA CGGCCGCGCT GCGCGCCATC CGCCGGGCGG CAGCGAAAGA GACGGCGTTT
CGCCATGCAT GGGACCGGGC GAATCGATGG CTGTTTTCGA TGCAAAACGA TGACGGCGGG
TTTGCGGCGT TTGAAAAGAA CGTAGGCAAA CGGTTTTGGC GGTATTTGCC GATCGAAGGG
GCGGAGTTTT TATTGATGGA TCCGTCAACA GCCGATTTGA CCGGACGGAC GCTCGAATAT
TTCGGAACGT TCGCTGGATT AACGAAAGAC CACTCCGCCA TCGCCCGCGC CATCGACTGG
CTGCTTGACC ATCAGGAAGC CGACGGTTCG TGGTATGGGC GCTGGGGGAT TTGCTATGTG
TACGGCACAT GGGCGGCGGT GACCGGGCTC TCAGCCGTCG GCGTTCCAAT CGATCACCCG
GCGATGCAAA AAGCGGTCCG TTGGTTGTTG AGCATCCAAA ACGATGACGG CGGCTGGGGT
GAATCGTGCA AAAGCGACGG AGCCAAGACG TATGTGCCGC TTGGCGCCAG CACGCCCGTC
CATACCGCTT GGGCGCTCGA TGCACTGATC GCTGCCGCCG AGCGGCCGAC CCCGGAAATG
AAAGCCGGCG TTCGCGCCCT AGTCCGTATG CTTCATCACC CGGATTGGAC CGCCTCGTAC
CCGGTCGGAC AAGGGATGGC CGGCGCCTTT TACATCCATT ACCATGGCTA CCGCTACATT
TTTCCGCTGT TGGCGCTCGC CCATTACGAG CAAAAGTTCG GACCGTTTGT GGATTAG
 
Protein sequence
MVADERSALI DALKRSQSVD GSWRFPFETG ISTDAYMIIL LRTLGIHDEP LIQALVERIE 
SRQDANGAWK LFADEGDGNV TATVEAYYAL LYSGYRKKTD SHMQKAKARI LEVGGLERVH
LFTKVMLALT GQHSWPRRFP LPLVFFLLPP SFPLNMYDLS VYGRANMVPL LVVAERRYSR
KTDNSPDLSD LAASRNDWRL PDTEALWSYV KRSLTGLPAW LHRAAEQRAV RYMLEHIEPD
GTLYSYFSST FLLIFALLAL GYPKDDPHIA RAVRGLRSLR TEIDGHTHMQ YTTASVWNTA
LASYALQEAG VPPTDRTIEK ANRYLLSRQH IRYGDWAVHN PYGVPGGWGF SDVNTMNPDV
DDTTAALRAI RRAAAKETAF RHAWDRANRW LFSMQNDDGG FAAFEKNVGK RFWRYLPIEG
AEFLLMDPST ADLTGRTLEY FGTFAGLTKD HSAIARAIDW LLDHQEADGS WYGRWGICYV
YGTWAAVTGL SAVGVPIDHP AMQKAVRWLL SIQNDDGGWG ESCKSDGAKT YVPLGASTPV
HTAWALDALI AAAERPTPEM KAGVRALVRM LHHPDWTASY PVGQGMAGAF YIHYHGYRYI
FPLLALAHYE QKFGPFVD
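
The protein sequence is the translation of the gene sequence under translation table 11. The sketch below shows one way to strip the 10-base block spacing, translate a sequence, and compute its GC fraction. It is an illustration rather than the database's own procedure: table 11 uses the same codon assignments as the standard code and differs in which start codons are permitted, so the standard codon table is used here and alternative start codons are not handled. All function names are hypothetical.

```python
# Standard codon assignments in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence listed above.
first_bases = "ATGGTGGCTG ATGAACGAAG TGCGCTCATC"
print(translate(first_bases))  # MVADERSALI
```

The printed residues match the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the reported GC content.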