Gene GYMC61_2177 details


Gene Information

Locus tag: GYMC61_2177
Symbol:
ID: 8526042
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. Y412MC61
Kingdom: Bacteria
Replicon accession: NC_013411
Strand:
Start bp: 2199740
End bp: 2202985
Gene Length: 3246 bp
Protein Length: 1081 aa
Translation table: 11
GC content: 47%
IMG OID:
Product: helicase domain protein
Protein accession: YP_003253272
Protein GI: 261419590
COG category:
COG ID:
TIGRFAM ID:
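
The reported lengths can be cross-checked against the coordinates. The sketch below is a minimal consistency check, not part of the original record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2199740
end_bp = 2202985
reported_gene_length = 3246      # bp
reported_protein_length = 1081   # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print("gene length:", gene_length, "matches:", gene_length == reported_gene_length)
print("protein length:", protein_length, "matches:", protein_length == reported_protein_length)
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.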


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGACAA AAGGGATGAT TGACAACAGA CAAAACGGCC TAGTCGGCGA TGCGTTGAAA 
GCGCATATCA CAAAAGGCAG CAAACTATCG ATTGCAGCGG CCCATTTTAC GTTATATGCG
TTTGCAGAGT TGAAAAAAGA ACTGGCGCAA ATCGATGAAT TTCGCTTTAT TTTCACAGAG
CCTGCCTTTG TGCGCGGTGA TCGTTCATTC AATGAATGGA TGAAGAAAAA CGAAGCGACG
CTGTATGGAG TAGAAGAAGA ACGAAAATAC AAAATCGAAC TTAACCAAGC ATACATAGCA
AGGGAACTGG CGAAATGGCT GAAACAGAAA GCAAGGGTGA AATCGGTCGT CAATCAGCGC
ATTCAAGGCA GCATATATCA TGTGCGAAAT GAGGATGGGT CACAAATCGG CTTGATCGGC
GGGGCGCCGT TTTCCAGTCC GGGGCTCGGC TACAGCAGCT CCCCCCATTT TTATTTTGCT
CATATAATGG ATGATCGAGA TTCCAGCGCC CAGCTGCTTC GCCAGTTTGA GGCGATTTGG
CAAGATGAAC AGGCGCTGCA AGATGTGAAA GATGAAATTT TGCGGCGCTT GGAAGTCATG
TATCAAGAAC ATCCGCCAGA GTTTATTTAC TTTGTCACAC TTTATTACTT ATTCAAAGAT
TTTTTAAAAG AAGCGACAAA CTACGAAACG TTGCAGACGA GAACTGGTTT CCAAAATACG
GTGATTTGGA ACAAACTTTA TGACTTTCAG CGCGATGGGG TCCTTGGGGC TATCAATAAA
ATTGAAACGT ACGGGGGCTG CATCATCGCC GACAGTGTCG GGCTTGGAAA AACGTTCGAA
GCGTTGGCGG TGATCAAATA TTACGAATTG CGCAACCATC GCGTTCTCGT ATTGGCGCCG
AAAAAATTGC GGGAAAACTG GGCCATCTAC CGTTTAAATG ATAAGCGAAA CATTTTAGCC
GCAGACCGCT TTTCGTATGA CTTGCTCAAC CATACCGACT TATCGCGTGA GCGCGGCTAC
AGCGGCGATA TTAATTTGGA ACATGTGAAT TGGGGAAATT ATGATTTAGT TGTTATCGAC
GAGTCGCACA ACTTTCGCAA CAACGATCCG CGCAATGACC GGGTGACGAG ATATTCCCGC
TTGATGAACG ATATTATGAA AGCGGGGGTG AAAACGAAGG TATTGATGCT TTCCGCCACC
CCCGTCAACA ACAAGCTGGA TGACTTGAAA AATCAAATTG CCTTTATTAC AGAAGGAAAT
GACAAGGCAT TGGCGGAAAC GGCAAACATC AAAAGCATCA GCCAGACGAT TCGCCGCGCC
CAGTCGCAGT TTAACAAATG GAGCCAGCTT CCCGAAGAAG AGCGGACGAC CGAGCGGTTG
TTGGATATGC TCGATTGGGA TTATTTTGCT TTGCTTGATT CGCTGACAAT CGCCCGTTCG
CGCCGCCATA TCGAAAAATA TTACAATGTG GATGCCATCG GGCCGTTTCC CACGCGTCTA
AAGCCGATCA ATCTGAAAGA AAAAATCGAC AGCAAAGATG AGTTTCCGCC GCTTGAGAAA
ATCAATAACG ACATCTTGCG GCTTCGGATG GCGGTCTATT CGCCGATGCA ATACATTTTG
CCGAACAAAA AAGCGGACTA TAGCGAAAAA TACGACACAA AGGTCGCAAA CGGCAGAGTG
TTTAAGCAAA CCGACCGCGA GCACAACCTT GTTTACTTAA TGAAGTCGAA TTTGCTGAAG
CGGTTGGAAA GCAGCGTTCA TTCGTTTTCT TTGACATTGC AAAACATTAT CCAGCAAATC
GACGAACATG TTGAGAAAAT CAACCGTTGC GATGGAAGTT CCGGGACAAT TGGAGAGATG
GACGGGATCG ACGCGGAAGA TCCGGAATTG GAAGAGGCGT TGATCGGATC GAAAGTCAAA
ATCTTTTTGA AAGACATGGA TTTGATCCGT TGGAAACAAG ATTTGTTATA CGACCGGGAG
ATTTTGCTGA GGCTGCTTCA CCAGGCGCGG AACGTAACGC CGGATCGCGA TCAGAAGCTG
TTGGCGTTGA AACGATTGAT CGAACATAAA ATCAAGCACC CGATTAACGA TCAGAACAAA
AAACTGCTCA TTTTCACCGC TTTTGCCGAT ACGGCGAAAT ACTTATACGA GAATTTGCAT
CAATGGGTGC AAAACCAATT TGGCTTGCAC TGCGCGGTCG TGACGGGCGC TGATCGGCCG
AAAACAACAT TGAAGATGAA GAAAGTCGAT TTCAATCATG TGTTGATGAA CTTTTCGCCC
GTCTCGAAAG AGCGGGGTAA AGTGATGCCG GAAATGAAAG ACGAAATCGA CATCCTCATC
GCCACGGACT GTATTTCCGA AGGACAGAAC TTGCAAGACT GCGATTATCT TGTCAACTAT
GACATCCATT GGAACCCGGT CCGCATCATC CAGCGGTTCG GCCGCATCGA CCGGATCGGC
AGCAAAAACA AACAGATCCA GCTCGTAAAC TTCTGGCCAT CAATCGAGCT TGACGAATAC
ATTCAGCTCG TCAATCGGGT CAAAGGCCGA ATGACGATCC TGGATATTTC ATCGACCGGG
GAAGAAAACG TCATTGCCGA CAACAGCAAC GAAATGAATG ACTTGGAATA CCGCCGCAAA
CAGCTGGAAA AATTGCAAAA TGAAGTCATC GACTTAGAGG ACATTTCCGG AAATATCTCG
CTGACGGATT TTACACTCGA CGACTTCCGA ATGGACTTGT TGAATTTCAT GAAGGAACAT
AAGGAAGAGC TCGAGCGGGC CCCGCTCGGA CTGTTCAGCA TTGTCGCGAA TCAAAATGAA
AAACTGAAAG ACGAAATTCA GCCGGGCGTC ATTTTCTGCC TGAAACAAAC GGCGCCGATG
GCATCGGCGC ATGAACAAAA CGCGCTTTAT CCGTATTACT TAGTGTACGT GCGGGAAGAC
GGCACGGTGT TGTATAACCA TGTCCACGTA AAGAAAGTGT TGGATTTATA CCGGTCGCTT
TGCAACGGCA AAAAAGACGT TGAATGGGGC TTGTACAAGG CGTTTTATCA AGAGACGAAA
AACGGAAAAG ACATGGGGCA ATACAAGGCG CTGCTGGAAA AAGCGGTGGA AGAGATCGTC
GGCAAGATGG ACCAGCAATT GATGCTCAAC ATCTTCAGCC TTGGGAACTT GGACGCCTTC
GTGACGAACG CCAACACGAG TTTGCAAGAT TTTGAGATCG TTTCGTATTT GATCATTAAA
GGGTGA
 
Protein sequence
MKTKGMIDNR QNGLVGDALK AHITKGSKLS IAAAHFTLYA FAELKKELAQ IDEFRFIFTE 
PAFVRGDRSF NEWMKKNEAT LYGVEEERKY KIELNQAYIA RELAKWLKQK ARVKSVVNQR
IQGSIYHVRN EDGSQIGLIG GAPFSSPGLG YSSSPHFYFA HIMDDRDSSA QLLRQFEAIW
QDEQALQDVK DEILRRLEVM YQEHPPEFIY FVTLYYLFKD FLKEATNYET LQTRTGFQNT
VIWNKLYDFQ RDGVLGAINK IETYGGCIIA DSVGLGKTFE ALAVIKYYEL RNHRVLVLAP
KKLRENWAIY RLNDKRNILA ADRFSYDLLN HTDLSRERGY SGDINLEHVN WGNYDLVVID
ESHNFRNNDP RNDRVTRYSR LMNDIMKAGV KTKVLMLSAT PVNNKLDDLK NQIAFITEGN
DKALAETANI KSISQTIRRA QSQFNKWSQL PEEERTTERL LDMLDWDYFA LLDSLTIARS
RRHIEKYYNV DAIGPFPTRL KPINLKEKID SKDEFPPLEK INNDILRLRM AVYSPMQYIL
PNKKADYSEK YDTKVANGRV FKQTDREHNL VYLMKSNLLK RLESSVHSFS LTLQNIIQQI
DEHVEKINRC DGSSGTIGEM DGIDAEDPEL EEALIGSKVK IFLKDMDLIR WKQDLLYDRE
ILLRLLHQAR NVTPDRDQKL LALKRLIEHK IKHPINDQNK KLLIFTAFAD TAKYLYENLH
QWVQNQFGLH CAVVTGADRP KTTLKMKKVD FNHVLMNFSP VSKERGKVMP EMKDEIDILI
ATDCISEGQN LQDCDYLVNY DIHWNPVRII QRFGRIDRIG SKNKQIQLVN FWPSIELDEY
IQLVNRVKGR MTILDISSTG EENVIADNSN EMNDLEYRRK QLEKLQNEVI DLEDISGNIS
LTDFTLDDFR MDLLNFMKEH KEELERAPLG LFSIVANQNE KLKDEIQPGV IFCLKQTAPM
ASAHEQNALY PYYLVYVRED GTVLYNHVHV KKVLDLYRSL CNGKKDVEWG LYKAFYQETK
NGKDMGQYKA LLEKAVEEIV GKMDQQLMLN IFSLGNLDAF VTNANTSLQD FEIVSYLIIK
G
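
The relationship between the two sequences above can be illustrated with a short translation sketch. It is a minimal example, not the tool that produced this record. It assumes the gene sequence is shown in the coding (5' to 3') orientation, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ only in permitted start codons). The function name `translate` and the helper constants are hypothetical names chosen for illustration. Only the first 60 bases and the last 6 bases of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
# Amino acid assignments are those of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the Sequence section.
first_60 = "ATGAAGACAA AAGGGATGAT TGACAACAGA CAAAACGGCC TAGTCGGCGA TGCGTTGAAA"
# Last line of the gene sequence.
last_6 = "GGGTGA"

print(translate(first_60))
print(translate(last_6))
```

The first call yields the first 20 residues of the protein sequence (MKTKGMIDNR QNGLVGDALK without the spacing), and the second yields the final residue G followed by the stop codon TGA, which is why the protein is one codon shorter than the gene.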