Gene GWCH70_2068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_2068 
Symbol 
ID7977303 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp2129415 
End bp2132414 
Gene Length3000 bp 
Protein Length999 aa 
Translation table11 
GC content35% 
IMG OID644798882 
ProductDNA methylase N-4/N-6 domain protein 
Protein accessionYP_002950052 
Protein GI239827428 
COG category[L] Replication, recombination and repair 
COG ID[COG2189] Adenine specific DNA methylase Mod 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGACGA AATATGAACA GCTAATCAGT TTGTTAGAAG AGATGTTTCA GTTTGATAAC 
GAAGATTTAG ATTTTGGTAT TTATCGCATT ATGAACCAAA AACGTGAAGA AATAAGAAAG
TTTCTTCATC AAGATTTACT ACCGCAAGTA AAACAGGCAT TTGAAAAGTA TCAAAGTGTT
GACCAAGTGA ATATTCGCAA AGAATTAGAG CAGCTCAAGA AAAGCTTACA AGACGCAGGA
GTTGAGCCAG AAACATCTCC AAAATATAAG GCTTTGCAAG AAAAGTTGTC TCAAAGTGTC
GACATTAGTG TGTTAGAGAA TGAAGTGTAT TCTCATTTAA TTACTTTCTT TGGCCGTTAC
TATGATAAAG GAGATTTTGT CTCTCAACGT AGATATAAGA AAGATACATA TGCTATTCCT
TATGAAGGAG AAGAAGTTAA GCTTTACTGG GCCAACGCTG ACCAATATTA TGTCAAAACT
TCTGAGTATT TTAGGGACTA TTCATTTAAG CTACCTTCTG GGAAAAAGGT ACACTTTGTT
CTAACAGAGG CATCTACAGA GCAAGACAAT AATAAAGAAC AAGAGGGAAA AGAACGTCGG
TTTATCCTAT GTGAAGAATC CCCTTTGTAT GAGGAGCAGG GGGAACTTTA TATTCGTTTT
GTATACAGAG TGGATAAAGA GAAACAAGCG GTGTTAAATC AACGGGCTAT CGAGAAGATT
TTAAGAACTG AAGGTTACAC TGATTGGATT CAAGAGTTAT GCACGTTGGC ACCAACAGAG
AAAAACAAGG AACGAACATT GCTTGAGAAG CATTTAAATG ACTATACTAC TAAAAATACT
TTTGACTATT TTATTCATAA AGACCTCGGC GGCTTTCTGA GACGTGAATT AGACTTTTAT
ATCAAGAATG AAATTATGCA CTTGGATGAT TTAGATACGG AAAACGAAGC TCAAATTGAG
CAATATTTAT CCAAAATAAA GGTTATCAAA AGTATCGGAC ATAAAATCAT TAAATTCTTA
GAGCAAATCG AAAATTTCCA GAAAAAACTT TGGTTGAAAA AGAAGTTTGT TGTGGAGACA
AATTATTGTG TCACGCTGGA CCGTGTACCA GAGGAATTAT ACCCAGAAAT TGTGCAGAAT
TTTGAGCAAA TTGAAGAGTG GAAACGTTTA TTTGCTATTG AGGATATTTC CGGTTATTGT
GAGCCATTAA CAGTAGAGTT TCTGAAATCA AATCCATATT TGGTTTTGGA TACCAAATTT
TTTGATAGGT CATTTGTTGA AAAATTATTA ATGGGAATTG ACGATATTGA AGGCCAATTA
GATGGCGTAC TCATCAGAAG TGAGAATTTC CAAGGACTTA ACCTAATAAG AAAACGTTAT
GAGAGACAGG CTAAATGTAT CTATATTGAC CCACCTTATA ATACAGGGCC TTCTGAAATT
CTTTATAAAA ATAATTTTAA ACACTCTTCA TGGCTTAGTT TAATTGAAAA TAGATTAAAT
ATATCCAAAA ACCTCCTAAA AGATAAAGGG GTTATTATTA TTGCAATAGA TGATTATGAA
CTAGTTCATT TATGTCAATT AGTAGATAAT ATACTTCCGA GTTATGAGAG AAACATTATT
GTTGTTAATC ATCATCCCCA GGGAAGTGGA GGGAAAAACA TTTCACGAAC CCATGAATAT
GCGGTAGTCT TAACTCCTAA AGGAATGGAT ATTTTAAGGG GAAGTGTGAA AGAGGATTAT
GTAGAACATA GGAGTTTTAT GCGAAGCGGT ACCGCTGAGA ATAATTTTAG ATATGGACGC
CCTAATAGTT TTTATGCAAT TTTGGTCGAT GAGGCAACCT TTGAAATTAA GGGAATAGAG
AAACCTCCCA CTGGAAGTGA TTATCCAAAG GGGAAGACAG AAGAAGGATG GGTTAGAGTA
TACCCTTTGA GTAGGGATGG AAGTGAAAGG GTATGGAGAT TATCGTATGA GGGCGCATGT
CGGGCATTAG AAAATAACGG ATTATATTGT TCACCGAACT TAACGATTTA TCAAGTGATA
AATCACAATA AAAAGAGAGT TACATTATTT AGTAATTGGA TAGATAAAAA ATACAATGCT
GGAACTCACG GGACAAATTT AATATCGGAT TTATTTGGAG TAAATGGTTT GTTTTCGTAT
CCAAAATCAT TATATACTGT TTCAGATATT GTAGATGCTT CGACATATGA TGAAGAAGGA
GCATTAATTA TTGATTATTT TGCTGGGTCT GGAACAACGG GTCATGCAGT AATATCTTTA
AATCGAGAGG ATAACGGAAA CCGAAAATAT GTGCTTATTG AAATGGGCGA ATACTTTGAT
ACAGTATTAA AACCTCGAAT ACAAAAAGTC ATTTACTCAA AAGATTGGAA AGACGGTAAA
CCTGTCTCTC GCGAGGGCAT TAGTCATATG TTCAAGTATA TTCGCCTTGA GTCTTATGAA
GATACATTAA ACAATCTCGA AATTAAACGA ACAGAGCAAC AACAATTGGC ATTGGACTTG
ATGCCAACAG AAACTCGTGA GGAGTACCTT CTATCCTATA TGCTTGATAT CGAAACGAAA
GATAGCGCTT CGCTCTTAAA TCTTGATGTA TTCGAAAATC CGTTTGATTA CAAATTAAAA
ATTTCCGACG GTACAGAAAC GACAGTAAGA ACGGTCGATG TGGTAGAAAC TTTTAATTAT
TTGCTAGGAC TTACAGTTCG GCAAATGGAA GTCGTTCAAG GATTTAAAGT GATTAAGGGA
GAATTGCCTA CTGGAGAGAG AGCACTCATT ATCTGGAGAA ATCTAAAAGA GAAGTCAAAT
GAGGATTTGG AAAGGTTCTT TACTAAAAGC AAGTATAATA CCCGTGATAA CGAGTTTGAC
TATATCTATG TTAACGGTGA CAATCACTTA GAAAACATCA AATTGCAAGG GGACATGTGG
AAAGTAAAAC TAATTGAAGA AGAATTTAAG CGTCTCATGT TTGATGTGCA AGATGTGTAA
 
Protein sequence
MKTKYEQLIS LLEEMFQFDN EDLDFGIYRI MNQKREEIRK FLHQDLLPQV KQAFEKYQSV 
DQVNIRKELE QLKKSLQDAG VEPETSPKYK ALQEKLSQSV DISVLENEVY SHLITFFGRY
YDKGDFVSQR RYKKDTYAIP YEGEEVKLYW ANADQYYVKT SEYFRDYSFK LPSGKKVHFV
LTEASTEQDN NKEQEGKERR FILCEESPLY EEQGELYIRF VYRVDKEKQA VLNQRAIEKI
LRTEGYTDWI QELCTLAPTE KNKERTLLEK HLNDYTTKNT FDYFIHKDLG GFLRRELDFY
IKNEIMHLDD LDTENEAQIE QYLSKIKVIK SIGHKIIKFL EQIENFQKKL WLKKKFVVET
NYCVTLDRVP EELYPEIVQN FEQIEEWKRL FAIEDISGYC EPLTVEFLKS NPYLVLDTKF
FDRSFVEKLL MGIDDIEGQL DGVLIRSENF QGLNLIRKRY ERQAKCIYID PPYNTGPSEI
LYKNNFKHSS WLSLIENRLN ISKNLLKDKG VIIIAIDDYE LVHLCQLVDN ILPSYERNII
VVNHHPQGSG GKNISRTHEY AVVLTPKGMD ILRGSVKEDY VEHRSFMRSG TAENNFRYGR
PNSFYAILVD EATFEIKGIE KPPTGSDYPK GKTEEGWVRV YPLSRDGSER VWRLSYEGAC
RALENNGLYC SPNLTIYQVI NHNKKRVTLF SNWIDKKYNA GTHGTNLISD LFGVNGLFSY
PKSLYTVSDI VDASTYDEEG ALIIDYFAGS GTTGHAVISL NREDNGNRKY VLIEMGEYFD
TVLKPRIQKV IYSKDWKDGK PVSREGISHM FKYIRLESYE DTLNNLEIKR TEQQQLALDL
MPTETREEYL LSYMLDIETK DSASLLNLDV FENPFDYKLK ISDGTETTVR TVDVVETFNY
LLGLTVRQME VVQGFKVIKG ELPTGERALI IWRNLKEKSN EDLERFFTKS KYNTRDNEFD
YIYVNGDNHL ENIKLQGDMW KVKLIEEEFK RLMFDVQDV