Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_2068 |
Symbol | |
ID | 7977303 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2129415 |
End bp | 2132414 |
Gene Length | 3000 bp |
Protein Length | 999 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 644798882 |
Product | DNA methylase N-4/N-6 domain protein |
Protein accession | YP_002950052 |
Protein GI | 239827428 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG2189] Adenine specific DNA methylase Mod |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGACGA AATATGAACA GCTAATCAGT TTGTTAGAAG AGATGTTTCA GTTTGATAAC GAAGATTTAG ATTTTGGTAT TTATCGCATT ATGAACCAAA AACGTGAAGA AATAAGAAAG TTTCTTCATC AAGATTTACT ACCGCAAGTA AAACAGGCAT TTGAAAAGTA TCAAAGTGTT GACCAAGTGA ATATTCGCAA AGAATTAGAG CAGCTCAAGA AAAGCTTACA AGACGCAGGA GTTGAGCCAG AAACATCTCC AAAATATAAG GCTTTGCAAG AAAAGTTGTC TCAAAGTGTC GACATTAGTG TGTTAGAGAA TGAAGTGTAT TCTCATTTAA TTACTTTCTT TGGCCGTTAC TATGATAAAG GAGATTTTGT CTCTCAACGT AGATATAAGA AAGATACATA TGCTATTCCT TATGAAGGAG AAGAAGTTAA GCTTTACTGG GCCAACGCTG ACCAATATTA TGTCAAAACT TCTGAGTATT TTAGGGACTA TTCATTTAAG CTACCTTCTG GGAAAAAGGT ACACTTTGTT CTAACAGAGG CATCTACAGA GCAAGACAAT AATAAAGAAC AAGAGGGAAA AGAACGTCGG TTTATCCTAT GTGAAGAATC CCCTTTGTAT GAGGAGCAGG GGGAACTTTA TATTCGTTTT GTATACAGAG TGGATAAAGA GAAACAAGCG GTGTTAAATC AACGGGCTAT CGAGAAGATT TTAAGAACTG AAGGTTACAC TGATTGGATT CAAGAGTTAT GCACGTTGGC ACCAACAGAG AAAAACAAGG AACGAACATT GCTTGAGAAG CATTTAAATG ACTATACTAC TAAAAATACT TTTGACTATT TTATTCATAA AGACCTCGGC GGCTTTCTGA GACGTGAATT AGACTTTTAT ATCAAGAATG AAATTATGCA CTTGGATGAT TTAGATACGG AAAACGAAGC TCAAATTGAG CAATATTTAT CCAAAATAAA GGTTATCAAA AGTATCGGAC ATAAAATCAT TAAATTCTTA GAGCAAATCG AAAATTTCCA GAAAAAACTT TGGTTGAAAA AGAAGTTTGT TGTGGAGACA AATTATTGTG TCACGCTGGA CCGTGTACCA GAGGAATTAT ACCCAGAAAT TGTGCAGAAT TTTGAGCAAA TTGAAGAGTG GAAACGTTTA TTTGCTATTG AGGATATTTC CGGTTATTGT GAGCCATTAA CAGTAGAGTT TCTGAAATCA AATCCATATT TGGTTTTGGA TACCAAATTT TTTGATAGGT CATTTGTTGA AAAATTATTA ATGGGAATTG ACGATATTGA AGGCCAATTA GATGGCGTAC TCATCAGAAG TGAGAATTTC CAAGGACTTA ACCTAATAAG AAAACGTTAT GAGAGACAGG CTAAATGTAT CTATATTGAC CCACCTTATA ATACAGGGCC TTCTGAAATT CTTTATAAAA ATAATTTTAA ACACTCTTCA TGGCTTAGTT TAATTGAAAA TAGATTAAAT ATATCCAAAA ACCTCCTAAA AGATAAAGGG GTTATTATTA TTGCAATAGA TGATTATGAA CTAGTTCATT TATGTCAATT AGTAGATAAT ATACTTCCGA GTTATGAGAG AAACATTATT GTTGTTAATC ATCATCCCCA GGGAAGTGGA GGGAAAAACA TTTCACGAAC CCATGAATAT GCGGTAGTCT TAACTCCTAA AGGAATGGAT ATTTTAAGGG GAAGTGTGAA AGAGGATTAT GTAGAACATA GGAGTTTTAT GCGAAGCGGT ACCGCTGAGA ATAATTTTAG ATATGGACGC CCTAATAGTT TTTATGCAAT TTTGGTCGAT GAGGCAACCT TTGAAATTAA GGGAATAGAG AAACCTCCCA CTGGAAGTGA TTATCCAAAG GGGAAGACAG AAGAAGGATG GGTTAGAGTA TACCCTTTGA GTAGGGATGG AAGTGAAAGG GTATGGAGAT TATCGTATGA GGGCGCATGT CGGGCATTAG AAAATAACGG ATTATATTGT TCACCGAACT TAACGATTTA TCAAGTGATA AATCACAATA AAAAGAGAGT TACATTATTT AGTAATTGGA TAGATAAAAA ATACAATGCT GGAACTCACG GGACAAATTT AATATCGGAT TTATTTGGAG TAAATGGTTT GTTTTCGTAT CCAAAATCAT TATATACTGT TTCAGATATT GTAGATGCTT CGACATATGA TGAAGAAGGA GCATTAATTA TTGATTATTT TGCTGGGTCT GGAACAACGG GTCATGCAGT AATATCTTTA AATCGAGAGG ATAACGGAAA CCGAAAATAT GTGCTTATTG AAATGGGCGA ATACTTTGAT ACAGTATTAA AACCTCGAAT ACAAAAAGTC ATTTACTCAA AAGATTGGAA AGACGGTAAA CCTGTCTCTC GCGAGGGCAT TAGTCATATG TTCAAGTATA TTCGCCTTGA GTCTTATGAA GATACATTAA ACAATCTCGA AATTAAACGA ACAGAGCAAC AACAATTGGC ATTGGACTTG ATGCCAACAG AAACTCGTGA GGAGTACCTT CTATCCTATA TGCTTGATAT CGAAACGAAA GATAGCGCTT CGCTCTTAAA TCTTGATGTA TTCGAAAATC CGTTTGATTA CAAATTAAAA ATTTCCGACG GTACAGAAAC GACAGTAAGA ACGGTCGATG TGGTAGAAAC TTTTAATTAT TTGCTAGGAC TTACAGTTCG GCAAATGGAA GTCGTTCAAG GATTTAAAGT GATTAAGGGA GAATTGCCTA CTGGAGAGAG AGCACTCATT ATCTGGAGAA ATCTAAAAGA GAAGTCAAAT GAGGATTTGG AAAGGTTCTT TACTAAAAGC AAGTATAATA CCCGTGATAA CGAGTTTGAC TATATCTATG TTAACGGTGA CAATCACTTA GAAAACATCA AATTGCAAGG GGACATGTGG AAAGTAAAAC TAATTGAAGA AGAATTTAAG CGTCTCATGT TTGATGTGCA AGATGTGTAA
|
Protein sequence | MKTKYEQLIS LLEEMFQFDN EDLDFGIYRI MNQKREEIRK FLHQDLLPQV KQAFEKYQSV DQVNIRKELE QLKKSLQDAG VEPETSPKYK ALQEKLSQSV DISVLENEVY SHLITFFGRY YDKGDFVSQR RYKKDTYAIP YEGEEVKLYW ANADQYYVKT SEYFRDYSFK LPSGKKVHFV LTEASTEQDN NKEQEGKERR FILCEESPLY EEQGELYIRF VYRVDKEKQA VLNQRAIEKI LRTEGYTDWI QELCTLAPTE KNKERTLLEK HLNDYTTKNT FDYFIHKDLG GFLRRELDFY IKNEIMHLDD LDTENEAQIE QYLSKIKVIK SIGHKIIKFL EQIENFQKKL WLKKKFVVET NYCVTLDRVP EELYPEIVQN FEQIEEWKRL FAIEDISGYC EPLTVEFLKS NPYLVLDTKF FDRSFVEKLL MGIDDIEGQL DGVLIRSENF QGLNLIRKRY ERQAKCIYID PPYNTGPSEI LYKNNFKHSS WLSLIENRLN ISKNLLKDKG VIIIAIDDYE LVHLCQLVDN ILPSYERNII VVNHHPQGSG GKNISRTHEY AVVLTPKGMD ILRGSVKEDY VEHRSFMRSG TAENNFRYGR PNSFYAILVD EATFEIKGIE KPPTGSDYPK GKTEEGWVRV YPLSRDGSER VWRLSYEGAC RALENNGLYC SPNLTIYQVI NHNKKRVTLF SNWIDKKYNA GTHGTNLISD LFGVNGLFSY PKSLYTVSDI VDASTYDEEG ALIIDYFAGS GTTGHAVISL NREDNGNRKY VLIEMGEYFD TVLKPRIQKV IYSKDWKDGK PVSREGISHM FKYIRLESYE DTLNNLEIKR TEQQQLALDL MPTETREEYL LSYMLDIETK DSASLLNLDV FENPFDYKLK ISDGTETTVR TVDVVETFNY LLGLTVRQME VVQGFKVIKG ELPTGERALI IWRNLKEKSN EDLERFFTKS KYNTRDNEFD YIYVNGDNHL ENIKLQGDMW KVKLIEEEFK RLMFDVQDV
|
| |