Gene Information |
Locus tag | GWCH70_1299 |
Symbol | |
ID | 7976078 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 1357982 |
End bp | 1361416 |
Gene Length | 3435 bp |
Protein Length | 1144 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 644798241 |
Product | Eco57I restriction endonuclease |
Protein accession | YP_002949414 |
Protein GI | 239826790 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
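The coordinate, gene length, and protein length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and do not come from the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1357982
end_bp = 1361416
protein_length_aa = 1144

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of 3 bases. Assumption: the last codon is a stop
# codon and therefore contributes no residue to the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(gene_length_bp)       # 3435
print(remainder)            # 0
print(expected_protein_aa)  # 1144
```

Under these assumptions the computed values agree with the listed Gene Length (3435 bp) and Protein Length (1144 aa).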
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0387167 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAAAC GAGCATTAAA AGAATTTGCT GTTTATGCAC GCAACGAACT ACGCAATCAA ATCGCCTTGC GAGCACAAGC CTTTGGGATT ACTCCCGAAG GCTCTCCTAC GCTTGTAACA GGTGCGGATT ATGTTGAAAT CAACGGCAAA AAACTGCCAT TATCTTATAA AAGCGGTCTA CAAAAACTGT TAAAAGAAGT TGAAACAAAA GGCTATGACC AAGTGATTGA AGAAGTCGCG TACACATGGT TTAACCGCTT AATTGCAATT CGCTTCATGG AAGTGCATAA CTATTTGCCA TCAAAAATTC GCGTTTTATC AAGCGAAACG AGAGGAAAAG TGGACCCGGA TATTTTAACC GATTACCAAT ATACGGACTT GCCAGTCGAT AAAGAGGAGA TTGCATCTCT TTTACAACAA GGAAAGCGTG AGGAAGCGTA CCGGAAGTTA TTGATTGCCC AGTGTAATGA ATTGCATCAA ATTATGGATT TTCTTTTCGA AAAGATAGCA GACTATACAG AACTACTATT GCCAGAGTCG TTGCTACATG CGGATTCGTT GATTAACAAA CTAGGAAAAG AATTAGGAGA CGAAAACTTC GAACATGTAG AAGTCATTGG CTGGTTGTAT CAATATTATA TGTCTGAAAA AAAGGAACAA GTAGGCGGAT TGAAAAACAC AGCTGTCAAA AAAGAAGATT TACCTGTTGT TACCCAACTT TTTACGCCAA AATGGATTGT GCAATATATG GTGCAAAATT CATTAGGAAA GCTATATGAT GAATGGAAGC CGGAAAATCA TCTTGTAAAA GATTGGGAGT ACTATTTGAA ATCATCCGAG AAGCTCCCAA TACCAGAAAA CATCTCTTTA GAAGAAATAA AGGTCATCGA TCCTGCTTGT GGCTCAGGGC ACATTTTAGT TTATGCATTT GATTTGTTGT ATGACATGTA TTTAGAAGCG GGATATCCAG AACGCGAAAT TCCTAGACTC ATTATTGAAA AAAATCTATA TGGATTAGAT ATAGATAAAC GTGCTGTTCA ACTGGCAAGC TTTGCGTTAA TGATGAAAGG ACAAGAAAAA TACCGCCGTT TTCTAAAAAA GGCAACAGAT TTAAAACTAA ATATTCATGA GTTTGTAGAT AGTGAGCCTA TCTCAGAAGA AGTACTCGCT TTCCTAGGAG AAAAAGTAGG AGATGTCAGT TGGGTTGCCG CTTTACAAGA GAAGTTTGAA AATGCAAAAC AGTTTGGATC GTTACTTGTT CCAGACGAAC AGGCATCGTT TTATTTAAAG TATATTGAAG CAATCGAATC CTATGACGTT AATGAAGTAG AACTATTAGA AGAAACATAT ATTATTGAGT TAAAAGAAAA ACTTCTTCCG CTATTAAAAC AGGCGTATTT ATTGGCATTA AAATATGAAG TCGTTGTAAC GAACCCACCT TATCATAATA AATACAATCC TGTATTAAAG AAGTTTATGA ACGATAACTA TAAAGATTAT AAATCAGACT TATACTCTGC TTTTATCTAT CGCTGCACGC AAATGACGGT TGAAAATGGG TTTGCAGCAC TTATGACGCC ATTTACATGG ATGTTTATTT CGTCCCATGA AAAACTAAGA AAGTATATTA TTGAGAACCA AAGTATTTCA AGTTTAATTC AGCTCGAATA CTCAGCTTTT ACCGAAGCAA CTGTTCCGAT TTGTACATTT GTAATTCAGA ATCAAAATCG GACATCCATT GGGGAATATA TCAGATTAGA AGAATTTAAA GGTGCTGACT TACAGCCAAT AAAGGTAAAA 
GAAGCAGTAA AAAATAATGT AGACTATCGT TATTCTTGTG ATAGTAAAAG CTTTAATGCA ATTCCAGGTT CCCCGATTGC GTATTGGCTG AACAAAAAAG CAAGAGATGG GTTCCAATAT GTAATTGGTA ACAATTATTC AGCGGTAGCT GGTATATCAA CAGGAAATAA TAATCGATAT ATATTTGATT GGTATGAAGT AGACTATTCA CAAATCTCTT TTTCTAAAAA AGGAGGAATT GATTGTAAAA AATATTATCC GCATGCGAAA GGTGGAGACT ATCGTAGGTG GTATGGTAAC CGCATAAATG TTATTAAGTA TAATGAGCAA TCCATTAAGG AAATGAGTAG CTTGCCCGGT TTTCGACATG ATGGAAGAAT GCACTACTTT AAGCACCTAA TATCTTGGAG CAAAATTACT AGCTCTATCT TTAGTGCTAG ATATTATGAG CCCATGTTTG TATTCGATAG TGCTGCGCCT AGTATTTCTA TTGATAAAAT AGATTATAAT TTGCTTGGAT ATCTTAATTC GAAGGTAGCA TATTATTTTA TGAAAGTAAT AAATCCTACT TTGAATTATC CTCCAGGTTA TATGGAGTTA TTGCCTTTTA GTCATTCGGA AAATAAGGTA ATTAATAGTA TAGTATCTGA AAACATTGAA ATTTGCATAA ATGAATGGAA TTCATTCGAA ACCTCATGGG ACTTCAAAAA GCATCCATTC CTCACATACC GCGGAAACGC AAAGACATTA GAGGAATGCT ACGCAAACTG GGCGGACCAT GCCGAAAAGC AGTTCCGTCA ACTGCAGAAA AATGAGGAAG AACTGAATCG TATTTTTATT GAATTGTACG GTTTGCAAGA TGAACTAACG CCAGAAGTGC CGGATGAGGA AGTAACCGTT CGCCGCGCGG ATCGTGTCCG TGATGCGAAG TCGTTCTTGT CTTACTGTGT TGGTTTAATG ATGGGGCGTT ATTCGTTGGA TGTCGAAGGT CTTGCTTATG CAGGCGGTGA GTGGGACGCA TCGAAGTATA AAACGTTCCA GCCCGATAAA GACGGCATCA TTCCATTGAC GGAAACAGCG TATTTTGAAG ATGATGTCAT CAGCCGCTTG CAAGAGCTAT TAATCATTAT GTTCGGTGAA GAGACGTTAG CGGAGAATCT TCGTTGGCTA GCGGAGTCAT TAACGATGAA AAACAACGAA ACACCGACAG AGCGTTTGCG TCGTTATTTC TTTGACGAGT TCTACGCAGA TCATTGCAAA ATTTATCAAA AGCGGCCGAT TTACTGGATG GCAGAATCAG GGCCGAAAAA AGCATTCCGC GCATTATTCT ACTTGCACCG TTACACGCCA GAAACACTAG CAACAATGCG CTTTACGTAC GTACAAAACT TACAAGAAAA ACTTCGTCAA GAGCAAAAAC GCCTCGAGCA AGACCTTATC AATCCGGATC TATCTTCTGC GATGAAGAAG CGTTATGAGA AGCAATTGAA ACAAATTAGA GCCCAGCAGG AAGAGCTCGT TGAATTCGAT AAAAAATTAG CGGAACTTGC CAACCAACGC ATTGCCTTAG ATTTAGATGA CGGAGTGGTT GTGAACTACG AGAAATTAAA AACGATCCTT GCGAAAATAA GATAA
|
Protein sequence | MNKRALKEFA VYARNELRNQ IALRAQAFGI TPEGSPTLVT GADYVEINGK KLPLSYKSGL QKLLKEVETK GYDQVIEEVA YTWFNRLIAI RFMEVHNYLP SKIRVLSSET RGKVDPDILT DYQYTDLPVD KEEIASLLQQ GKREEAYRKL LIAQCNELHQ IMDFLFEKIA DYTELLLPES LLHADSLINK LGKELGDENF EHVEVIGWLY QYYMSEKKEQ VGGLKNTAVK KEDLPVVTQL FTPKWIVQYM VQNSLGKLYD EWKPENHLVK DWEYYLKSSE KLPIPENISL EEIKVIDPAC GSGHILVYAF DLLYDMYLEA GYPEREIPRL IIEKNLYGLD IDKRAVQLAS FALMMKGQEK YRRFLKKATD LKLNIHEFVD SEPISEEVLA FLGEKVGDVS WVAALQEKFE NAKQFGSLLV PDEQASFYLK YIEAIESYDV NEVELLEETY IIELKEKLLP LLKQAYLLAL KYEVVVTNPP YHNKYNPVLK KFMNDNYKDY KSDLYSAFIY RCTQMTVENG FAALMTPFTW MFISSHEKLR KYIIENQSIS SLIQLEYSAF TEATVPICTF VIQNQNRTSI GEYIRLEEFK GADLQPIKVK EAVKNNVDYR YSCDSKSFNA IPGSPIAYWL NKKARDGFQY VIGNNYSAVA GISTGNNNRY IFDWYEVDYS QISFSKKGGI DCKKYYPHAK GGDYRRWYGN RINVIKYNEQ SIKEMSSLPG FRHDGRMHYF KHLISWSKIT SSIFSARYYE PMFVFDSAAP SISIDKIDYN LLGYLNSKVA YYFMKVINPT LNYPPGYMEL LPFSHSENKV INSIVSENIE ICINEWNSFE TSWDFKKHPF LTYRGNAKTL EECYANWADH AEKQFRQLQK NEEELNRIFI ELYGLQDELT PEVPDEEVTV RRADRVRDAK SFLSYCVGLM MGRYSLDVEG LAYAGGEWDA SKYKTFQPDK DGIIPLTETA YFEDDVISRL QELLIIMFGE ETLAENLRWL AESLTMKNNE TPTERLRRYF FDEFYADHCK IYQKRPIYWM AESGPKKAFR ALFYLHRYTP ETLATMRFTY VQNLQEKLRQ EQKRLEQDLI NPDLSSAMKK RYEKQLKQIR AQQEELVEFD KKLAELANQR IALDLDDGVV VNYEKLKTIL AKIR
|
| |
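As a spot check on the Sequence fields, the start and end of the gene sequence can be translated and compared with the start and end of the protein sequence. This is a minimal sketch, not the method used to produce the record: the `translate` helper and the codon table construction are illustrative, and the table relies on translation table 11 assigning the same amino acids to codons as the standard code (it differs in which codons may initiate translation). Only the first three and last two blocks of the gene sequence are used; the last two blocks are in frame because they begin at position 3421 of the 3435 bp gene.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA fragment, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three and last two blocks of the gene sequence in this record.
first_blocks = "ATGAACAAAC GAGCATTAAA AGAATTTGCT"
last_blocks = "GCGAAAATAA GATAA"

print(translate(first_blocks))  # MNKRALKEFA
print(translate(last_blocks))   # AKIR
```

The two outputs match the first ten and last four residues of the listed protein sequence.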