Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_1643 |
Symbol | |
ID | 7976360 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 1721130 |
End bp | 1722416 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 644798524 |
Product | restriction modification system DNA specificity domain protein |
Protein accession | YP_002949696 |
Protein GI | 239827072 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 46 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAGTTACA GTGAATGGAA AACGTATTCT CTTAAAGACA TCTGCACAGA TATTTCTTAC GGATATACAG CTAGTGCAAA AGAGGAAAAA GTAGGGCCGA AATTTCTAAG AATTACAGAT TTAAGAAATG AGTTTATTGA TTGGGAATCG GTCCCTTATT GTTCCATAAA TGAAAAGGAC TATAAAAAAT ACAAGTTGGA AATTGGAGAT TTATGTATAG CCCGTACCGG AGCAACGACC GGAATAAACA CAGTAATTGA GGAGGATGTA GATGCTGTTT TTGCTTCTTA TTTGGTTAGA TTTAAGTTGA ACAAAGAAAT AGTTGATCCT ACATTTATAA AGTATATCTT TAAATCTAAT ATGTGGTATG GATATGTTAA TTCTATTATA AGCGGTTCAG CTCAACCCGG AGCAAACGCT CAGCAAATGA GTAACTTTAA AATGAGTATT CCTGATCTGG ATGAACAAAA AAAGATAGCT TCTGTTCTAT CTGTATTAGA TAAAAAAATC GTACTAAATA ATAAAATAAA CAAAACCCTT GAAGAAATGG CTCAAGCAAT TTTCAAACGT TGGTTTGTTG ATTTCGAGTT TCCAAATGAA AACGGTAAAC CTTATAAATC AAGCGGTGGG AAGTTTGTAG AAAGTGAGTC AGGGATGATA CCAGAGGGGT GGAAAGAAGG GACTCTAGAT AACTTAGTTG TCATAAATAC TGCATCTGTC GATCCCAAAG AAAATCCTGA GATTTTATAT GAACATTATA GTATACCTGC CTTTGACGAA CAGAAATATC CCAAATTTGA ATATGGCAGA GAAATTAAGA GCAATAAATA TCTTGTCAGA CCTAACTCTT TTCTTGTATC AAAGCTTAAT CCAACAACTA AAAGAGTATG GGATCCGCTA TGTATTACTG AAAATGCAAT TTCTTCTACG GAATTTATTA ATTATCTACC AAAAGATATT TCTTATCAAT CATATTTATA TTGTATGTTA AATTCAGAAA GATTCTCAGA ACATTTAATT AAACATGCAA CAGGATCGAC AGGAAGCAGA CAGAGAGTAA AACCTGCAGA AACATTAACC TTCAATGTTA TTTTACCGGA TACAGAAACA CTAAAAAAAT TTGATAACCT TATAAGACCG ATAAGAGAGA AATTAAAAAT AAACCAAATT AATAGCGCGG TTTTAAAGGA TGTTAGAGAT ATCCTTCTCC CAAAACTGAT GTCCGGTGAA ATTCGCGTTC CTGATGCCGA GCGGGAGGTG GAGGAATGTT TACAGAAGAG CAATTAG
|
Protein sequence | MSYSEWKTYS LKDICTDISY GYTASAKEEK VGPKFLRITD LRNEFIDWES VPYCSINEKD YKKYKLEIGD LCIARTGATT GINTVIEEDV DAVFASYLVR FKLNKEIVDP TFIKYIFKSN MWYGYVNSII SGSAQPGANA QQMSNFKMSI PDLDEQKKIA SVLSVLDKKI VLNNKINKTL EEMAQAIFKR WFVDFEFPNE NGKPYKSSGG KFVESESGMI PEGWKEGTLD NLVVINTASV DPKENPEILY EHYSIPAFDE QKYPKFEYGR EIKSNKYLVR PNSFLVSKLN PTTKRVWDPL CITENAISST EFINYLPKDI SYQSYLYCML NSERFSEHLI KHATGSTGSR QRVKPAETLT FNVILPDTET LKKFDNLIRP IREKLKINQI NSAVLKDVRD ILLPKLMSGE IRVPDAEREV EECLQKSN
|
| |