Gene Information |
Locus tag | GWCH70_3443 |
Symbol | |
ID | 7979514 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012794 |
Strand | + |
Start bp | 6885 |
End bp | 8222 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 28% |
IMG OID | 644800205 |
Product | restriction modification system DNA specificity domain protein |
Protein accession | YP_002951344 |
Protein GI | 239828721 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
| |
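The coordinate and length fields above are consistent with each other, and the arithmetic can be checked with a minimal sketch. It assumes 1-based, inclusive coordinates and an unspliced CDS whose last codon is a stop codon (the record lists the gene as not spliced). The function names `gene_length` and `protein_length` are hypothetical helpers for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record: Start bp 6885, End bp 8222.
length_bp = gene_length(6885, 8222)
print(length_bp, protein_length(length_bp))
```

With the record's coordinates this gives 1338 bp and 445 aa, matching the Gene Length and Protein Length fields.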
Plasmid Coverage information |
Num covering plasmid clones | 43 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTAGAA AGATGAAGGA TAGTGGGGTT GAGTGGATTG GAGAAATACC TAGTGATTGG AAGATTTTAA GGTTAAAGAA TGTACTTAAA GAAAGAAATG AAAAAAATAG TCCAATAAAA ACAAATGAGA TTTTGTCTTT AACGATAGAA AAAGGAGTTA TTCCATATAA AGAAAAAAAA TCTGGTGGTA ATAAAGCTAA AGAAGATCTA TCAAATTATA AATTAGCTTA TCCAAACGAT ATTGTATTAA ATAGTATGAA TGTAATAGTA GGGGCGGTTG GTATATCAAA ATATTATGGA TGTGTAAGTC CTGTTTATTA TGTATTATAT TCTGATGATG TAGAACAAAA TATTAGATTT TATAATTACT TATTCCAATC ATCTGCTTTT CAAAAGAGTT TAATTGGTTT AGGAAACGGT ATTATGATGA AACAGTCGAG TACAGGAAAA TTAAACACAA TAAGGTTGAG AATACCTTTA GATAGATTAA AAAATGTATA TCTTCCTGTA CCGCCAGTTT CAGTCCAACA AAAAATCGTC AATTTCCTAG ATGAAAAAGT GTCACATATT GATACGATTA TCGAAAAAAA CAAACAGTCT ATTGAGGAAT TAAAAAAGTA TAAACAATCG TTAATCGCGG AAACGGTCAC AAAAGGGTTA GATCCTAATG TAGAGATGAA AGATAGTGGG ATTGAGTGGG TTGGGGAGAT ACCAAAGCAT TGGGAAATTA GAAGATTAAG AGATATATCA ATTATTACTA GAGGAACAGT TGATAAAAGC AAAGAGAAAA ATGAAATACC TGTTTATTTA GTACAATATA CGAATGTTTA TTATAAAAGA GAACAAAAAA TAAATGATGA TGATTATTTA CCAATTACTG TTTCAGAAAA TGAATATAAA AAATATAAAG TAAGAAAGGG AGATATATTA TTAACAGCAA GTTCAGAAAC AAAAGATGAT ATAGGTCATA GTACGGTAAT AGTTGAAGAT TTACCGAATC ATGTTTTTGG ATCAGACATA ATTAGAATCA GAATACCTAA TAAAATAGTT GATCTAAATT ATAAAAAGTA TTTTATGGAG AATTATTATT ATTTAGCAAA ATTTGATAAA TTATCTAGGG GTATAACACG GTTTAGATTT GGTATGGATC AATTTAAATC ATTAAAATAT GTTATTCCTC CTATTGAAGA GCAGGTCAAA ATTGCAAAAT ATCTTGATAA TATTACTAAT CATATCAATC AATTAATTTG TAATAAGGAA AAATTAATAA ACGAACTGGA ATCCTACAAA AAATCCCTCA TCTACGAATA TGTCACAGGC AAAAAGGAGG TTATGTAA
|
Protein sequence | MSRKMKDSGV EWIGEIPSDW KILRLKNVLK ERNEKNSPIK TNEILSLTIE KGVIPYKEKK SGGNKAKEDL SNYKLAYPND IVLNSMNVIV GAVGISKYYG CVSPVYYVLY SDDVEQNIRF YNYLFQSSAF QKSLIGLGNG IMMKQSSTGK LNTIRLRIPL DRLKNVYLPV PPVSVQQKIV NFLDEKVSHI DTIIEKNKQS IEELKKYKQS LIAETVTKGL DPNVEMKDSG IEWVGEIPKH WEIRRLRDIS IITRGTVDKS KEKNEIPVYL VQYTNVYYKR EQKINDDDYL PITVSENEYK KYKVRKGDIL LTASSETKDD IGHSTVIVED LPNHVFGSDI IRIRIPNKIV DLNYKKYFME NYYYLAKFDK LSRGITRFRF GMDQFKSLKY VIPPIEEQVK IAKYLDNITN HINQLICNKE KLINELESYK KSLIYEYVTG KKEVM
|
| |
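The sequences above are printed in space-separated blocks of ten characters, so they need light cleaning before use. The following is a minimal sketch, not the database's own tooling: it strips the spacing, computes the GC fraction, and translates codons. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first 30 bases of the gene sequence are used here to keep the example short.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
# Table 11 uses these same codon-to-amino-acid assignments.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for block formatting and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
prefix = "ATGAGTAGAA AGATGAAGGA TAGTGGGGTT"
print(translate(prefix))    # MSRKMKDSGV
print(gc_fraction(prefix))  # GC fraction of this 30-base prefix only
```

The translated prefix matches the first block of the protein sequence in the record. Running `gc_fraction` on the full gene sequence would give the value to compare against the GC content field.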