Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GYMC61_0288 |
Symbol | |
ID | 8524094 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. Y412MC61 |
Kingdom | Bacteria |
Replicon accession | NC_013411 |
Strand | + |
Start bp | 294724 |
End bp | 297735 |
Gene Length | 3012 bp |
Protein Length | 1003 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | |
Product | type I site-specific deoxyribonuclease, HsdR family |
Protein accession | YP_003251463 |
Protein GI | 261417781 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTTTACAG AAGAGCAGCT AGAGCAAGCC GTCATTGAAT ATTTTCAAGA GCTCGGCTAC CCATATATGC CAGCGAAGGA GTTAAAGCGA GATAAAAAGG ACGTTTTATT GCTTGACCGT TTGGAAGAAG CGTTAGTGAA ATTGAATCCA GAAGTGCCTG TTGAGATCAT TCGCGAGGTG ATGCGCAAAA TCCACTATTT TGAAACGAAT GACGTGCTTA CAAACAACAA AATGTTTCAT AAGTACTTAA CAGAAGCGGT AGTAGTGCCT GAGCTTGTGA ATGGAGAGAC GGTTTACCAT CATGTTCGAC TCATCGATTG GGAAACGCCA GAAAACAACG ATTTTCTCGT TGTCAACCAA TTGGAAGTCA TTGAAAAAGG ACAGGAGAAA ATTCCAGACA TCGTGCTGTA TGTCAACGGT CTTCCGCTTG TTATTGCAGA GTTAAAAAGC ACGTCACGCG AAGAAGTCGA TATTGAAGAT GCATACAAGC AGCTAAAAAA TTATATGAAC GTCCACATCC CTTCATTGTT CTACTACAAT GCGTTTCTTG TCATTAGTGA CGGGGTTCAG GCGCGCGCGG GAACGATCAC CGCGCCGCTT GACCGGTTTA TGGCTTGGAA AAAGATCAAT ATCGAAGATG ATGTCATAGA AAACCGGGAG TTAGAAACGT TAATATTCGG CTTGTTCGAG CCAAAACGCT TTTTGGATGT CATCAAAAAC TTTACGCTAT TTGCAAACGA AGCAAAAATA ATGGCGGCGT ATCACCAATA TTACGGCATG AAGAAAGCAG TGGCTTCTAC GATCCAAGCC ATTCATACAG ATAAACGTGC AGGAGTCATT TGGCATACGC AAGGAAGTGG CAAAAGTTAT TCGATGGTGT TTCTTGCGGG GAACTTAGTC AGACAGGAAC AGCTGAAAAA TCCAACCATT GTCGTGATCA CCGATCGCAA CGATTTAGAC GGACAGTTAT TTGAAACGTT TTGCGGGGCG AGTGAATTTT TACGGCAAAC ACCATTACAA GCGGAAACGC GCAGCCATTT GAAAGAGTTG TTGGAACATC GGCAAACGGG CGGCATCGTT TTTTCAACGA TTCAAAAGTT TGAAGAAGAA ACCGGCTTGC TTTCTGAACG GGAAAACATC ATCGTGATGG TTGACGAAGC CCACCGCTCC CAATACGGCG TTGATCCGAA ATACGATATC GAAACGGGGG AGCAAAAGTA CGGGTATGCG AAATATTTGC GTGAAGCTTT GCCGAATGCG ACGTATATCG CATTTACAGG GACGCCGATT GAAACGACCG ATCGATCGAC GACCGGCTTG TTCGGCGATG TCATTGATGT GTATGATATG ACCCAAGCGG TGGAAGACGG GGCGACGGTC AAAATTTATT ACGAATCTCG CTTGGCGAAA GTGAAATTGG ACGAGAAGAA AATGAATGAA ATTGACCAAG AATATTGGCG CATGCAAGTT CACGAAGGCG TCGGCGACTA TATTATTGAA CAAAGCCAAA AAAGCTTGAG CCGCATGGAG CAAATCATAG GCGACCAAGA CCGAATTCGC GAAGTCGTCA CGGACATTAT CCACCATTAC GAGGAGCGCG AACATCTTGT CAAAGGAAAA GCGATGATCA TTGCTTATTC GCGCAACACC GCGTTTGCAA TGTATAAAGA GATCATGCGT CAACGTCCGG ACTGGAAAGA CAAAGTGAAA ATTGTCATGA CCGAAAACAA TCAGGATCCG GAGGAGCTCG CGAAGCTTGT CGGAAATAAA CAAACGCGGA AACAGCGGGA AAAAGAATTT AAAGATGTCG ATCATCCGTT TAAAATCGTC ATTGTTGTTG ATATGTGGCT CACCGGTTTT GACGTGCCGG CGCTCGATAC AATGTATATT GACAAGCCCA TGAAGGCGCA TAACTTGATG CAAGCGATCG CCCGCGTCAA CCGCGTTTAT CCGGGGAAAA CAGGCGGATT GATTGTTGAT TACATCGGCT TAAAGAAAAA TTTAATGGAA GCGTTGCAAA CGTATACGAA GCGCGACCAA GATAAAGTGC AAGAAAATAC CCAAGCCCGC GACATCGCGT TAAACATCCT CGAAGTGTTG CGCAATATGT TCCATTCGTT TGATTATCGC GCCTTTTTCG GTGATAGCGA CAAAAAGCGT TATGAAGTCA TTCGCGACGG AGCAGAATTT GTGCAGCAAA CGGAAAAAAG AAAATTGCTG TTTATGACGG AAACGAAAAA GCTGAAAGAT GTTTATAAAA TTTGCACCGG CTTGCTTTCG AAAGAACAAA AAGAGGAAAT TTCCTATTTT ATCGCTGTTC GTTCCTTTAT TATGAAATCT TCGCGAACAG GCACACCTGA CTTAAAAGAA GTGAATGAAC GAATCGCGAA AATGTTGGAA GAAGCGATTT TGGAAGATGA AGTGATGGTG TTGACCCAAG CGGTTTCATC GGAAAGTTTT GATTTGTTGA ATGAGGAGAA CATCAAAAAA TTACGCGCCT TGCCGCAAAA AAATATTGCG TCGACCATTT TAATGCGCGT ATTAAAGCAA AAATTGCAAG ATGTGAAAAA GACAAACATG ACGGTGAGCC AAACATTTTC CAAACGTTTT GAAAAAATAT TAGAAAAATA CAACAATCGA AATGATTACA CGGATGTGTA TGAAGTATTT GAGGAATTGC TTAAATTTAA AGAAGAGTTG CAGGCGGCGA TTGAAGAAGG GAAACAGCTT GGCTTAACCG AGGAGGAGAA GGCGTTTTTC GACGTGTTAG GTTCTGACCC GGATATAAAA AAATTAATGG AAGATGAGGT ATTAATTCAA ATCGCAAAGG ATTTGGCGAA AACGGTAAAG GAAAACCGGA CGCACGATTG GGATAAAAAA GCGCAAGCCC AAGCGCGCAT GCGCCTTGAA ATTAAGAAGG TGCTGCGCAA GTACGATTAT CCGCCAAATA AACAGCCGAA AGCGGTGGAA GATGTGCTTG AGCAGGCGAA GCTGCAGTGC ATGAATATGT AA
|
Protein sequence | MFTEEQLEQA VIEYFQELGY PYMPAKELKR DKKDVLLLDR LEEALVKLNP EVPVEIIREV MRKIHYFETN DVLTNNKMFH KYLTEAVVVP ELVNGETVYH HVRLIDWETP ENNDFLVVNQ LEVIEKGQEK IPDIVLYVNG LPLVIAELKS TSREEVDIED AYKQLKNYMN VHIPSLFYYN AFLVISDGVQ ARAGTITAPL DRFMAWKKIN IEDDVIENRE LETLIFGLFE PKRFLDVIKN FTLFANEAKI MAAYHQYYGM KKAVASTIQA IHTDKRAGVI WHTQGSGKSY SMVFLAGNLV RQEQLKNPTI VVITDRNDLD GQLFETFCGA SEFLRQTPLQ AETRSHLKEL LEHRQTGGIV FSTIQKFEEE TGLLSERENI IVMVDEAHRS QYGVDPKYDI ETGEQKYGYA KYLREALPNA TYIAFTGTPI ETTDRSTTGL FGDVIDVYDM TQAVEDGATV KIYYESRLAK VKLDEKKMNE IDQEYWRMQV HEGVGDYIIE QSQKSLSRME QIIGDQDRIR EVVTDIIHHY EEREHLVKGK AMIIAYSRNT AFAMYKEIMR QRPDWKDKVK IVMTENNQDP EELAKLVGNK QTRKQREKEF KDVDHPFKIV IVVDMWLTGF DVPALDTMYI DKPMKAHNLM QAIARVNRVY PGKTGGLIVD YIGLKKNLME ALQTYTKRDQ DKVQENTQAR DIALNILEVL RNMFHSFDYR AFFGDSDKKR YEVIRDGAEF VQQTEKRKLL FMTETKKLKD VYKICTGLLS KEQKEEISYF IAVRSFIMKS SRTGTPDLKE VNERIAKMLE EAILEDEVMV LTQAVSSESF DLLNEENIKK LRALPQKNIA STILMRVLKQ KLQDVKKTNM TVSQTFSKRF EKILEKYNNR NDYTDVYEVF EELLKFKEEL QAAIEEGKQL GLTEEEKAFF DVLGSDPDIK KLMEDEVLIQ IAKDLAKTVK ENRTHDWDKK AQAQARMRLE IKKVLRKYDY PPNKQPKAVE DVLEQAKLQC MNM
|
| |