Gene GYMC61_0288 details


Gene Information

Locus tag: GYMC61_0288
Symbol:
ID: 8524094
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. Y412MC61
Kingdom: Bacteria
Replicon accession: NC_013411
Strand:
Start bp: 294724
End bp: 297735
Gene Length: 3012 bp
Protein Length: 1003 aa
Translation table: 11
GC content: 44%
IMG OID:
Product: type I site-specific deoxyribonuclease, HsdR family
Protein accession: YP_003251463
Protein GI: 261417781
COG category:
COG ID:
TIGRFAM ID:
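
The coordinate and length fields above can be checked against each other. The sketch below is a consistency check only, and it rests on two assumptions that the record does not state: that Start bp and End bp are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 294724
end_bp = 297735
gene_length_bp = 3012
protein_length_aa = 1003

# Assumption: coordinates are 1-based and inclusive at both ends.
span = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons, the last being a stop codon.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)  # 3012 0 1003
```

Under those assumptions the span matches Gene Length and the codon count minus the stop codon matches Protein Length.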


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTTTACAG AAGAGCAGCT AGAGCAAGCC GTCATTGAAT ATTTTCAAGA GCTCGGCTAC 
CCATATATGC CAGCGAAGGA GTTAAAGCGA GATAAAAAGG ACGTTTTATT GCTTGACCGT
TTGGAAGAAG CGTTAGTGAA ATTGAATCCA GAAGTGCCTG TTGAGATCAT TCGCGAGGTG
ATGCGCAAAA TCCACTATTT TGAAACGAAT GACGTGCTTA CAAACAACAA AATGTTTCAT
AAGTACTTAA CAGAAGCGGT AGTAGTGCCT GAGCTTGTGA ATGGAGAGAC GGTTTACCAT
CATGTTCGAC TCATCGATTG GGAAACGCCA GAAAACAACG ATTTTCTCGT TGTCAACCAA
TTGGAAGTCA TTGAAAAAGG ACAGGAGAAA ATTCCAGACA TCGTGCTGTA TGTCAACGGT
CTTCCGCTTG TTATTGCAGA GTTAAAAAGC ACGTCACGCG AAGAAGTCGA TATTGAAGAT
GCATACAAGC AGCTAAAAAA TTATATGAAC GTCCACATCC CTTCATTGTT CTACTACAAT
GCGTTTCTTG TCATTAGTGA CGGGGTTCAG GCGCGCGCGG GAACGATCAC CGCGCCGCTT
GACCGGTTTA TGGCTTGGAA AAAGATCAAT ATCGAAGATG ATGTCATAGA AAACCGGGAG
TTAGAAACGT TAATATTCGG CTTGTTCGAG CCAAAACGCT TTTTGGATGT CATCAAAAAC
TTTACGCTAT TTGCAAACGA AGCAAAAATA ATGGCGGCGT ATCACCAATA TTACGGCATG
AAGAAAGCAG TGGCTTCTAC GATCCAAGCC ATTCATACAG ATAAACGTGC AGGAGTCATT
TGGCATACGC AAGGAAGTGG CAAAAGTTAT TCGATGGTGT TTCTTGCGGG GAACTTAGTC
AGACAGGAAC AGCTGAAAAA TCCAACCATT GTCGTGATCA CCGATCGCAA CGATTTAGAC
GGACAGTTAT TTGAAACGTT TTGCGGGGCG AGTGAATTTT TACGGCAAAC ACCATTACAA
GCGGAAACGC GCAGCCATTT GAAAGAGTTG TTGGAACATC GGCAAACGGG CGGCATCGTT
TTTTCAACGA TTCAAAAGTT TGAAGAAGAA ACCGGCTTGC TTTCTGAACG GGAAAACATC
ATCGTGATGG TTGACGAAGC CCACCGCTCC CAATACGGCG TTGATCCGAA ATACGATATC
GAAACGGGGG AGCAAAAGTA CGGGTATGCG AAATATTTGC GTGAAGCTTT GCCGAATGCG
ACGTATATCG CATTTACAGG GACGCCGATT GAAACGACCG ATCGATCGAC GACCGGCTTG
TTCGGCGATG TCATTGATGT GTATGATATG ACCCAAGCGG TGGAAGACGG GGCGACGGTC
AAAATTTATT ACGAATCTCG CTTGGCGAAA GTGAAATTGG ACGAGAAGAA AATGAATGAA
ATTGACCAAG AATATTGGCG CATGCAAGTT CACGAAGGCG TCGGCGACTA TATTATTGAA
CAAAGCCAAA AAAGCTTGAG CCGCATGGAG CAAATCATAG GCGACCAAGA CCGAATTCGC
GAAGTCGTCA CGGACATTAT CCACCATTAC GAGGAGCGCG AACATCTTGT CAAAGGAAAA
GCGATGATCA TTGCTTATTC GCGCAACACC GCGTTTGCAA TGTATAAAGA GATCATGCGT
CAACGTCCGG ACTGGAAAGA CAAAGTGAAA ATTGTCATGA CCGAAAACAA TCAGGATCCG
GAGGAGCTCG CGAAGCTTGT CGGAAATAAA CAAACGCGGA AACAGCGGGA AAAAGAATTT
AAAGATGTCG ATCATCCGTT TAAAATCGTC ATTGTTGTTG ATATGTGGCT CACCGGTTTT
GACGTGCCGG CGCTCGATAC AATGTATATT GACAAGCCCA TGAAGGCGCA TAACTTGATG
CAAGCGATCG CCCGCGTCAA CCGCGTTTAT CCGGGGAAAA CAGGCGGATT GATTGTTGAT
TACATCGGCT TAAAGAAAAA TTTAATGGAA GCGTTGCAAA CGTATACGAA GCGCGACCAA
GATAAAGTGC AAGAAAATAC CCAAGCCCGC GACATCGCGT TAAACATCCT CGAAGTGTTG
CGCAATATGT TCCATTCGTT TGATTATCGC GCCTTTTTCG GTGATAGCGA CAAAAAGCGT
TATGAAGTCA TTCGCGACGG AGCAGAATTT GTGCAGCAAA CGGAAAAAAG AAAATTGCTG
TTTATGACGG AAACGAAAAA GCTGAAAGAT GTTTATAAAA TTTGCACCGG CTTGCTTTCG
AAAGAACAAA AAGAGGAAAT TTCCTATTTT ATCGCTGTTC GTTCCTTTAT TATGAAATCT
TCGCGAACAG GCACACCTGA CTTAAAAGAA GTGAATGAAC GAATCGCGAA AATGTTGGAA
GAAGCGATTT TGGAAGATGA AGTGATGGTG TTGACCCAAG CGGTTTCATC GGAAAGTTTT
GATTTGTTGA ATGAGGAGAA CATCAAAAAA TTACGCGCCT TGCCGCAAAA AAATATTGCG
TCGACCATTT TAATGCGCGT ATTAAAGCAA AAATTGCAAG ATGTGAAAAA GACAAACATG
ACGGTGAGCC AAACATTTTC CAAACGTTTT GAAAAAATAT TAGAAAAATA CAACAATCGA
AATGATTACA CGGATGTGTA TGAAGTATTT GAGGAATTGC TTAAATTTAA AGAAGAGTTG
CAGGCGGCGA TTGAAGAAGG GAAACAGCTT GGCTTAACCG AGGAGGAGAA GGCGTTTTTC
GACGTGTTAG GTTCTGACCC GGATATAAAA AAATTAATGG AAGATGAGGT ATTAATTCAA
ATCGCAAAGG ATTTGGCGAA AACGGTAAAG GAAAACCGGA CGCACGATTG GGATAAAAAA
GCGCAAGCCC AAGCGCGCAT GCGCCTTGAA ATTAAGAAGG TGCTGCGCAA GTACGATTAT
CCGCCAAATA AACAGCCGAA AGCGGTGGAA GATGTGCTTG AGCAGGCGAA GCTGCAGTGC
ATGAATATGT AA
 
Protein sequence
MFTEEQLEQA VIEYFQELGY PYMPAKELKR DKKDVLLLDR LEEALVKLNP EVPVEIIREV 
MRKIHYFETN DVLTNNKMFH KYLTEAVVVP ELVNGETVYH HVRLIDWETP ENNDFLVVNQ
LEVIEKGQEK IPDIVLYVNG LPLVIAELKS TSREEVDIED AYKQLKNYMN VHIPSLFYYN
AFLVISDGVQ ARAGTITAPL DRFMAWKKIN IEDDVIENRE LETLIFGLFE PKRFLDVIKN
FTLFANEAKI MAAYHQYYGM KKAVASTIQA IHTDKRAGVI WHTQGSGKSY SMVFLAGNLV
RQEQLKNPTI VVITDRNDLD GQLFETFCGA SEFLRQTPLQ AETRSHLKEL LEHRQTGGIV
FSTIQKFEEE TGLLSERENI IVMVDEAHRS QYGVDPKYDI ETGEQKYGYA KYLREALPNA
TYIAFTGTPI ETTDRSTTGL FGDVIDVYDM TQAVEDGATV KIYYESRLAK VKLDEKKMNE
IDQEYWRMQV HEGVGDYIIE QSQKSLSRME QIIGDQDRIR EVVTDIIHHY EEREHLVKGK
AMIIAYSRNT AFAMYKEIMR QRPDWKDKVK IVMTENNQDP EELAKLVGNK QTRKQREKEF
KDVDHPFKIV IVVDMWLTGF DVPALDTMYI DKPMKAHNLM QAIARVNRVY PGKTGGLIVD
YIGLKKNLME ALQTYTKRDQ DKVQENTQAR DIALNILEVL RNMFHSFDYR AFFGDSDKKR
YEVIRDGAEF VQQTEKRKLL FMTETKKLKD VYKICTGLLS KEQKEEISYF IAVRSFIMKS
SRTGTPDLKE VNERIAKMLE EAILEDEVMV LTQAVSSESF DLLNEENIKK LRALPQKNIA
STILMRVLKQ KLQDVKKTNM TVSQTFSKRF EKILEKYNNR NDYTDVYEVF EELLKFKEEL
QAAIEEGKQL GLTEEEKAFF DVLGSDPDIK KLMEDEVLIQ IAKDLAKTVK ENRTHDWDKK
AQAQARMRLE IKKVLRKYDY PPNKQPKAVE DVLEQAKLQC MNM
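
The gene and protein sequences above are printed in space-separated blocks of ten, and the gene starts with GTG while the protein starts with M. The sketch below shows one way to strip the block spacing and translate the gene so the two listings can be compared. It is a minimal illustration, not the method used to produce this record: the function names are hypothetical, the codon table is the standard one (which translation table 11 shares for amino-acid assignments), and the set of alternative start codons read as M is an assumption based on NCBI's description of table 11.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: start codons of translation table 11, each read as M at position 0.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence and first two blocks of the protein sequence.
first_gene_line = "GTGTTTACAG AAGAGCAGCT AGAGCAAGCC GTCATTGAAT ATTTTCAAGA GCTCGGCTAC"
first_protein_blocks = "MFTEEQLEQA VIEYFQELGY"

print(translate_cds(first_gene_line))  # MFTEEQLEQAVIEYFQELGY
print(translate_cds(first_gene_line) == clean(first_protein_blocks))  # True
```

Passing the full gene sequence to `translate_cds` and comparing the result with `clean` applied to the full protein sequence extends the same check to the whole record.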