Gene GYMC61_1672 details


Gene Information

Locus tag: GYMC61_1672
Symbol: hsdR
ID: 8525536
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. Y412MC61
Kingdom: Bacteria
Replicon accession: NC_013411
Strand:
Start bp: 1692100
End bp: 1695342
Gene Length: 3243 bp
Protein Length: 1080 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: type I restriction enzyme EcoKI subunit R
Protein accession: YP_003252787
Protein GI: 261419105
COG category:
COG ID:
TIGRFAM ID:
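
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1692100
END_BP = 1695342
GENE_LENGTH_BP = 3243
PROTEIN_LENGTH_AA = 1080


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one terminal stop codon (assumes an unspliced CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 3243
    print(expected_protein_length(GENE_LENGTH_BP))  # 1080
```

Under these assumptions both computed values agree with the listed gene length (3243 bp) and protein length (1080 aa).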


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Fosmid Coverage fields:
Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGCGCATA ACTTTCAATT TCTAGAAGGA AAATGGGATG TGCTTGCCCG GGTAGGGGAG 
ACGGCCGAGC GGCATGTGCA TCAAAACCCA AACGTGGCGA TCAGCGAGCT GCGCAAGTTG
GCCGAGACGA TGACGAAATA CATATTGGCG TTCGAAGGCA TTCGCGAAGA GCGCGGCACC
GATCAGCAAG ACCGCCTCAA GGTGCTGCTG TATGAACAAA TCATCCCAAA AGAGATATAT
GACATCTTCA CGATGATCCG CTTAAAAGGC AATCAAGCGG TGCATGATCC CAATTATGGG
GACGTGCATG AGGCAAAGAC GCTATTGCAC ATGGCGTTTC GCCTAGCTGT ATGGTTTATG
GAAGTGTACG GCGACTGGTC GTTCGAAGCT CCCCAGTATC GAGAGCCGCT GCCGTCATCG
TCTGAATCGA CGGATGAACT GAATCGGCTC ATCGAATCGT ATAAACAGCG GCTTGCCGAC
TTGGAAGCGG AGCTTTCGCA CATCCGCGAG GCGGGGCTGT ATACGAGTGC CGAGGAAAAA
CAAAAGCGGC GTGACTATTC GCGGCGGGCG GTGGCGAATT TCGAACTGAC TGAGGCAGAG
ACGCGCCTCT TCATCGATGA ACAGCTGCGG GCGGCTGGCT GGGAGGCTGA TTCGGAAAAA
CTGCGCTTTG CCAAAGGCGC GCGGCCAGAG AAGGGGCGGA ATTTGGCGAT TGCCGAATGG
CCGCTTCGGC ACGGGGTTGC CGATTATGCT CTTTTTATCG GCTTGGAGCT GGTCGGCCTC
ATTGAAGCGA AACGCGCAAG CAAAGACATC CCCGCCGATA TCGAACAGGC AAAACAATAC
GCCCGTCTCG TTGTGCGCCA TGGGCAAGAA GTGATTCACG AGCCGTGGGG GGAGTATTTC
GTTCCATTTT TGTTCGCCAC CAACGGCCGT CCGTACGTGA AACAGCTTGA ACAAAAGTCC
GGCATTTGGT TTTTGGACGC CCGGAAATCG ACGAACCATC CGCGCCCGCT GCAAGGCTGG
TATACGCCGG ACGGGTTGAA ACAGCTGCTG GAACAAGACA TTGAACGATC CGAACAGCGG
CTGCGCGATG AGACGTTCGA TTATTTGAAA CTGCGGCCGT ACCAAGTGCG GGCGATTCAG
GCTGTTGAGC GGGCGCTTGA AGACAAGCGC CGGAGCGTGC TTGTCGCCAT GGCGACGGGA
ACGGGAAAAA CGCGCATGGC GATCGGCTTG ATTTACCGCT TGCTCAAAGC GAAACGGTTT
CGGCGCATTT TGTTTCTCGT TGACCGAAAA GCGCTCGCCG AACAAGCGGA AGCGGCGTTT
AAAGAAAGCC GAATGGAGCA CTTTCAGACG TTTGCCGAGA TTTACGGTTT GCAGTCGCTG
TACGACCAAA AGCCCGATCC GGAGACGAAA GTGCATATCG CCACCGTTCA AGGAATGTTG
AAGCGCATTT TTTACAACGA CCGTCCCGAA GACGTTCCGC CGATTGACCA GTATGACTGC
ATCATCGTCG ACGAAGCCCA CCGCGGCTAT ACGTTGGATA AAGAAATGAA CGAAATCGAG
CTGGAGTTTA AAGACCATCG CGATTATGTA AGCAAGTATC GGCAAGTGCT CGATTACTTC
GATGCCGTCC GCATCGGTTT GACCGCGACG CCCGCCTTGC ATACGACGGA CATTTTCGGC
CCGCCCGTGT TTACGTATTC GTATCGCGAG GCGGTCATCG ACGGGTATTT GGTCGACCAT
GAGCCGCCGT ATCAGTTTGA TACGGTGCTG AAACGGGAAG GGATTACGTG GGCGAAAGGA
GAGACGGTTG ATGTCTACGA CGCCGTATCC CATACGGTGT CGCAAGAGTA TTTGGAGGAT
GAACTGAACA TTGATGTTTC GCACTTTAAC ACGAAAGTCG TAACGGAAAG CTTTAACCGC
GTCATCATTC GCGAGCTCGT CAACTATATC GCTCCGGATG ATGAAGGGAA GACGCTCATT
TTTGCCGCGA CGGACGATCA CGCTGATCTC GTCGTCCGAC TGCTGAAAGA AGAATTTGCG
CGCGTATACG GAGAGTTCGA TGACAATGCG GTCATGAAAA TCACCGGATC GATCAAAGAC
CCGTCAGGAG CGATCCGCAG GTTTAAAAAC GAAAAATACC CGACAATCGC GGTGACGGTC
GATTTGTTAA CGACAGGGGT CGATGTGCCG GCGATTACGA ACCTCGTCTT TTTGCGCCGC
GTTCGCTCGC GCATTTTGTA TGAACAAATG CTCGGCCGGG CGACAAGGCG GTGCGATGAG
ATTGGGAAAG ACCATTTCAA CATCTTTGAC GCCGTCGGCA TTTACGAAAC GTTAAAGCCG
TATACAAGCA TGAAACCGGT CGTCGCCCGC CCGCAGGCGA CGCTGACGGA GCTGTTCGAT
GAACTCGAAC AGCTTGAACA GACCGCCCAT CTGGAATACC AAAAAGAACA AATCATTGCG
AAGATGCAAC GGAAAAAGCG GACGTGGTCC GACAGGCAAC ACGAAGATTT CCGCGTGCTT
TCCGGCGGAA AAACGGTCGA TGAATTCATC GACTGGCTGA AATCGCTGCC GTCTGATGAA
CTGAAAGACG GGCTGAAAGA ATATAAATCG ATGTTCCGCT ATTTGGACGA AAACCGGTAC
CGCGAGCGCA AGCAATACAT TTCCCACCAT GAAGACAAGC TGCTTGGCGT CAAACGAGGC
TACGGCAATG CGGAAAAACC GGACGACTAT TTAGAGGCGT TCGGTGAATT TATCCGCACG
AACATGAATA AAATCCCGGC GCTGATGATT GTCTGCCAGC GGCCGTCCGA ACTGACGAGG
GAAGAGTTGA AACAACTGCG ATTGGAACTT GACCGAAGAG GCTTTTCCGA GAAAAAACTG
CAGGCGGCAT GGCGGGAGGC GAAAAATGAC GACATCGCCG CCGACATTAT CGCCTTCATC
CGCCAACAGG CGCTCGGCGA CCCGCTCATC AGTCATGAAG AACGAATTCG CCGCGCCATG
AACGCCATTT ACCGGATGAA ACCATGGCCG CCGCTGCAAA AGAAATGGCT TGAGCGGATC
GAAAAACAGC TGCTGCAAGA ATATGTCCTC CATCCCGACC CGGAAAAAGC GTTTGACTAC
GAACCGTTTA AAAGCCACGG CGGCTTTAAG CAGCTGAACA ATATTTTCGG CGGTCAATTG
CCGCAAATCG TGCGCGAAAT TAATGAAAAC TTGTACAACT ACGCCAAGAA GGAGCAAGCC
TAG
 
Protein sequence
MAHNFQFLEG KWDVLARVGE TAERHVHQNP NVAISELRKL AETMTKYILA FEGIREERGT 
DQQDRLKVLL YEQIIPKEIY DIFTMIRLKG NQAVHDPNYG DVHEAKTLLH MAFRLAVWFM
EVYGDWSFEA PQYREPLPSS SESTDELNRL IESYKQRLAD LEAELSHIRE AGLYTSAEEK
QKRRDYSRRA VANFELTEAE TRLFIDEQLR AAGWEADSEK LRFAKGARPE KGRNLAIAEW
PLRHGVADYA LFIGLELVGL IEAKRASKDI PADIEQAKQY ARLVVRHGQE VIHEPWGEYF
VPFLFATNGR PYVKQLEQKS GIWFLDARKS TNHPRPLQGW YTPDGLKQLL EQDIERSEQR
LRDETFDYLK LRPYQVRAIQ AVERALEDKR RSVLVAMATG TGKTRMAIGL IYRLLKAKRF
RRILFLVDRK ALAEQAEAAF KESRMEHFQT FAEIYGLQSL YDQKPDPETK VHIATVQGML
KRIFYNDRPE DVPPIDQYDC IIVDEAHRGY TLDKEMNEIE LEFKDHRDYV SKYRQVLDYF
DAVRIGLTAT PALHTTDIFG PPVFTYSYRE AVIDGYLVDH EPPYQFDTVL KREGITWAKG
ETVDVYDAVS HTVSQEYLED ELNIDVSHFN TKVVTESFNR VIIRELVNYI APDDEGKTLI
FAATDDHADL VVRLLKEEFA RVYGEFDDNA VMKITGSIKD PSGAIRRFKN EKYPTIAVTV
DLLTTGVDVP AITNLVFLRR VRSRILYEQM LGRATRRCDE IGKDHFNIFD AVGIYETLKP
YTSMKPVVAR PQATLTELFD ELEQLEQTAH LEYQKEQIIA KMQRKKRTWS DRQHEDFRVL
SGGKTVDEFI DWLKSLPSDE LKDGLKEYKS MFRYLDENRY RERKQYISHH EDKLLGVKRG
YGNAEKPDDY LEAFGEFIRT NMNKIPALMI VCQRPSELTR EELKQLRLEL DRRGFSEKKL
QAAWREAKND DIAADIIAFI RQQALGDPLI SHEERIRRAM NAIYRMKPWP PLQKKWLERI
EKQLLQEYVL HPDPEKAFDY EPFKSHGGFK QLNNIFGGQL PQIVREINEN LYNYAKKEQA
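
The gene and protein sequences above are related through translation table 11 (the bacterial, archaeal and plant plastid code). As a minimal sketch, the code below translates the first 60 nucleotides of the gene sequence and compares the result with the first 20 residues of the protein sequence. It assumes that table 11 uses the standard codon-to-amino-acid assignments and that an alternative start codon such as GTG is read as methionine at the first position; `translate_cds` and `START_CODONS` are hypothetical names, and the start-codon set is a deliberately small subset.

```python
BASES = "TCAG"
# Standard amino-acid assignments in NCBI order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: a small subset of the start codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    protein = []
    for pos in range(0, usable, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, with the spacing kept as displayed.
FIRST_60_NT = "GTGGCGCATA ACTTTCAATT TCTAGAAGGA AAATGGGATG TGCTTGCCCG GGTAGGGGAG"

if __name__ == "__main__":
    print(translate_cds(FIRST_60_NT))  # MAHNFQFLEGKWDVLARVGE
```

The translated fragment matches the first two blocks of the protein sequence (MAHNFQFLEG KWDVLARVGE). The leading GTG codon yields M only because of the start-codon rule; at an internal position it would encode valine.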