Gene GWCH70_2067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_2067 
Symbol 
ID7977302 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp2126254 
End bp2129400 
Gene Length3147 bp 
Protein Length1048 aa 
Translation table11 
GC content37% 
IMG OID644798881 
Producttype III restriction protein res subunit 
Protein accessionYP_002950051 
Protein GI239827427 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGAGAA ATTCAAACAG AGCAACTGCT ATTCCCTTTA ACAAACGGCT TGTTTTGAAC 
CGTTTTATGC TTCATTTATT TGAAGCGGAT GATTTTGATG ATTTAGTAAT GGGGATGAAA
GAGCCGGAAT TAGAGGGGTG GGATAGTGAT AACATATCCC ACTTCTACCA CTTTCTTTCC
AATCGGACTT TTGACCGTCA CAAGTTAAGT AATGAGATGT TACGTCAGTA TGATGAAAAT
ATTGTAAGAC ATACAATGCG TATCAATCTA AAGAGAAAAG AAGCGATTAA GTGGAAATAC
TTCCAATATT TGTCCCTCTT GTTTACAGAA ATATATCTGG ACCGTTATTT TACGGACCCA
GAAGCGTTGT TGAATGAGTT AAATAACTTC TTAGATACAT TCAATGCAAA GGAAAATGCT
CAAATTGACA AGTACAGCCG AGAGGACTTA AATAAGTTAG CCTTTTGGAA CGCAACAGGT
TCTGGAAAAA CGCTATTGAT GCATGTGAAC ATATTACAAT ATTTGCATTA CTTATCAAAG
CATCGTAAAG AAAATGAGCT AAATCAAATT ATCCTTCTTA CTCCAAAGGA GGACTTATCC
ATTCAGCATT TAAGAGAATT TGAGCTTTCT GGAATCGATG CAGAATTGTT TGATAAAGAC
CAGGGTAGCC TGTTCCGTCA GCAAAGTGTG ACCATCATTA ATATCCAAAG AATCAGAGAG
GAGTCCGGGG AAAAGACGGT AGCCGTTGAT ACATTCGAAG GTAACAATTT GGTTCTTGTG
GATGAGGGAC ATCGTGGTTC ACGTGGACAA GAGTGGAAGA GAATGCGAGA CCGTTTAAGT
GAACAAGGGT TTGCATTCGA ATATTCGGCA ACATTTGGTC AAGCAGTTTC AGGAAATAAG
GACTTACAAC AAGAATACGC CAAGTGTATC TTATTTGATT ACTCCTACCG TTATTTCTAT
AGGGATGGAT TCGGAAAGGA TTTTCAAATC CTGAATTTGG CTGATGATTC AAATGAGGAT
ATTCGTCGTC TCTACTTAAC AGCATGTTTG GTAACATTCT ATCAACAGCT TAAGCTATAT
CAAGAGCATG AGCATTCTAT ACGACCTTTT CTGATTGAAA AACCTTTATT AGTGTTTGTA
GGAAGTAGTG TAAATGCAGT TAGAAGAGAA AGAGGGAAAA AGGTTTCTGA CGTAACAGAT
GTTCTTTTAT TCATTGCACA ATTTGCAAAA GAAGAAAGAC GAAGTATATC ATATCTAGAG
CGGTTAATGA ATGGTAATGC TGGTTTACGG GACGAAAAAG GCAGGGAAAT ATTCCGTAAT
GCATTTACCT ATTTAACAAC GTTGGAAATG ACCCCGGAAG CTCTATACGC GGATATGTTA
TCTACAGTCT TCCATGCAAG TCATTCTGGA GCAGTGTTAC ATGTTGTGAA TTTGAAGGGA
GTAGACAATG AAATTGCTTT ACGATTGGGG GACAATAAGC CATTTGGTGT TATCAATGTA
GGTGACGCCG CGGCATTATG CAAACTATGT GAAACGTATC CTGAGTTACA TGTAACGGAA
GAACAATTTA GAGATTCTTT GTTTCATCGG TTAAATGAGC GTGAATCGGA CATTCATGTA
CTGATTGGTT CCAAGAAGTT TACAGAAGGT TGGAGTAGCT GGCGGGTAAG TACCATGGGG
CTAATGAATA TGGGGAGGGG TGAAGGTGCG GAGATTATTC AACTCTTTGG TCGTGGTGTA
CGCCTTAAAG GATATGGGAT GAGTTTGAAG CGAAGTGAAG CTTTACGATT AGAAAATACA
GATATTCCAC ATCATATTAA GGTTTTAGAA ACACTTAATA TTTTTGGTGT TCGAGCTGAT
TATATGCAGC AGTTTAGAGA ATACCTTGAG GATGAAGGAA TAAAAACGGA TAACGATAAG
GAAACCATAC ACTTAAAAGT TATTCCTGAT TACGCAAGAA GAAAGATAAA CTTAAAAACC
CTTGCCCTAA AGGATATCGA TATCAACTCC TTTAAGAAAA AGGGGCCAAA ACCTTCCCTT
GAACCAGCGA CCGAAGAAAT GAATTTAAAA GTAATATTGA ATTGGTACCC TAAAATTCAG
AGGACAGAAA GCCGAAAACA GTTCTCTTCA CAAGCATCGG TAATACCTGA TGTAATGGAT
GAGTGTAAGT TAGAGTCCAT GCACCTTGCA TTCATGGATA TAGACCGGAT ATATTTTGAA
CTTCAAAAGT TTAAAAATGA ACGTTCTTGG TACAATCTCA AGCTTTCAAA AGAAAAGATT
AAAGAACTAC TCAACAGACA GGATTGGTAT ATCCTTTATA TTCCGAAAGC GGAGATGGTC
TTTGATTCTT TTCAAAAGGT CAAAAGATGG GAAGAGATTG CAATTTCCCT ATTGAAGAAG
TATTGTGAGC AATATTATTT GTACCGAAAA AAAGAGTGGG AAGCTCCTTT TATGGGGTAC
AAGGATTTAA ACGAAGAGGA TAAGAACTTT ATTCGTGAGT ATCGCGTAAC TTATGATACA
TCTGAAACGA CCTTGAAAAC AAAGCTAGAG CAGCTTCAGA AGATGCTTGA ATCAAACAAC
ATGAAGCCGG TTCAACATGG TACAATGGAA ATATTTGATT TTGAGCAACA TCTTTATAAA
CCTCTAATTT ATCTAGAGGG CAATGTGGCT ACTGTATCAC CGAAACCACT GAACAAGGGT
GAATACCAAT TTATTAATGA TTTACGAGCT TACTATGAGA AGAATAAAAT CTTCTTCCAA
GACAAAGAGT TATATTTGCT ACGTAATCAA TCAAGAGGTA AAGGAATTGG ATTCTTCGAA
GCGGGAAATT TCCATCCGGA TTTTATCTTA TGGATTCTCT ATCAGGGTAA GCAATACATC
ACATTTGTTG ACCCCAAAGG CATAAGAAAC ATGTCTGTGT ATGATAAGAA AATCCAGTTT
TACCGGACCA TCAAGGAGAA AGAAGCTGAA CTTGGGAATA GCTCCATTGT GCTGAACTCG
TTCATCATTT CAAATACGGA ATATGTAAAT CTCTTAAATA CAGGAACAAA GTTAAGCAAA
GAAGAGCTTG AGAACTTTAA TGTATTGTTC CAAGTAGAGG ACAAAGCAAC ATACATTGGT
AAGATGATTA ACAAGATTCT AGCATAA
 
Protein sequence
MPRNSNRATA IPFNKRLVLN RFMLHLFEAD DFDDLVMGMK EPELEGWDSD NISHFYHFLS 
NRTFDRHKLS NEMLRQYDEN IVRHTMRINL KRKEAIKWKY FQYLSLLFTE IYLDRYFTDP
EALLNELNNF LDTFNAKENA QIDKYSREDL NKLAFWNATG SGKTLLMHVN ILQYLHYLSK
HRKENELNQI ILLTPKEDLS IQHLREFELS GIDAELFDKD QGSLFRQQSV TIINIQRIRE
ESGEKTVAVD TFEGNNLVLV DEGHRGSRGQ EWKRMRDRLS EQGFAFEYSA TFGQAVSGNK
DLQQEYAKCI LFDYSYRYFY RDGFGKDFQI LNLADDSNED IRRLYLTACL VTFYQQLKLY
QEHEHSIRPF LIEKPLLVFV GSSVNAVRRE RGKKVSDVTD VLLFIAQFAK EERRSISYLE
RLMNGNAGLR DEKGREIFRN AFTYLTTLEM TPEALYADML STVFHASHSG AVLHVVNLKG
VDNEIALRLG DNKPFGVINV GDAAALCKLC ETYPELHVTE EQFRDSLFHR LNERESDIHV
LIGSKKFTEG WSSWRVSTMG LMNMGRGEGA EIIQLFGRGV RLKGYGMSLK RSEALRLENT
DIPHHIKVLE TLNIFGVRAD YMQQFREYLE DEGIKTDNDK ETIHLKVIPD YARRKINLKT
LALKDIDINS FKKKGPKPSL EPATEEMNLK VILNWYPKIQ RTESRKQFSS QASVIPDVMD
ECKLESMHLA FMDIDRIYFE LQKFKNERSW YNLKLSKEKI KELLNRQDWY ILYIPKAEMV
FDSFQKVKRW EEIAISLLKK YCEQYYLYRK KEWEAPFMGY KDLNEEDKNF IREYRVTYDT
SETTLKTKLE QLQKMLESNN MKPVQHGTME IFDFEQHLYK PLIYLEGNVA TVSPKPLNKG
EYQFINDLRA YYEKNKIFFQ DKELYLLRNQ SRGKGIGFFE AGNFHPDFIL WILYQGKQYI
TFVDPKGIRN MSVYDKKIQF YRTIKEKEAE LGNSSIVLNS FIISNTEYVN LLNTGTKLSK
EELENFNVLF QVEDKATYIG KMINKILA