Gene GWCH70_3444 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_3444 
Symbol 
ID7979515 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012794 
Strand
Start bp8223 
End bp11261 
Gene Length3039 bp 
Protein Length1012 aa 
Translation table11 
GC content34% 
IMG OID644800206 
Producttype III restriction protein res subunit 
Protein accessionYP_002951345 
Protein GI239828722 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCTTTTG AAGATAGTGA AAAACGATTT GAAGAGGATA TAGAAACCTA CCTTCTTACG 
GAAGGCGGGT ATGTGAAAGG CGACCAATCG AATTATGATA AAGAAAGAGC CATTGATCTC
AATCAGTTAA TTGATTTTAT CAAAGAAACA CAAGAAAAAG AATGGACTCG TTATGAAAAA
ATTTACGGTG AAGAAGCACC AAAAAAATTA TATAAGCGTT TAAATGACGA AATCGAAACA
AATGGACTGC TGCATGTGCT TCGTCATGGA ATTACTGATC GTGGCGTAAA GCTCAAAATA
GCCTCCTTTC GCCCTGAAAC AACGTTAAAT GAAAAAACAA TCAGAGATTA TCAAGCTAAT
AAACTGACTG TCATACGTCA ATTCGCTTAT TCTACAGAAA ATCATAATAC GTTGGATATG
GTTTTATCGT TGAATGGGAT TCCTATTGTA GCATTGGAGT TAAAGAATCA GTTTAAAGGA
CAGTCCGTTG AAAATGGAAA AAAACAGTTT ATGTATGATC GTGACCCAAG AGAAAAAATT
TTTCAATTTA ACAAACGCAT ACTAGTTTAT TTTGTAGTTG ATTTATATGA GGTTTGGATG
ACCACAAAGT TAGATGGAAA AAACACTGTT TTTTTACCGT TCAATCAAGG CTCAAACGGT
GCGGGAGAAG TAGGAGGGGC TGGAAACCCT GAAAATCCAG ACGGTTATGC GACCTCTTAT
TTATGGGAAA AAGTATTGCA TAAAGATAGC TTAATGAATA TATTGCAGCG ATTCATGCAT
TTAGAAGTAA AAAAAGAAAA GTTTATCAAA AACGGAAAGG AAAGCGTCAA AACAAGTTCT
AAGCTGATTT TCCCACGCTA TCATCAGTTA GATGTGGTCA GAAAACTTGT TGAAGATGTT
CGTCAAAAAG GAAGCGGGGA AAATTATCTT ATCCAACACA GCGCAGGTTC TGGAAAGTCG
AACAGCATAG CATGGTTGGC ATATCATTTA GCAAGTTTGC ACAATGAAGA TAATGAAAGC
ATTTTTACAT CAGTTATCGT GGTAACAGAT CGAACAGTAC TTGATCGGCA ATTACAACAA
ACAATTTCCA GTTTTGATCA TACAACAGGG CTTGTAGAAA CCATTGATGA CAAAAAGACT
TCCAAAGATT TAAGAGATGC TATTAATAAC GGAAAACGAA TTATTATCAC GACACTTCAA
AAGTTTCCTG TTATTTATGA AGAAGTAGAA GTCAACAAGG GAAGTCGCTT TGCCGTTATT
GTAGATGAAG CACATTCCTC CCAGACAGGA AAAAGTGCGA AAAAATTAAA AGCAGCATTA
GCAGATACAG AGGAAGCGTT ACGGGAATAT GCAGAACTGG AAGCAGAAAT CGAAGCTGAG
CAGTTAGATT TTGAAGATGA AATCGTTCAG GAACTTCTGA CACATGGCAG ACATAAAAAT
TTAAGTTTCT TTGCTTTTAC AGCTACTCCT AAAGAAAAGA CATTAGAAAT GTTTGGGACA
AAACAACCAG ATGGAACGTT TAAACCTTTT CATATTTACA GTATGCGTCA AGCTATCGAA
GAAGGATTTA TATTAGATGT ATTACAAAAT TATATGACCT ACAAAACATA TTATCGTATT
GCCAAAAACA CATCAGAGAA TCCGGAATTA TCGACAACAC AGGGAGTAAA AGCTATCAAG
CGTTATCAAT CATTACATCC ATATACTCTA CAACAAAAGA CGGCTATCAT GGTCGAACAA
TTTCGGAATG TAACTAGACA TAAAATTGGC GGTAAAGCAA AAGCAATGGT TGTCACAGCT
TCTCGTTTAC ATGCAGTCCG TTATTTCCAT GAGTTTAAGA AGTATATTAA GGAAAAAGGC
TACGATGATA TAGATGTGCT GGTAGCTTTT TCAGGCGTTG TTATTGATCA AAATGAAGAA
TACCGAGAAG AAACTTTGAA CAAAACAAAA GATGGAAAAC GAATTAAAGA AAGTCAATTG
AAAGAAGCTT TTCATTCAGA TGATTTCAAT ATATTAATTG TTGCGGAGAA ATATCAAACA
GGATTTGATG AACCATTACT TCATACAATG TTTATCGATA AGAAATTATC AGGAGTAAAA
GCTGTACAAA CTTTATCTCG GTTAAACAGG ACATATCCCG GAAAAGAGGA TACATTTATT
CTGGATTTTG TCAATGAAGC TGAAGATATA AAAAAAGCTT TCCAGCCTTA TTATGAAGTA
ACCGAACTGG ATAAAGAGAT TGATGTAAAT CTCATATATG ATACGAGAAC AAAGCTGAGA
AACTTTAAAA TTTACAATGA TCAAGACATT AAGAAACTAA CGAGAATTTA TTTTAAAAAA
GGGAAACAAA CAGAAAAGGA CTTAGGAAAA ATAGCCAGCC ACTTAATTCC AATTATTAAA
CGCTATGAAG AACTTGATGA AGAAACACAA TATAAATTTC GGGTTACGGT TCGTAACTTT
AATAAATGGT ATTCATATAT AACGCAACTC GTCAGAATGT TTGACAAAGA GTTGCATGAA
GAGTACATTT TCACATCGTA TTTAATTAAG TTTATTCCGA AAAATAGTGC AGAAAAAATC
AATATCGAAG ACAAAGTTAA ATTAGAGTAT TACAAGTTAG AAGAAACATT TAAAGGGACC
ATCACATTAG AATCAAACAG CCCAGAAAAT GTACTCAAAA ATTCAGATAA TGTTGATACA
GGCATAAAAC CTCCAGATGA TCAAGACTTA TTAGAAAATA TTATTCAACG AGTAAACAAA
AGATTTGAAG GCAAATTTAC AGAAGCTGAT CGAGTCATCG TAGAAGGAAT TTACAAAAAA
ACTGTTAAAG GCAATGAAAA ATTAAGGAGA TTTGCCAGAA ACAATGACGA AGAAATGTTT
AATAAAAGTA TCTTTCCTGA CGTGTTTGAA AAGGTCGCCC AAGAACTGTA TATGGAACAA
ATGAACGCTT ATTCTAAATT GTTTGAGGAT CGATCCTTTT ATAATGCAGT AATGGAGGCT
GTAGCGAAAG AAGTGTATAA GGAATTGAGA CGTGAATGA
 
Protein sequence
MAFEDSEKRF EEDIETYLLT EGGYVKGDQS NYDKERAIDL NQLIDFIKET QEKEWTRYEK 
IYGEEAPKKL YKRLNDEIET NGLLHVLRHG ITDRGVKLKI ASFRPETTLN EKTIRDYQAN
KLTVIRQFAY STENHNTLDM VLSLNGIPIV ALELKNQFKG QSVENGKKQF MYDRDPREKI
FQFNKRILVY FVVDLYEVWM TTKLDGKNTV FLPFNQGSNG AGEVGGAGNP ENPDGYATSY
LWEKVLHKDS LMNILQRFMH LEVKKEKFIK NGKESVKTSS KLIFPRYHQL DVVRKLVEDV
RQKGSGENYL IQHSAGSGKS NSIAWLAYHL ASLHNEDNES IFTSVIVVTD RTVLDRQLQQ
TISSFDHTTG LVETIDDKKT SKDLRDAINN GKRIIITTLQ KFPVIYEEVE VNKGSRFAVI
VDEAHSSQTG KSAKKLKAAL ADTEEALREY AELEAEIEAE QLDFEDEIVQ ELLTHGRHKN
LSFFAFTATP KEKTLEMFGT KQPDGTFKPF HIYSMRQAIE EGFILDVLQN YMTYKTYYRI
AKNTSENPEL STTQGVKAIK RYQSLHPYTL QQKTAIMVEQ FRNVTRHKIG GKAKAMVVTA
SRLHAVRYFH EFKKYIKEKG YDDIDVLVAF SGVVIDQNEE YREETLNKTK DGKRIKESQL
KEAFHSDDFN ILIVAEKYQT GFDEPLLHTM FIDKKLSGVK AVQTLSRLNR TYPGKEDTFI
LDFVNEAEDI KKAFQPYYEV TELDKEIDVN LIYDTRTKLR NFKIYNDQDI KKLTRIYFKK
GKQTEKDLGK IASHLIPIIK RYEELDEETQ YKFRVTVRNF NKWYSYITQL VRMFDKELHE
EYIFTSYLIK FIPKNSAEKI NIEDKVKLEY YKLEETFKGT ITLESNSPEN VLKNSDNVDT
GIKPPDDQDL LENIIQRVNK RFEGKFTEAD RVIVEGIYKK TVKGNEKLRR FARNNDEEMF
NKSIFPDVFE KVAQELYMEQ MNAYSKLFED RSFYNAVMEA VAKEVYKELR RE