Gene GWCH70_1647 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_1647 
Symbol 
ID7976363 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp1724843 
End bp1726657 
Gene Length1815 bp 
Protein Length604 aa 
Translation table11 
GC content32% 
IMG OID644798527 
Productprotein of unknown function DUF450 
Protein accessionYP_002949699 
Protein GI239827075 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.573593 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTGTAG AACACAATTT AAATTTAAAA AGTATCCTTA CAGAACAAGG ATTTAACTAC 
TATGTTTCTA AAGATATGGA AACGAATCTT TCTGTAGAAG GGATATTAAC AGAGGAATTT
GTCAATTCGA TAACTAAAAT TAATCCATTT ATTAATAAAT CCGAAGCATT TTCTCTTATG
AAAGAGAAAT TTAGTGGAAC ATCGCAATCA GTTTTTACAA ATAAAACGTT TATTAAGACA
TTGCAGGAAG GACTTCCGAA GAAGATAAAC GGTAAAATAA AAAATATTAG ATTTATCGAT
TTGGAGAACC TGAATAACAA CCAGTTCTCG GTTATTGAAG AATATCGTTT CGAAAAGAAT
GGGCGGACAA CTATTGTTGA TTTCGTTTTA TGTGTTAACG GCTTACCTAT AATAATCATT
GAAAAAACAA GCGCAGTTGA ACATGGATTT ATGAGGTTAA AGCGCTACAT GGAAGATGTC
CCTATTTTTG CTATTTGTGA GTTGTTCTCA GTGATTATAG ATGGAATTAC AGTTAGATAT
GCTATGATTA GCGATTTTGC TGAGCGGAAT GAATATGAAA TTAGCTCATT TGATACATTC
GTCAAGGAAA TGCTCACACC GATATCGCTT TTAGAACTTA TAAAAGATTA CACTATTTTT
TTAGACAATA ATGGAGTGTC ACGGAAATTT ATCTTTAAGG ATTACCAGCG TAGAGCTATT
TCGAAAACAA TACATAAACT TATCACAACC ACTGAAAAGA GAGTCGGGAT AATTAATCAG
CAATTTGGAA CGGGACAAAC CCTAACGTTA TTACATTTAG CGAGAAATAT CCTCACTTCA
AGAAGTTTGA ATAGTCCAAA AGTGTTAATT GTTACGGATA GGATTTCTAA AAGTGAGCAA
TTATTACATG TATTTCACTC AACATACGGT TTAATAGCGC AAATAGCCAA GTCGAGTGAT
GATTTGTATG CTCTTCTTCA TAATGAATCT GTTCCTATAA TCTTTACTAC GGTTCAGAAA
TTCCGTGAAG AATATAAAAT TGATGAAGAG ATCATTGTAA TTGTTGATAA TTGTCATCGA
ACACAGACTG GAAGCTTTGC TTTAAATATG AGAAATGCTA TCCCTAATGC AAGAATAATC
GGCATCACTT CTTTAACTGT TACTAAAAAA ATAAAGCAGC TGTTTGGCGA GGTTATCGAG
CAGTTTACGA TGGAAGATGC GATACAAAGA GGGTTAATGT TACCAGTATA TTATGAGAAT
AGAGTTGATT TTAAAAGTAA CTTAATTGTT TTAGATACGG ATGAGGAAAC ATCTCACTTA
GAAGTTAAGA AAAAAGCTTT TGATATCATT AACCATTTCA ATTCTCACTG TCAAAGTAAT
AGTTCAAAAG GTTTGCTGAT AACAAAAAAT AGGTTAGACG CAATCCGCTA TAAAGCAATT
TTTGATGAAA TTAACCATAT AGAAGCGGCC TTATATATTT CTGTACATCA ACATGATTCT
CAAGAAATAA AAAGAATTCT AGAAAATAAA AATGAGGTTT TGCAACGTTT TTGTAACGAG
GAAGATAATT TGAAAATACT TATTACTTGT TCCGCTATTC CAGCGTCATT GCCTTTTGTA
CAAATCGCTT ATCTTGATAA GTTCATATCT GAGAACTTAA TGCAAGAAAT TATTGGTTTA
TTAGGGATGA GATATAAAGA TAAACAATAT GGCTTAATTG TTGATTATGA TGGGAGAAAT
AACGTAACAA TAATACAAAA AATAAAATGC ATAGATTCTG TCGATACTAA GGGGGGAAAA
CAATGGGAGT ATTGA
 
Protein sequence
MFVEHNLNLK SILTEQGFNY YVSKDMETNL SVEGILTEEF VNSITKINPF INKSEAFSLM 
KEKFSGTSQS VFTNKTFIKT LQEGLPKKIN GKIKNIRFID LENLNNNQFS VIEEYRFEKN
GRTTIVDFVL CVNGLPIIII EKTSAVEHGF MRLKRYMEDV PIFAICELFS VIIDGITVRY
AMISDFAERN EYEISSFDTF VKEMLTPISL LELIKDYTIF LDNNGVSRKF IFKDYQRRAI
SKTIHKLITT TEKRVGIINQ QFGTGQTLTL LHLARNILTS RSLNSPKVLI VTDRISKSEQ
LLHVFHSTYG LIAQIAKSSD DLYALLHNES VPIIFTTVQK FREEYKIDEE IIVIVDNCHR
TQTGSFALNM RNAIPNARII GITSLTVTKK IKQLFGEVIE QFTMEDAIQR GLMLPVYYEN
RVDFKSNLIV LDTDEETSHL EVKKKAFDII NHFNSHCQSN SSKGLLITKN RLDAIRYKAI
FDEINHIEAA LYISVHQHDS QEIKRILENK NEVLQRFCNE EDNLKILITC SAIPASLPFV
QIAYLDKFIS ENLMQEIIGL LGMRYKDKQY GLIVDYDGRN NVTIIQKIKC IDSVDTKGGK
QWEY