Gene GWCH70_1643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_1643 
Symbol 
ID7976360 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp1721130 
End bp1722416 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content34% 
IMG OID644798524 
Productrestriction modification system DNA specificity domain protein 
Protein accessionYP_002949696 
Protein GI239827072 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones46 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAGTTACA GTGAATGGAA AACGTATTCT CTTAAAGACA TCTGCACAGA TATTTCTTAC 
GGATATACAG CTAGTGCAAA AGAGGAAAAA GTAGGGCCGA AATTTCTAAG AATTACAGAT
TTAAGAAATG AGTTTATTGA TTGGGAATCG GTCCCTTATT GTTCCATAAA TGAAAAGGAC
TATAAAAAAT ACAAGTTGGA AATTGGAGAT TTATGTATAG CCCGTACCGG AGCAACGACC
GGAATAAACA CAGTAATTGA GGAGGATGTA GATGCTGTTT TTGCTTCTTA TTTGGTTAGA
TTTAAGTTGA ACAAAGAAAT AGTTGATCCT ACATTTATAA AGTATATCTT TAAATCTAAT
ATGTGGTATG GATATGTTAA TTCTATTATA AGCGGTTCAG CTCAACCCGG AGCAAACGCT
CAGCAAATGA GTAACTTTAA AATGAGTATT CCTGATCTGG ATGAACAAAA AAAGATAGCT
TCTGTTCTAT CTGTATTAGA TAAAAAAATC GTACTAAATA ATAAAATAAA CAAAACCCTT
GAAGAAATGG CTCAAGCAAT TTTCAAACGT TGGTTTGTTG ATTTCGAGTT TCCAAATGAA
AACGGTAAAC CTTATAAATC AAGCGGTGGG AAGTTTGTAG AAAGTGAGTC AGGGATGATA
CCAGAGGGGT GGAAAGAAGG GACTCTAGAT AACTTAGTTG TCATAAATAC TGCATCTGTC
GATCCCAAAG AAAATCCTGA GATTTTATAT GAACATTATA GTATACCTGC CTTTGACGAA
CAGAAATATC CCAAATTTGA ATATGGCAGA GAAATTAAGA GCAATAAATA TCTTGTCAGA
CCTAACTCTT TTCTTGTATC AAAGCTTAAT CCAACAACTA AAAGAGTATG GGATCCGCTA
TGTATTACTG AAAATGCAAT TTCTTCTACG GAATTTATTA ATTATCTACC AAAAGATATT
TCTTATCAAT CATATTTATA TTGTATGTTA AATTCAGAAA GATTCTCAGA ACATTTAATT
AAACATGCAA CAGGATCGAC AGGAAGCAGA CAGAGAGTAA AACCTGCAGA AACATTAACC
TTCAATGTTA TTTTACCGGA TACAGAAACA CTAAAAAAAT TTGATAACCT TATAAGACCG
ATAAGAGAGA AATTAAAAAT AAACCAAATT AATAGCGCGG TTTTAAAGGA TGTTAGAGAT
ATCCTTCTCC CAAAACTGAT GTCCGGTGAA ATTCGCGTTC CTGATGCCGA GCGGGAGGTG
GAGGAATGTT TACAGAAGAG CAATTAG
 
Protein sequence
MSYSEWKTYS LKDICTDISY GYTASAKEEK VGPKFLRITD LRNEFIDWES VPYCSINEKD 
YKKYKLEIGD LCIARTGATT GINTVIEEDV DAVFASYLVR FKLNKEIVDP TFIKYIFKSN
MWYGYVNSII SGSAQPGANA QQMSNFKMSI PDLDEQKKIA SVLSVLDKKI VLNNKINKTL
EEMAQAIFKR WFVDFEFPNE NGKPYKSSGG KFVESESGMI PEGWKEGTLD NLVVINTASV
DPKENPEILY EHYSIPAFDE QKYPKFEYGR EIKSNKYLVR PNSFLVSKLN PTTKRVWDPL
CITENAISST EFINYLPKDI SYQSYLYCML NSERFSEHLI KHATGSTGSR QRVKPAETLT
FNVILPDTET LKKFDNLIRP IREKLKINQI NSAVLKDVRD ILLPKLMSGE IRVPDAEREV
EECLQKSN