Gene GWCH70_3440 details


Gene Information

Locus tag: GWCH70_3440
Symbol:
ID: 7979511
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012794
Strand:
Start bp: 2883
End bp: 4772
Gene Length: 1890 bp
Protein Length: 629 aa
Translation table: 11
GC content: 32%
IMG OID: 644800202
Product: N-6 DNA methylase
Protein accession: YP_002951341
Protein GI: 239828718
COG category: [V] Defense mechanisms
COG ID: [COG0286] Type I restriction-modification system methyltransferase subunit
TIGRFAM ID:
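
The coordinates and lengths listed above are consistent with one another. A minimal sketch of that check follows, assuming the start and end positions are 1-based and inclusive and that the CDS length includes the stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(2883, 4772)
print(length, protein_length_aa(length))  # prints "1890 629"
```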


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.000836739
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGTAATAG AAAAAATGAT GTTTGAACAT ATTGATTTAT TAAGAGGAGA AACAGAAACT 
GTAAATTACA AATTAATTTT ATTACCTGTT TATAGTTTGA AGTTTTTGGA AGAGAAAAAT
CTTATACCTG ATGAAATGAG AATAAAAGAA ATTTTAAATC ACGAAAGCAA CATCGCTGAT
CAATTACAAA GATCGTTTCA ATATGTCGAA GATAATTTTC ATGCATTAAA AGGTGTTTAT
ACAATTTTTC CTGAAAATGT AGTAAGTAAT CGTACTTTGT TCCAGTTGTT GCTCAAAATA
AATGCAATGA CTTTATCTGT TAAAGAATGG GCGGAGTTGG CAGAGGAATT ATTATATCAT
TCGTATGAAT GGGAAGGTGT TCGAGGTGGA GAAAATTATT CTCCAAAGAG TATTAATCAA
TTAGGTATTG AATTGCTTAA TCCAATTTCA GGTACTTTCT ATGATGGAAC TGCTGGTTTT
GGAGGAACAT TGGTTTCAGC CCTTGAATAC TCAAAACAAA ATAACGGGGA ACTCAAATTA
TATGGTCAAG AAATTGATCA TACCAGTTGG GCATTGGCAA AATTAAACTT ATTGTTGCAT
GACAAATTAG ATGCAGAACT AATACAAGGC GACGCATTGT TGAATCCGGC TTTTATTGAT
GGAGATCGCT TAAAGAAATT CAATTTTATT ATGATGGATT TTCCATGGGT AGAGTTGAGA
AATCATTATG AAACGTTAAA GCAAGATAAA TATAATCGTT TTATTTATGG GATACCACCG
AGAAGAAGTG CTGATTTTGC ATTCATTATG CACACATTAG CTTCGTTGGA AAGTGACGGT
AAGGCAGTAT TAGTAGTCCC GGGTAGAACA TTATTTGCAA GCGGAATGGA ACAGTCTATT
CGTCAAAATC TTATTGCGGC TGATGTGATT GAAGCAGTAA TAGCTTTGCC TGCCGGATTA
TATAAGCACA CGGGAATTCA AACTAATCTC CTAATCTTAA ATAAGAATAA GTCTTTAGAC
CGAAAAGGAA GAATCTTATT TATTAATGCG GAAAATGAAT TTCAAACCAA ACAAAGATAT
TTGAAGGTTT TAACTAAAGA CAATATAGAT AAAATCATTA GCACCTACCG AAATGGATTA
GAAATAGAAC AGTTTAGTAA ATTTGTTTCT TCAAATGAAA TTGAAGAGGC CAATCTATTT
TACAAAAAAT ATTTAACGGA AAAAGTAATT GATACTGATT TTTTTGGAAA AGTGCAGTTC
GTTAAAGAAA GTAAAGAATA TAGCTTGAAT ATTTATCCTT TAAAAAAATT GACTGAGAAG
ATATTCAGAG GAATGAATGT TTCTTCAAAT TCTATAGAAG AAGGAACGGG AGAATTTAAA
CTTGTTAAAT TATCAGACGT ACAAGACGGA GAAATATTAC TTGACGATTT AAGCAGCATT
AGATTTAAAA GGAACAGCAG AATTGATATG TATCTCTTAC GTAAAGGGGA TGTGATTGTT
TCTAACCGTG GTACAACGAT CAAAGTTGCT GTTGTTCCTG AAAACGAAGG AAATTTAATA
TTATCTCATA ATTTCTTGGG TATTCGATGT AAAGATGACA TCGATCCATA TTATTTAAAA
GCGTATCTGG AAAGTCCTGT TGGGATGTAT TACTTAATAA ACAGTCAAGT TGGAACGAAC
ATATTAACAA TAAACCCAAA AGACTTAAAA GAAATTCCTG TAAAACTTAC ATCCTTGGAT
GAGCAGAGAA AAATTGCAAA CGAAATAAGG GAAGCTGTAA TCACTTATAA AGAAAAAATA
CGTCAAGCAG AACAAGAAAG AAATGCTAGT TTACTAAAAG CCTATGAAAA AATGGGGATC
AGCAGTTTGT TTAAAATAAT AGAACAATAA
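
The GC content reported in the gene information (32%) can be recomputed from the sequence listing. The sketch below is one way to do it, assuming the sequence is pasted in the 10-base block layout used here; `gc_content` is an illustrative helper, not a database function. It removes the spacing and line breaks before counting.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a DNA string.

    Whitespace (block spacing, line breaks) is ignored and case is normalized.
    Returns 0.0 for an empty sequence.
    """
    dna = "".join(seq.split()).upper()
    if not dna:
        return 0.0
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


# Usage: the first line of the gene sequence, kept in its block layout.
first_line = "TTGGTAATAG AAAAAATGAT GTTTGAACAT ATTGATTTAT TAAGAGGAGA AACAGAAACT"
print(f"{gc_content(first_line):.1%}")
```

Passing the full listing instead of a single line gives the value for the whole gene.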
 
Protein sequence
MVIEKMMFEH IDLLRGETET VNYKLILLPV YSLKFLEEKN LIPDEMRIKE ILNHESNIAD 
QLQRSFQYVE DNFHALKGVY TIFPENVVSN RTLFQLLLKI NAMTLSVKEW AELAEELLYH
SYEWEGVRGG ENYSPKSINQ LGIELLNPIS GTFYDGTAGF GGTLVSALEY SKQNNGELKL
YGQEIDHTSW ALAKLNLLLH DKLDAELIQG DALLNPAFID GDRLKKFNFI MMDFPWVELR
NHYETLKQDK YNRFIYGIPP RRSADFAFIM HTLASLESDG KAVLVVPGRT LFASGMEQSI
RQNLIAADVI EAVIALPAGL YKHTGIQTNL LILNKNKSLD RKGRILFINA ENEFQTKQRY
LKVLTKDNID KIISTYRNGL EIEQFSKFVS SNEIEEANLF YKKYLTEKVI DTDFFGKVQF
VKESKEYSLN IYPLKKLTEK IFRGMNVSSN SIEEGTGEFK LVKLSDVQDG EILLDDLSSI
RFKRNSRIDM YLLRKGDVIV SNRGTTIKVA VVPENEGNLI LSHNFLGIRC KDDIDPYYLK
AYLESPVGMY YLINSQVGTN ILTINPKDLK EIPVKLTSLD EQRKIANEIR EAVITYKEKI
RQAEQERNAS LLKAYEKMGI SSLFKIIEQ
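
The protein sequence follows from the gene sequence under translation table 11 (the bacterial, archaeal and plant plastid code). The gene begins with TTG, which is an alternative start codon in that table and is read as methionine at the initiation position, which is why the protein begins with M. A minimal sketch of the translation follows, assuming an in-frame CDS with unambiguous bases; `translate_cds` and the constant names are illustrative.

```python
from itertools import product

# Codon order T, C, A, G at each position, matching the NCBI table layout.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
# Codons that table 11 accepts as initiators.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}

CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(seq: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon.

    Whitespace is ignored. An initiator codon in the first position is
    read as M. Ambiguous bases (e.g. N) raise KeyError in this sketch.
    """
    dna = "".join(seq.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# Usage: the first 30 bases of the gene sequence.
print(translate_cds("TTGGTAATAG AAAAAATGAT GTTTGAACAT"))  # prints "MVIEKMMFEH"
```

The printed residues match the first block of the protein sequence above.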