Gene GWCH70_1386 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_1386 
Symbol 
ID7978183 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp1455870 
End bp1457105 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content35% 
IMG OID644798309 
ProductMcrBC 5-methylcytosine restriction system component-like protein 
Protein accessionYP_002949482 
Protein GI239826858 
COG category[V] Defense mechanisms 
COG ID[COG4268] McrBC 5-methylcytosine restriction system component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCAGAAT ATGGTTCACT TAACATAAAG GATTATGTTT TAACAGAAAA AGAAAAAGAA 
TATTTGAATA ACATGATGAG GTTATCTGCT ACTCCTCCAT TCAAATGGTA CGAAATGAAA
AATGGACTTT ATTTTGAGTT TTCAAGTTGG ATAGGAGTTC TTGAGCTTGA AAGGGTGCGA
ATAGTTATTA AACCTAAATT TAATCGCGGA TTTCATGATG TTGTCGATAT GCTGTTGTTT
TGTGAGGATT TGCCTCTATC TTATGAGCAC GCAACGATGG CAGCGTATGA CTCTCATTCA
CTTATGGAGA TGATTGCACG GTTATTTGTA AAAGAAGTAG AAATGATTCT GAATAAAGGA
ATTGTGAAGG AATATATAGT CGAAGAAGAC AATCTTACTT GCCTTCGTGG GCGGGTAGAT
ATTCGGCAGC ATTTAAGAAC CAACTTTATG ACACCAACCA AAGTATATTG CCGATATGAT
GAGCTGGATA CGAATATATT AGAGAATCAA GTTATTCGAA TGGCCCTTGA GGTTGTGAAA
CATTTTTCCT TAACAAAACA AACAATGAGG CAGATTAACC GTCTTGCCGA TGAATTTATG
ATGATAGCCG ATCCATATTT TAGTTATGAG TGGCCGAACT TCTCTTATCA CCGATTAAAT
CAGCACTATG AAAAGGCTCA TAAGTTAGCT TATTATATTT GGAAACAAAT ATACGTAAAC
CAACTTTATC AATTTCAATA TCGAAGTCAT TATTCTTATT TGATAGATAT GAATGAGTTG
TTTGAAAAAT TTGTAGCTAA ATTATTAAAA AAATATTTAC CTGGCGCAGC AAAAGTACAT
GCGCAGCGGA GATTTAAGAA AGCCATTACG AAAAATGGAG ATGGTTATCA CGATATCATT
CTTGATTTGT TAGTAGAGTT TCCTGACAAA GATCCTATTG TCCTTGATAC AAAATACAAA
CAGTATAGCA AATATAAAGT GGAGAACGCA GATATTTATC AACTTGCCTT TTATGCACAG
TTTGTCACAA AGTCAAGTAA TCATTATAAG GCGATCATCG TTCATCCAGA ATATGCTGGG
GAAGACGCTT GCGAGGAAGT CATCGATCTA TTGCCTGGAA CCTTCCATCA AGGAAAATTA
TTTGTAAAAC CTGTATCTAT TGAAAAAGTG TTGGCTGCGG TTAAAAGAAA AGATATTGAG
TTTTTACAAA AACAAGCTGA AAAGTTAATA CTGTAA
 
Protein sequence
MAEYGSLNIK DYVLTEKEKE YLNNMMRLSA TPPFKWYEMK NGLYFEFSSW IGVLELERVR 
IVIKPKFNRG FHDVVDMLLF CEDLPLSYEH ATMAAYDSHS LMEMIARLFV KEVEMILNKG
IVKEYIVEED NLTCLRGRVD IRQHLRTNFM TPTKVYCRYD ELDTNILENQ VIRMALEVVK
HFSLTKQTMR QINRLADEFM MIADPYFSYE WPNFSYHRLN QHYEKAHKLA YYIWKQIYVN
QLYQFQYRSH YSYLIDMNEL FEKFVAKLLK KYLPGAAKVH AQRRFKKAIT KNGDGYHDII
LDLLVEFPDK DPIVLDTKYK QYSKYKVENA DIYQLAFYAQ FVTKSSNHYK AIIVHPEYAG
EDACEEVIDL LPGTFHQGKL FVKPVSIEKV LAAVKRKDIE FLQKQAEKLI L