Gene GWCH70_3443 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_3443 
Symbol 
ID7979514 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012794 
Strand
Start bp6885 
End bp8222 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content28% 
IMG OID644800205 
Productrestriction modification system DNA specificity domain protein 
Protein accessionYP_002951344 
Protein GI239828721 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAGAA AGATGAAGGA TAGTGGGGTT GAGTGGATTG GAGAAATACC TAGTGATTGG 
AAGATTTTAA GGTTAAAGAA TGTACTTAAA GAAAGAAATG AAAAAAATAG TCCAATAAAA
ACAAATGAGA TTTTGTCTTT AACGATAGAA AAAGGAGTTA TTCCATATAA AGAAAAAAAA
TCTGGTGGTA ATAAAGCTAA AGAAGATCTA TCAAATTATA AATTAGCTTA TCCAAACGAT
ATTGTATTAA ATAGTATGAA TGTAATAGTA GGGGCGGTTG GTATATCAAA ATATTATGGA
TGTGTAAGTC CTGTTTATTA TGTATTATAT TCTGATGATG TAGAACAAAA TATTAGATTT
TATAATTACT TATTCCAATC ATCTGCTTTT CAAAAGAGTT TAATTGGTTT AGGAAACGGT
ATTATGATGA AACAGTCGAG TACAGGAAAA TTAAACACAA TAAGGTTGAG AATACCTTTA
GATAGATTAA AAAATGTATA TCTTCCTGTA CCGCCAGTTT CAGTCCAACA AAAAATCGTC
AATTTCCTAG ATGAAAAAGT GTCACATATT GATACGATTA TCGAAAAAAA CAAACAGTCT
ATTGAGGAAT TAAAAAAGTA TAAACAATCG TTAATCGCGG AAACGGTCAC AAAAGGGTTA
GATCCTAATG TAGAGATGAA AGATAGTGGG ATTGAGTGGG TTGGGGAGAT ACCAAAGCAT
TGGGAAATTA GAAGATTAAG AGATATATCA ATTATTACTA GAGGAACAGT TGATAAAAGC
AAAGAGAAAA ATGAAATACC TGTTTATTTA GTACAATATA CGAATGTTTA TTATAAAAGA
GAACAAAAAA TAAATGATGA TGATTATTTA CCAATTACTG TTTCAGAAAA TGAATATAAA
AAATATAAAG TAAGAAAGGG AGATATATTA TTAACAGCAA GTTCAGAAAC AAAAGATGAT
ATAGGTCATA GTACGGTAAT AGTTGAAGAT TTACCGAATC ATGTTTTTGG ATCAGACATA
ATTAGAATCA GAATACCTAA TAAAATAGTT GATCTAAATT ATAAAAAGTA TTTTATGGAG
AATTATTATT ATTTAGCAAA ATTTGATAAA TTATCTAGGG GTATAACACG GTTTAGATTT
GGTATGGATC AATTTAAATC ATTAAAATAT GTTATTCCTC CTATTGAAGA GCAGGTCAAA
ATTGCAAAAT ATCTTGATAA TATTACTAAT CATATCAATC AATTAATTTG TAATAAGGAA
AAATTAATAA ACGAACTGGA ATCCTACAAA AAATCCCTCA TCTACGAATA TGTCACAGGC
AAAAAGGAGG TTATGTAA
 
Protein sequence
MSRKMKDSGV EWIGEIPSDW KILRLKNVLK ERNEKNSPIK TNEILSLTIE KGVIPYKEKK 
SGGNKAKEDL SNYKLAYPND IVLNSMNVIV GAVGISKYYG CVSPVYYVLY SDDVEQNIRF
YNYLFQSSAF QKSLIGLGNG IMMKQSSTGK LNTIRLRIPL DRLKNVYLPV PPVSVQQKIV
NFLDEKVSHI DTIIEKNKQS IEELKKYKQS LIAETVTKGL DPNVEMKDSG IEWVGEIPKH
WEIRRLRDIS IITRGTVDKS KEKNEIPVYL VQYTNVYYKR EQKINDDDYL PITVSENEYK
KYKVRKGDIL LTASSETKDD IGHSTVIVED LPNHVFGSDI IRIRIPNKIV DLNYKKYFME
NYYYLAKFDK LSRGITRFRF GMDQFKSLKY VIPPIEEQVK IAKYLDNITN HINQLICNKE
KLINELESYK KSLIYEYVTG KKEVM