Gene GWCH70_1202 details


Gene Information

Locus tag: GWCH70_1202
Symbol:
ID: 7977674
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 1253949
End bp: 1256531
Gene Length: 2583 bp
Protein Length: 860 aa
Translation table: 11
GC content: 45%
IMG OID: 644798154
Product: DNA mismatch repair protein MutS
Protein accession: YP_002949327
Protein GI: 239826703
COG category: [L] Replication, recombination and repair
COG ID: [COG0249] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01070] DNA mismatch repair protein MutS
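
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
start_bp, end_bp = 1253949, 1256531
length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

Under these assumptions the computed values agree with the listed Gene Length (2583 bp) and Protein Length (860 aa).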


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000170445
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTACAT ATACGCCGAT GATTCAGCAA TACTTGGACA TTAAGGCACA ATATCCAGAT 
GCCTTTTTAT TTTTTCGCCT TGGCGATTTT TACGAAATGT TTTTTGACGA CGCGATCAAA
GCGGCGCAGG AACTGGAAAT TACGCTGACA AGCCGTGATG GCGGTGGCGA AGAACGGGTG
CCGATGTGCG GCGTCCCGTA TCATTCGGCG CAAGGATATA TTGAACAGTT GATTTCAAAA
GGATATAAAG TCGCGATTTG CGAACAAGTC GAAGATCCAA AAACGGCAAA AGGGGTCGTC
CGCCGCGAAG TCGTTCAGCT CATTACCCCT GGAACAGTGA TGGAAGGGAA AGGGCTGTTA
GATAAAGAAA ACAACTACTT GGCGACGGTG ACCATGTTTG ATGATGGCAC GTATGGTTTT
GCTTATACGG ATTTATCAAC AGGGGAAAAT CGCATAACTC TTCTTGCTTC GTTGGATGAT
GTGATGAATG AGCTGTATGC GATCGGTACG AAAGAAATTG TCATTTCTTC TCAATTTCCA
GAACAGTATC AGCAGCTATT AAAGGAACGC TATGATGTGA CGATTTCATA CGAGGATGAG
ACAGTGATTC CTGAAGGATT TACGTCGATC GTCGAAGCGC TTCAGCAAGA TAAGCTAAAA
ATAACATTCG GCCGTCTGCT TCATTACATT ATTCGCACAC AAAAACGGCG CCTCGATCAT
ATGCAGTCTG TTCAAGTGTA TCAAGTCGAT CATTATATGA AAATCGATTT GTACTCGAAG
CGAAATTTAG AATTAACCGA GACAATTCGC TCCAAAGGGC GGAAAGGTTC GCTGTTGTGG
CTTCTTGATG AAACAGTGAC GGCAATGGGC GGACGGTTGC TGAAACAATG GCTTGATCGC
CCGCTTTTGG ATCGCAAACA AATCGAACGG CGCTTGCACA TGGTCGAAAC ACTGATCCAT
CATTATTTTG AACGGCAGGA GCTGCGCGAA CGTCTTCGCG AAGTGTACGA CGTCGAGCGC
CTCGCTGGAC GTGTTGCCTA CGGAAACGTA AACGCACGCG ATTTAATTCA ACTGAAAAAA
TCGCTTCAGC AAATCCCGGC GTTAAAAGAT ATTGTTGAAA AACTTCCGGA TCATGAAGCG
AAGCAGCTTG CCAATAAACT TGATCCATGT TCGGAACTTG TCGATCTATT AGAGCGGTCG
ATTCAAGAAA ATCCGCCATT GTCCGTCAAA GAAGGAAACA TCATTAAAGA CGGATATAAC
GAAACGCTTG ATCGTTATCG TGATGCAAGC CGCAATGGGA AAGCATGGAT TGCCCAGCTA
GAAAGCAAAG AACGGGAATT AACTGGGATT AAATCGCTAA AAATCGGCTA TAACCGCGTG
TTCGGTTATT ACATTGAAGT GACGAAGCCA AATCTTCATT TGTTGCCAAA GGGACGTTAT
GAGCGAAAAC AAACACTAGC AAACGCTGAA CGTTTTATTA CCCAGGAATT AAAAGAAAAA
GAAGCGCTCA TTTTAGAAGC GGAAGAAAAA AGCATCGAAC TAGAATACGA ATTGTTTGTG
GACATTCGCG AACGCGTAAA ACAATATATT CCGCGTTTGC AATCATTGGC GAAAACGATT
AGCGAACTCG ATGTCTTGCA GTCGTTTGCA ACCGTAAGCG AAGAGCGTCA TTACGTAAAA
CCGCAGTTTT CCGATAATCG TGAGCTGATC ATTCAAGCGG GCCGCCATCC AGTAGTGGAA
AAAGTGCTTG GGGCGCAAAC GTATGTACCG AACGATTGTT ATATGAATAA AGAGCGGGAA
CTGTTGTTAA TTACGGGACC GAATATGTCC GGAAAAAGCA CGTACATGCG GCAAATTGCC
CTTACTGTCA TTATGGCGCA AATTGGCTGC TTTGTACCGG CAGAGAAAGC AGTCCTCCCA
ATTTTTGACC AAGTGTTTAC GAGAATTGGT GCGGCGGATG ATTTAGTATC TGGGCAAAGT
ACGTTTATGG TCGAAATGCT CGAAGCGCGC AATGCGATCG TTCACGCGAC ACAAAACAGC
TTAATTTTGT TTGATGAAAT CGGACGCGGC ACGTCTACGT ATGATGGGAT GGCATTGGCG
CAAGCGATCA TCGAATACAT TCATGATCAT ATTGGCGCGA AAACGTTATT TAGCACACAT
TATCATGAAT TAACGGATCT GGAGCAATCG CTTGCCAAGC TGAAAAACGT TCATGTGAGA
GCCGTTGAGG AAAATGGAAA AGTCGTGTTT CTTCATAAAA TTGAAGAAGG ACCAGCCGAC
CAAAGTTACG GCATTCATGT CGCCGAGCTT GCTGAGCTTC CGGCTTCTCT CATTCAGCGC
GCCAAAGAAA TTTTAGCCGA GCTTGAGCAG CAAGAACAGC GAAAAGAACA GCCAAGCGGC
AAGAACGAGG CGGTCTTCGA ACAGCTCAGC ATGTTTGCCG AAGAGCAGCC TTCAAAAGAA
GAATCCCATC TATCGAAAAA AGAGAAAAAG GCGCTTGAGG CATTAAAATC GGTCAATTTA
TTGGAAACAA CGCCGCTTGA AGCGTTAAAC AAATTATACG AAATTCAAAA ACTATTAAAG
TAA
 
Protein sequence
MATYTPMIQQ YLDIKAQYPD AFLFFRLGDF YEMFFDDAIK AAQELEITLT SRDGGGEERV 
PMCGVPYHSA QGYIEQLISK GYKVAICEQV EDPKTAKGVV RREVVQLITP GTVMEGKGLL
DKENNYLATV TMFDDGTYGF AYTDLSTGEN RITLLASLDD VMNELYAIGT KEIVISSQFP
EQYQQLLKER YDVTISYEDE TVIPEGFTSI VEALQQDKLK ITFGRLLHYI IRTQKRRLDH
MQSVQVYQVD HYMKIDLYSK RNLELTETIR SKGRKGSLLW LLDETVTAMG GRLLKQWLDR
PLLDRKQIER RLHMVETLIH HYFERQELRE RLREVYDVER LAGRVAYGNV NARDLIQLKK
SLQQIPALKD IVEKLPDHEA KQLANKLDPC SELVDLLERS IQENPPLSVK EGNIIKDGYN
ETLDRYRDAS RNGKAWIAQL ESKERELTGI KSLKIGYNRV FGYYIEVTKP NLHLLPKGRY
ERKQTLANAE RFITQELKEK EALILEAEEK SIELEYELFV DIRERVKQYI PRLQSLAKTI
SELDVLQSFA TVSEERHYVK PQFSDNRELI IQAGRHPVVE KVLGAQTYVP NDCYMNKERE
LLLITGPNMS GKSTYMRQIA LTVIMAQIGC FVPAEKAVLP IFDQVFTRIG AADDLVSGQS
TFMVEMLEAR NAIVHATQNS LILFDEIGRG TSTYDGMALA QAIIEYIHDH IGAKTLFSTH
YHELTDLEQS LAKLKNVHVR AVEENGKVVF LHKIEEGPAD QSYGIHVAEL AELPASLIQR
AKEILAELEQ QEQRKEQPSG KNEAVFEQLS MFAEEQPSKE ESHLSKKEKK ALEALKSVNL
LETTPLEALN KLYEIQKLLK
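
The sequences above are printed in blocks of 10 characters, so they need the whitespace removed before any analysis. The following is a minimal sketch, not the database's own tooling, of how the gene sequence could be cleaned, its GC content computed, and its translation checked against the protein sequence. It assumes translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in the permitted start codons, which does not matter here because the gene begins with ATG). All names are hypothetical.

```python
from itertools import product

# Codon table in NCBI base order (T, C, A, G); amino-acid assignments
# for translation table 11 match the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a blocked sequence."""
    return "".join(seq.split()).upper()


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence as printed above.
first_row = "ATGGCTACAT ATACGCCGAT GATTCAGCAA TACTTGGACA TTAAGGCACA ATATCCAGAT"
first_residues = translate(first_row)
print(first_residues)
```

Applied to the first row of the gene sequence, the sketch yields the first 20 residues of the protein sequence (MATYTPMIQQ YLDIKAQYPD). Passing the full gene sequence to `gc_content` and `translate` would allow the listed GC content and protein sequence to be checked the same way.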