Gene Information |
Locus tag | GWCH70_1202 |
Symbol | |
ID | 7977674 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 1253949 |
End bp | 1256531 |
Gene Length | 2583 bp |
Protein Length | 860 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 644798154 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_002949327 |
Protein GI | 239826703 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
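
The coordinate and length fields above are consistent with each other, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the Start bp and End bp values are 1-based and inclusive and that the unspliced CDS ends in a single stop codon (the gene sequence in this record ends in TAA). The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon contributes one residue.
    return gene_len_bp // 3 - 1


length_bp = gene_length(1253949, 1256531)
print(length_bp)                  # 2583
print(protein_length(length_bp))  # 860
```

Both results match the Gene Length (2583 bp) and Protein Length (860 aa) rows of the record.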
| |
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000170445 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTACAT ATACGCCGAT GATTCAGCAA TACTTGGACA TTAAGGCACA ATATCCAGAT GCCTTTTTAT TTTTTCGCCT TGGCGATTTT TACGAAATGT TTTTTGACGA CGCGATCAAA GCGGCGCAGG AACTGGAAAT TACGCTGACA AGCCGTGATG GCGGTGGCGA AGAACGGGTG CCGATGTGCG GCGTCCCGTA TCATTCGGCG CAAGGATATA TTGAACAGTT GATTTCAAAA GGATATAAAG TCGCGATTTG CGAACAAGTC GAAGATCCAA AAACGGCAAA AGGGGTCGTC CGCCGCGAAG TCGTTCAGCT CATTACCCCT GGAACAGTGA TGGAAGGGAA AGGGCTGTTA GATAAAGAAA ACAACTACTT GGCGACGGTG ACCATGTTTG ATGATGGCAC GTATGGTTTT GCTTATACGG ATTTATCAAC AGGGGAAAAT CGCATAACTC TTCTTGCTTC GTTGGATGAT GTGATGAATG AGCTGTATGC GATCGGTACG AAAGAAATTG TCATTTCTTC TCAATTTCCA GAACAGTATC AGCAGCTATT AAAGGAACGC TATGATGTGA CGATTTCATA CGAGGATGAG ACAGTGATTC CTGAAGGATT TACGTCGATC GTCGAAGCGC TTCAGCAAGA TAAGCTAAAA ATAACATTCG GCCGTCTGCT TCATTACATT ATTCGCACAC AAAAACGGCG CCTCGATCAT ATGCAGTCTG TTCAAGTGTA TCAAGTCGAT CATTATATGA AAATCGATTT GTACTCGAAG CGAAATTTAG AATTAACCGA GACAATTCGC TCCAAAGGGC GGAAAGGTTC GCTGTTGTGG CTTCTTGATG AAACAGTGAC GGCAATGGGC GGACGGTTGC TGAAACAATG GCTTGATCGC CCGCTTTTGG ATCGCAAACA AATCGAACGG CGCTTGCACA TGGTCGAAAC ACTGATCCAT CATTATTTTG AACGGCAGGA GCTGCGCGAA CGTCTTCGCG AAGTGTACGA CGTCGAGCGC CTCGCTGGAC GTGTTGCCTA CGGAAACGTA AACGCACGCG ATTTAATTCA ACTGAAAAAA TCGCTTCAGC AAATCCCGGC GTTAAAAGAT ATTGTTGAAA AACTTCCGGA TCATGAAGCG AAGCAGCTTG CCAATAAACT TGATCCATGT TCGGAACTTG TCGATCTATT AGAGCGGTCG ATTCAAGAAA ATCCGCCATT GTCCGTCAAA GAAGGAAACA TCATTAAAGA CGGATATAAC GAAACGCTTG ATCGTTATCG TGATGCAAGC CGCAATGGGA AAGCATGGAT TGCCCAGCTA GAAAGCAAAG AACGGGAATT AACTGGGATT AAATCGCTAA AAATCGGCTA TAACCGCGTG TTCGGTTATT ACATTGAAGT GACGAAGCCA AATCTTCATT TGTTGCCAAA GGGACGTTAT GAGCGAAAAC AAACACTAGC AAACGCTGAA CGTTTTATTA CCCAGGAATT AAAAGAAAAA GAAGCGCTCA TTTTAGAAGC GGAAGAAAAA AGCATCGAAC TAGAATACGA ATTGTTTGTG GACATTCGCG AACGCGTAAA ACAATATATT CCGCGTTTGC AATCATTGGC GAAAACGATT AGCGAACTCG ATGTCTTGCA GTCGTTTGCA ACCGTAAGCG AAGAGCGTCA TTACGTAAAA CCGCAGTTTT CCGATAATCG TGAGCTGATC ATTCAAGCGG GCCGCCATCC AGTAGTGGAA AAAGTGCTTG GGGCGCAAAC GTATGTACCG AACGATTGTT ATATGAATAA AGAGCGGGAA 
CTGTTGTTAA TTACGGGACC GAATATGTCC GGAAAAAGCA CGTACATGCG GCAAATTGCC CTTACTGTCA TTATGGCGCA AATTGGCTGC TTTGTACCGG CAGAGAAAGC AGTCCTCCCA ATTTTTGACC AAGTGTTTAC GAGAATTGGT GCGGCGGATG ATTTAGTATC TGGGCAAAGT ACGTTTATGG TCGAAATGCT CGAAGCGCGC AATGCGATCG TTCACGCGAC ACAAAACAGC TTAATTTTGT TTGATGAAAT CGGACGCGGC ACGTCTACGT ATGATGGGAT GGCATTGGCG CAAGCGATCA TCGAATACAT TCATGATCAT ATTGGCGCGA AAACGTTATT TAGCACACAT TATCATGAAT TAACGGATCT GGAGCAATCG CTTGCCAAGC TGAAAAACGT TCATGTGAGA GCCGTTGAGG AAAATGGAAA AGTCGTGTTT CTTCATAAAA TTGAAGAAGG ACCAGCCGAC CAAAGTTACG GCATTCATGT CGCCGAGCTT GCTGAGCTTC CGGCTTCTCT CATTCAGCGC GCCAAAGAAA TTTTAGCCGA GCTTGAGCAG CAAGAACAGC GAAAAGAACA GCCAAGCGGC AAGAACGAGG CGGTCTTCGA ACAGCTCAGC ATGTTTGCCG AAGAGCAGCC TTCAAAAGAA GAATCCCATC TATCGAAAAA AGAGAAAAAG GCGCTTGAGG CATTAAAATC GGTCAATTTA TTGGAAACAA CGCCGCTTGA AGCGTTAAAC AAATTATACG AAATTCAAAA ACTATTAAAG TAA
|
Protein sequence | MATYTPMIQQ YLDIKAQYPD AFLFFRLGDF YEMFFDDAIK AAQELEITLT SRDGGGEERV PMCGVPYHSA QGYIEQLISK GYKVAICEQV EDPKTAKGVV RREVVQLITP GTVMEGKGLL DKENNYLATV TMFDDGTYGF AYTDLSTGEN RITLLASLDD VMNELYAIGT KEIVISSQFP EQYQQLLKER YDVTISYEDE TVIPEGFTSI VEALQQDKLK ITFGRLLHYI IRTQKRRLDH MQSVQVYQVD HYMKIDLYSK RNLELTETIR SKGRKGSLLW LLDETVTAMG GRLLKQWLDR PLLDRKQIER RLHMVETLIH HYFERQELRE RLREVYDVER LAGRVAYGNV NARDLIQLKK SLQQIPALKD IVEKLPDHEA KQLANKLDPC SELVDLLERS IQENPPLSVK EGNIIKDGYN ETLDRYRDAS RNGKAWIAQL ESKERELTGI KSLKIGYNRV FGYYIEVTKP NLHLLPKGRY ERKQTLANAE RFITQELKEK EALILEAEEK SIELEYELFV DIRERVKQYI PRLQSLAKTI SELDVLQSFA TVSEERHYVK PQFSDNRELI IQAGRHPVVE KVLGAQTYVP NDCYMNKERE LLLITGPNMS GKSTYMRQIA LTVIMAQIGC FVPAEKAVLP IFDQVFTRIG AADDLVSGQS TFMVEMLEAR NAIVHATQNS LILFDEIGRG TSTYDGMALA QAIIEYIHDH IGAKTLFSTH YHELTDLEQS LAKLKNVHVR AVEENGKVVF LHKIEEGPAD QSYGIHVAEL AELPASLIQR AKEILAELEQ QEQRKEQPSG KNEAVFEQLS MFAEEQPSKE ESHLSKKEKK ALEALKSVNL LETTPLEALN KLYEIQKLLK
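
The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content row. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example string is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two display blocks of the gene sequence from this record.
fragment = "ATGGCTACAT ATACGCCGAT"
print(clean_sequence(fragment))
print(gc_percent(fragment))
```

Passing the complete gene sequence to `gc_percent` gives a value that can be compared against the reported GC content, which the record rounds to a whole percent.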