Gene Information |
Locus tag | HS_0005 |
Symbol | mutS |
ID | 4239512 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | - |
Start bp | 5598 |
End bp | 8168 |
Gene Length | 2571 bp |
Protein Length | 856 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 638103535 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_718210 |
Protein GI | 113460154 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
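The coordinate and length fields in the table above are related by simple arithmetic, sketched below. The sketch assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. The function names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information table.
START_BP = 5598
END_BP = 8168


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2571 856
```

Both results agree with the Gene Length (2571 bp) and Protein Length (856 aa) fields. The strand does not affect these lengths.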
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACACAT TCGAAAATCA CACGCCAATG ATGAAACAAT ATTTAAAAAT AAAAGCGGAA AATCCGGATG TTTTGTTGTT TTATCGCATG GGTGATTTTT ATGAGCTTTT TTATGATGAT GCAAAAAAAG CTGCTGAATT ATTAGATATT TCCCTAACTA AAAGAGGGCA ATCAGCTGGA CAACCGGTTC CAATGGCTGG CGTACCTTAT CATGCAATAG AAGGATATTT AGCAAAATTG GTACACCTTG GGGAAAGTGT AGCAATTTGT GAGCAAGTTG GTGAGCCTGT TATAGCTAAA GGTCCGGTTG AACGCCAAGT TGTACGTATT GTTACACCGG GAACAGTGAG TGATGAAGCA TTGTTACCGG AAAAGCAAGA CAATCTTATC GCTACGATTT ATCAAGAAAA AACACAATTT GGCTTAGCTG TTTTAGATAT GACTTCAGGG TGTTTTCAAA TTAGCGAACT ACAAGATGCC GCCAGTTTAC AAGCTGAGTT ACAACGTATT CAACCGGTTG AATTACTATA TTCGGAAGCG TTAGAGGATA AACATTTAAT TGAACAATTC AAAGGATTAC GTCGTCGTCC TCTTTGGGAG TTTGAATTAA GTACGGCAAT TCAATTACTT AATCGCCAAT TCGGTACAAA AGATTTGCGT GGTTTTGGTG TAGAAAAAGC GGTTTTAGGA TTATGTGCTG CGGGTTGTTT GTTGCAATAT GCAAAAGAAA CGCAACGTAC CGCACTTCCT CATATTCAAA GTATTAGCTT GTTACAAAAT AGCGATACTG TACAAATTGA TGCGTCAACA CGTAGAAATT TAGAACTAAC CCAAAATCTT GCTGGCGGAA CTGAAAATAC ACTTGCTGCA ATTTTAGATA AATGTGTTAC CCCGATGGGA AGTCGCTTAC TCAAACGTTG GATTCACCAG CCTATTCGCA ATATTGAAAA ATTGCAGTGT CGTCAACAAC ATATTCAAAT GCTGTTGCAG CAAAATTTAG TTGAAGAATT ACAACCTCTT TTACGTCAAG TTGGCGATAT GGAGCGTATT CTTGCTCGGG TTGCCCTGCG TTCCGCTCGA CCTCGAGATT TAACTCGGCT ACGAACAGCG TTGGAACAAA TACCTTTTAT TCAACACCAA TTAACAAAAA TACCGCACTT TGTTGCATTT TCACAACAAA TTGCTGATTT TTCTGTGCAA TTAGCACTTT TGCAGCGAGC GATTATTGAT AATCCCCCCT TACTTATTCG TGATGGCGGT GTCATTGCTG AAGGTTACAA CGAGGAGCTT GATGAATGGC GGAGTTTGTC TGAAGGAGCA ACACGCTACT TAAAGGATTT GGAGCAACGT GAGCGTGCAA ATACCGGTAT TGATACATTA AAAATCGGTT TTAATGCGGT GCATGGTTAT TATATTCAAA TCAGTCAAGG GCAAGCACAT AAAGCACCGC TTCATTATGT ACGTCGCCAA ACATTGAAAA ATGCAGAACG TTATATTATT CCTGAACTAA AAACCTATGA AGAAAAAGTT CTAAAAGCAA AAGGAGCGTC ACTTGCGTTA GAAAAACAAC TTTATGATGA AATTTTCGAT CAATTATTAC CGCACTTAGG TGATTTACAA CTAGCCAGTT TAACTTTGGC AGAACTTGAT GTTTTAACCA ATTTAGCGGA ACGGGCAGAA ACCTTAAATT ATGTTCAACC ACAATTTAGT ACGCAAATCG GTTTGCAAAT AATGCAGGGG CGTCATCCGG TTGTAGAGCA AGTATTAAAA GATCCCTTTA TTGCTAATCC CGTAGAACTT 
AATCAAAAGC GTCATTTGTT GATTATTACG GGACCGAATA TGGGGGGTAA AAGTACTTAT ATGCGACAAA CGGCACTGAT TACTTTGATG GCATATATTG GCAGTTTTGT ACCTGCAGAA AGTGCGGTGA TTGGACCTAT TGATCGAATT TTTACACGTA TTGGTGCCTC TGACGATCTT GCTTCCGGAC GTTCAACTTT TATGGTTGAA ATGACTGAAA TGGCAAATAT TTTGCATCAA GCGACGGAGC AAAGTTTGGT GCTTATTGAT GAAATTGGGC GAGGAACGTC AACCTATGAT GGACTTTCTC TTGCTTGGGC TTGTGCTGAA CAATTGGCTC AAAAAATTCG TAGTTTAACT TTATTTGCTA CTCATTACTT TGAACTGACG GTTTTACCGG AAAAAATTGA CGGTATCCAT AATGTTCATC TTGATGCCAT TGAGCATAAT GACAATATTG CATTTATGCA TTCGATACAA GAAGGCGCGG CAAGTAAAAG TTATGGTTTG GCTGTTGCTG CTTTAGCCGG TGTCCCACAA AATGTCATTA AATCGGCAAA ACAGAAATTA AAACAGCTTG AAACACTTTC TCAGCAAAAC AGTTGCCAAT CACAGTCCGT TTTGACACAA GTTCAAGGGG AATTAACTTT AATGGAAGAG GAGGAGAATA CAAGTGCGGT GATTGAAACG CTAAAAACGC TTGATCCTAA TGAGTTAAGT CCGAAGCAAG CACTTGAGTG TTTATATCAG TTAAAGAAAA TGTTGAATTA A
|
Protein sequence | MHTFENHTPM MKQYLKIKAE NPDVLLFYRM GDFYELFYDD AKKAAELLDI SLTKRGQSAG QPVPMAGVPY HAIEGYLAKL VHLGESVAIC EQVGEPVIAK GPVERQVVRI VTPGTVSDEA LLPEKQDNLI ATIYQEKTQF GLAVLDMTSG CFQISELQDA ASLQAELQRI QPVELLYSEA LEDKHLIEQF KGLRRRPLWE FELSTAIQLL NRQFGTKDLR GFGVEKAVLG LCAAGCLLQY AKETQRTALP HIQSISLLQN SDTVQIDAST RRNLELTQNL AGGTENTLAA ILDKCVTPMG SRLLKRWIHQ PIRNIEKLQC RQQHIQMLLQ QNLVEELQPL LRQVGDMERI LARVALRSAR PRDLTRLRTA LEQIPFIQHQ LTKIPHFVAF SQQIADFSVQ LALLQRAIID NPPLLIRDGG VIAEGYNEEL DEWRSLSEGA TRYLKDLEQR ERANTGIDTL KIGFNAVHGY YIQISQGQAH KAPLHYVRRQ TLKNAERYII PELKTYEEKV LKAKGASLAL EKQLYDEIFD QLLPHLGDLQ LASLTLAELD VLTNLAERAE TLNYVQPQFS TQIGLQIMQG RHPVVEQVLK DPFIANPVEL NQKRHLLIIT GPNMGGKSTY MRQTALITLM AYIGSFVPAE SAVIGPIDRI FTRIGASDDL ASGRSTFMVE MTEMANILHQ ATEQSLVLID EIGRGTSTYD GLSLAWACAE QLAQKIRSLT LFATHYFELT VLPEKIDGIH NVHLDAIEHN DNIAFMHSIQ EGAASKSYGL AVAALAGVPQ NVIKSAKQKL KQLETLSQQN SCQSQSVLTQ VQGELTLMEE EENTSAVIET LKTLDPNELS PKQALECLYQ LKKMLN
|
| |
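The sequence fields above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, compute a GC fraction, and translate codons. It makes three assumptions. First, the gene sequence is already in coding orientation (it begins with ATG even though the strand is "-"). Second, translation table 11 assigns the same amino acids to codons as the standard code. Third, the special handling of alternative start codons can be skipped because this gene starts with ATG. Only the first and last blocks of the gene sequence are used here, and all function names are illustrative.

```python
from itertools import product

# Standard amino-acid assignments in NCBI's TCAG codon order.
# Assumption: table 11 uses these assignments for all non-initiator codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace grouping and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three and last three blocks of the gene sequence shown above.
GENE_PREFIX = "ATGCACACAT TCGAAAATCA CACGCCAATG"
GENE_SUFFIX = "TTAAAGAAAA TGTTGAATTA A"

print(translate(GENE_PREFIX))  # MHTFENHTPM
print(translate(GENE_SUFFIX))  # LKKMLN
print(round(gc_fraction(GENE_PREFIX), 3))
```

The two translated fragments match the start and end of the protein sequence field. The GC fraction printed here covers only the 30-base prefix, so it is not expected to equal the 39% reported for the whole gene. Applying `gc_fraction` to the full gene sequence would be the way to check that field.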