Gene HS_0005 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_0005 
SymbolmutS 
ID4239512 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp5598 
End bp8168 
Gene Length2571 bp 
Protein Length856 aa 
Translation table11 
GC content39% 
IMG OID638103535 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_718210 
Protein GI113460154 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACACAT TCGAAAATCA CACGCCAATG ATGAAACAAT ATTTAAAAAT AAAAGCGGAA 
AATCCGGATG TTTTGTTGTT TTATCGCATG GGTGATTTTT ATGAGCTTTT TTATGATGAT
GCAAAAAAAG CTGCTGAATT ATTAGATATT TCCCTAACTA AAAGAGGGCA ATCAGCTGGA
CAACCGGTTC CAATGGCTGG CGTACCTTAT CATGCAATAG AAGGATATTT AGCAAAATTG
GTACACCTTG GGGAAAGTGT AGCAATTTGT GAGCAAGTTG GTGAGCCTGT TATAGCTAAA
GGTCCGGTTG AACGCCAAGT TGTACGTATT GTTACACCGG GAACAGTGAG TGATGAAGCA
TTGTTACCGG AAAAGCAAGA CAATCTTATC GCTACGATTT ATCAAGAAAA AACACAATTT
GGCTTAGCTG TTTTAGATAT GACTTCAGGG TGTTTTCAAA TTAGCGAACT ACAAGATGCC
GCCAGTTTAC AAGCTGAGTT ACAACGTATT CAACCGGTTG AATTACTATA TTCGGAAGCG
TTAGAGGATA AACATTTAAT TGAACAATTC AAAGGATTAC GTCGTCGTCC TCTTTGGGAG
TTTGAATTAA GTACGGCAAT TCAATTACTT AATCGCCAAT TCGGTACAAA AGATTTGCGT
GGTTTTGGTG TAGAAAAAGC GGTTTTAGGA TTATGTGCTG CGGGTTGTTT GTTGCAATAT
GCAAAAGAAA CGCAACGTAC CGCACTTCCT CATATTCAAA GTATTAGCTT GTTACAAAAT
AGCGATACTG TACAAATTGA TGCGTCAACA CGTAGAAATT TAGAACTAAC CCAAAATCTT
GCTGGCGGAA CTGAAAATAC ACTTGCTGCA ATTTTAGATA AATGTGTTAC CCCGATGGGA
AGTCGCTTAC TCAAACGTTG GATTCACCAG CCTATTCGCA ATATTGAAAA ATTGCAGTGT
CGTCAACAAC ATATTCAAAT GCTGTTGCAG CAAAATTTAG TTGAAGAATT ACAACCTCTT
TTACGTCAAG TTGGCGATAT GGAGCGTATT CTTGCTCGGG TTGCCCTGCG TTCCGCTCGA
CCTCGAGATT TAACTCGGCT ACGAACAGCG TTGGAACAAA TACCTTTTAT TCAACACCAA
TTAACAAAAA TACCGCACTT TGTTGCATTT TCACAACAAA TTGCTGATTT TTCTGTGCAA
TTAGCACTTT TGCAGCGAGC GATTATTGAT AATCCCCCCT TACTTATTCG TGATGGCGGT
GTCATTGCTG AAGGTTACAA CGAGGAGCTT GATGAATGGC GGAGTTTGTC TGAAGGAGCA
ACACGCTACT TAAAGGATTT GGAGCAACGT GAGCGTGCAA ATACCGGTAT TGATACATTA
AAAATCGGTT TTAATGCGGT GCATGGTTAT TATATTCAAA TCAGTCAAGG GCAAGCACAT
AAAGCACCGC TTCATTATGT ACGTCGCCAA ACATTGAAAA ATGCAGAACG TTATATTATT
CCTGAACTAA AAACCTATGA AGAAAAAGTT CTAAAAGCAA AAGGAGCGTC ACTTGCGTTA
GAAAAACAAC TTTATGATGA AATTTTCGAT CAATTATTAC CGCACTTAGG TGATTTACAA
CTAGCCAGTT TAACTTTGGC AGAACTTGAT GTTTTAACCA ATTTAGCGGA ACGGGCAGAA
ACCTTAAATT ATGTTCAACC ACAATTTAGT ACGCAAATCG GTTTGCAAAT AATGCAGGGG
CGTCATCCGG TTGTAGAGCA AGTATTAAAA GATCCCTTTA TTGCTAATCC CGTAGAACTT
AATCAAAAGC GTCATTTGTT GATTATTACG GGACCGAATA TGGGGGGTAA AAGTACTTAT
ATGCGACAAA CGGCACTGAT TACTTTGATG GCATATATTG GCAGTTTTGT ACCTGCAGAA
AGTGCGGTGA TTGGACCTAT TGATCGAATT TTTACACGTA TTGGTGCCTC TGACGATCTT
GCTTCCGGAC GTTCAACTTT TATGGTTGAA ATGACTGAAA TGGCAAATAT TTTGCATCAA
GCGACGGAGC AAAGTTTGGT GCTTATTGAT GAAATTGGGC GAGGAACGTC AACCTATGAT
GGACTTTCTC TTGCTTGGGC TTGTGCTGAA CAATTGGCTC AAAAAATTCG TAGTTTAACT
TTATTTGCTA CTCATTACTT TGAACTGACG GTTTTACCGG AAAAAATTGA CGGTATCCAT
AATGTTCATC TTGATGCCAT TGAGCATAAT GACAATATTG CATTTATGCA TTCGATACAA
GAAGGCGCGG CAAGTAAAAG TTATGGTTTG GCTGTTGCTG CTTTAGCCGG TGTCCCACAA
AATGTCATTA AATCGGCAAA ACAGAAATTA AAACAGCTTG AAACACTTTC TCAGCAAAAC
AGTTGCCAAT CACAGTCCGT TTTGACACAA GTTCAAGGGG AATTAACTTT AATGGAAGAG
GAGGAGAATA CAAGTGCGGT GATTGAAACG CTAAAAACGC TTGATCCTAA TGAGTTAAGT
CCGAAGCAAG CACTTGAGTG TTTATATCAG TTAAAGAAAA TGTTGAATTA A
 
Protein sequence
MHTFENHTPM MKQYLKIKAE NPDVLLFYRM GDFYELFYDD AKKAAELLDI SLTKRGQSAG 
QPVPMAGVPY HAIEGYLAKL VHLGESVAIC EQVGEPVIAK GPVERQVVRI VTPGTVSDEA
LLPEKQDNLI ATIYQEKTQF GLAVLDMTSG CFQISELQDA ASLQAELQRI QPVELLYSEA
LEDKHLIEQF KGLRRRPLWE FELSTAIQLL NRQFGTKDLR GFGVEKAVLG LCAAGCLLQY
AKETQRTALP HIQSISLLQN SDTVQIDAST RRNLELTQNL AGGTENTLAA ILDKCVTPMG
SRLLKRWIHQ PIRNIEKLQC RQQHIQMLLQ QNLVEELQPL LRQVGDMERI LARVALRSAR
PRDLTRLRTA LEQIPFIQHQ LTKIPHFVAF SQQIADFSVQ LALLQRAIID NPPLLIRDGG
VIAEGYNEEL DEWRSLSEGA TRYLKDLEQR ERANTGIDTL KIGFNAVHGY YIQISQGQAH
KAPLHYVRRQ TLKNAERYII PELKTYEEKV LKAKGASLAL EKQLYDEIFD QLLPHLGDLQ
LASLTLAELD VLTNLAERAE TLNYVQPQFS TQIGLQIMQG RHPVVEQVLK DPFIANPVEL
NQKRHLLIIT GPNMGGKSTY MRQTALITLM AYIGSFVPAE SAVIGPIDRI FTRIGASDDL
ASGRSTFMVE MTEMANILHQ ATEQSLVLID EIGRGTSTYD GLSLAWACAE QLAQKIRSLT
LFATHYFELT VLPEKIDGIH NVHLDAIEHN DNIAFMHSIQ EGAASKSYGL AVAALAGVPQ
NVIKSAKQKL KQLETLSQQN SCQSQSVLTQ VQGELTLMEE EENTSAVIET LKTLDPNELS
PKQALECLYQ LKKMLN