Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2979 |
Symbol | mutH |
ID | 6145783 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3058294 |
End bp | 3058983 |
Gene Length | 690 bp |
Protein Length | 229 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641617848 |
Product | DNA mismatch repair protein |
Protein accession | YP_001745000 |
Protein GI | 170680941 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3066] DNA mismatch repair protein |
TIGRFAM ID | [TIGR02248] DNA mismatch repair endonuclease MutH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000000162046 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.00389286 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGTCCCAAC CTCGCCCACT GCTCTCTCCT CCCGAAACTG AAGAACAATT GTTAGCGCAA GCACAACAAC TTTCTGGTTA TACATTGGGA GAACTGGCGG CACTTGCTGG GCTGGTGACG CCGGAGAATT TAAAACGCGA TAAGGGCTGG ATTGGCGTGT TACTGGAGAT CTGGCTAGGT GCCAGCGCAG GGAGTAAACC TGAGCAAGAT TTTGCTGCTC TGGGCGTGGA ACTTAAAACT ATCCCTGTGG ATAGTCTTGG TCGTCCGCTG GAAACAACAT TCGTTTGTGT TGCCCCGTTA ACGGGTAATA GCGGGGTGAC CTGGGAAACC AGCCACGTGC GCCACAAGCT CAAACGCGTA CTGTGGATAC CGGTTGAAGG CGAGCGCAGC ATCCCGCTGG CGCAGCGTCG CGTAGGATCA CCGTTACTGT GGAGCCCGAA TGAAGAGGAA GACCGGCAAC TGCGCGAAGA CTGGGAAGAA TTAATGGATA TGATTGTTCT CGGTCAGGTT GAGCGGATCA CCGCTCGTCA CGGGGAATAT TTACAGATAC GACCGAAAGC AGCGAATGCG AAAGCGCTTA CCGAAGCCAT TGGTGTCCGG GGCGAACGGA TTCTGACGCT GCCGCGCGGC TTTTATTTGA AGAAGAATTT CACCAGTGCG CTACTGGCCC GTCATTTTCT GATCCAGTAG
|
Protein sequence | MSQPRPLLSP PETEEQLLAQ AQQLSGYTLG ELAALAGLVT PENLKRDKGW IGVLLEIWLG ASAGSKPEQD FAALGVELKT IPVDSLGRPL ETTFVCVAPL TGNSGVTWET SHVRHKLKRV LWIPVEGERS IPLAQRRVGS PLLWSPNEEE DRQLREDWEE LMDMIVLGQV ERITARHGEY LQIRPKAANA KALTEAIGVR GERILTLPRG FYLKKNFTSA LLARHFLIQ
|
| |