Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH_0824 |
Symbol | mutS |
ID | 3926941 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | - |
Start bp | 838813 |
End bp | 841227 |
Gene Length | 2415 bp |
Protein Length | 804 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 637901941 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_507620 |
Protein GI | 88658581 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAATCATG ATAGCAAGAT AACCCCTATA ATGCAGCAGT ACATGATGCT GAAGAGTCAA TATAAGGAGT ATTTATTATT TTATAGATTA GGTGATTTTT ATGAGTTATT TTTTGATGAT GCAATAGAGA CGTCTAGAAT ATTAAATATT GTATTAACTA AAAAGGGGAA TGTACCTATG TGTGGTGTTC CCTTTCATAG TAGTGAATCT TATTTAAATA GATTGGTAAA ATTAGGTTAT AAGATAGCAA TTTGTGAGCA GTTAGAAACG TCAGAAGAAG CTAAGAAAAG AGGATATAAG GCTTTAGTAA AACGTGATGT TGTAAGAATA GTTACTCCAG GGACTATATT AGAGGATTCT TTACTTGAAG CAAAAGAGAA TAATTATTTA TCTTGTATAG TTAATGTTGA CCATAATTAT GCTATTGCAT GGTTGGAATT GTCTACTGGG TTATTTTATT ATCATACAAC AGAATTGCAT AAGCTTGATA GTGATTTGTT CAGAATTAAT CCCAAGGAAG TTTTGATTTC TGATAAGTTA GTGGAATTGG ATTCTATATA TTCTATTTTA AGGAAATACA AATTTTCGGT GACACAATAT TCGGGTAGTT TTTTTGATGT GAGTAGATCC TATAATACTT TGTGTAATGT TTATGGAATA TCTACTTTAA AAGGATTAGG TGATTTAAAA AATGAAGAGA TAGCAGTATG TGGTTCTTTG TTGGAATATG TTAAAGCTAC GCAAAAAGGG AATCTACCTC AGTTGGAATT TCCAAAAGCT TATTCAAAGG GTGATTTTAT GTTTATAGAT GCAGCAGCAT TAAGGAACCT TGAGTTATTT TGTACACAAT CTGGAGATTT AGAAGGATCC TTAATTTCTT CTATAGATTA TACTATTACA GCATGTGGTG GAAGATTATT AAAACGATGT TTGTCAGCTC CTTTAGCATG TTCTCATGCA ATAAATCGTA GGTTAGATAT TGTTGAGTTT TTTGTAAATG ATAGAACATT GTGTAGGGGT GTTAGGGAAA CATTACGTGG TATTGCAGAT ATAGAGCGTA TTTTAACAAG AATTAAAGTT GGTAAATGTT CACCTAAGGA TTTATATGCT CTGAAGTTAA CTTTGGACAA AATTTTTGTA TTATTAGATT TATTGCATAA GTTTGATTCT AGTGTTGTAG GTGATTTTTG TTCAAGGTTG GGTAAATATG ATGATTTGTG TAAAACGCTT GATGATGTGT TAATACCGAA TAATGTTAAT AATGTTAAAG ATGGGGGATT TATTAATCCT GACTATGATG CACAATTGTC AGAATATATA TATATTCAAA GTTATAGTAA TGATTTAATT CAAGAATTAC GGGATAAGTA CCGTAATATT ACTAATATTC AAAGTTTAAA AATATTGTAT AACAATATTT TAGGTTATTA TGTTGAAGTT TCATCAAGCT ATTTGATTAG TGATAAAGAC TTTATTCATA GGCAAACTCT AGCAAATAGT ATTAGATATA CGACAAGTGA ATTAAAAGCA TTGGAAAGTA AAATAATTTC TGCTAGGGAT GCAGCGATTA ATTTGGAAGT AAAAATTTTT GGTCAATTAT GTACATGTAT TATTGAAGTT GCAGATAAAA TCACTATGAC TGCACATGCT ATTGCTGAAA TTGATATGCT AACTTCTTTT GCTGAGTTGG CAATACAATA TTCTTATACT AAACCTATAG TTGATGATAG TTATGAATTT AACATAAAAA AAGGTAGGCA TCCTGTGGTT GAACGTAATG GGAAATTTGT AGCTAATGAT ATTGACCTTT CATTAATGCA AAGAGTACAT TTAATCACTG GACCTAATAT GGCTGGTAAA AGTACTTTCT TAAGACAGAA TGCATTGATA GGTATTTTAG CGCATATTGG ATCATTTGTT CCTGCTCAAC ATGCTCATAT AGGAGTTATT GATAAAGTAT TTAGTAGAGT AGGGGCTTCT GATAATATTG CATCTGGGCA TTCTACGTTT ATGGTAGAAA TGACAGAAAC TGCTGCAATA ATCAATCAAG CCACAGATAA ATCTTTTGTA ATACTTGATG AAATTGGTAG GGGTACAGGA ACATATGATG GATTATCAAT AGCATGGTCG GTTATTGAAC AAATTCATAA TGTTAACAAG AGTAGAGCAA TTTTTGCAAC CCATTATCAT GAATTGTCAA AGTTAGATAG GTATTTAGAA AATATAAAGT GTTTTTGTAT GAAAGTAGAA GAATGGAATG GAAAAGTAGT GTTCTTGCAT GAAATTATAC CTGGATCAAC TAATAAATCT TATGGAATAC ATGTTGCAAA ATTAGCAGGA TTCCCACAAT CAGTCCTAGA TAGGGCAGAA GATTTAATGA GTAAATTAAA AGCAAATGAG GATTTATTAA CTTAG
|
Protein sequence | MNHDSKITPI MQQYMMLKSQ YKEYLLFYRL GDFYELFFDD AIETSRILNI VLTKKGNVPM CGVPFHSSES YLNRLVKLGY KIAICEQLET SEEAKKRGYK ALVKRDVVRI VTPGTILEDS LLEAKENNYL SCIVNVDHNY AIAWLELSTG LFYYHTTELH KLDSDLFRIN PKEVLISDKL VELDSIYSIL RKYKFSVTQY SGSFFDVSRS YNTLCNVYGI STLKGLGDLK NEEIAVCGSL LEYVKATQKG NLPQLEFPKA YSKGDFMFID AAALRNLELF CTQSGDLEGS LISSIDYTIT ACGGRLLKRC LSAPLACSHA INRRLDIVEF FVNDRTLCRG VRETLRGIAD IERILTRIKV GKCSPKDLYA LKLTLDKIFV LLDLLHKFDS SVVGDFCSRL GKYDDLCKTL DDVLIPNNVN NVKDGGFINP DYDAQLSEYI YIQSYSNDLI QELRDKYRNI TNIQSLKILY NNILGYYVEV SSSYLISDKD FIHRQTLANS IRYTTSELKA LESKIISARD AAINLEVKIF GQLCTCIIEV ADKITMTAHA IAEIDMLTSF AELAIQYSYT KPIVDDSYEF NIKKGRHPVV ERNGKFVAND IDLSLMQRVH LITGPNMAGK STFLRQNALI GILAHIGSFV PAQHAHIGVI DKVFSRVGAS DNIASGHSTF MVEMTETAAI INQATDKSFV ILDEIGRGTG TYDGLSIAWS VIEQIHNVNK SRAIFATHYH ELSKLDRYLE NIKCFCMKVE EWNGKVVFLH EIIPGSTNKS YGIHVAKLAG FPQSVLDRAE DLMSKLKANE DLLT
|
| |