Gene ECH_0824 details

Gene Information

Locus tag: ECH_0824
Symbol: mutS
ID: 3926941
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ehrlichia chaffeensis str. Arkansas
Kingdom: Bacteria
Replicon accession: NC_007799
Strand:
Start bp: 838813
End bp: 841227
Gene Length: 2415 bp
Protein Length: 804 aa
Translation table: 11
GC content: 31%
IMG OID: 637901941
Product: DNA mismatch repair protein MutS
Protein accession: YP_507620
Protein GI: 88658581
COG category: [L] Replication, recombination and repair
COG ID: [COG0249] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01070] DNA mismatch repair protein MutS
            [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
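
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that Gene Length counts the terminal stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the record above.
START_BP = 838813
END_BP = 841227

def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1

def protein_length(gene_len_bp):
    """Residue count, assuming the CDS length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1

gene_len = gene_length(START_BP, END_BP)
prot_len = protein_length(gene_len)
print(gene_len, prot_len)  # 2415 804
```

Both results match the Gene Length and Protein Length fields listed in the record.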


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAATCATG ATAGCAAGAT AACCCCTATA ATGCAGCAGT ACATGATGCT GAAGAGTCAA 
TATAAGGAGT ATTTATTATT TTATAGATTA GGTGATTTTT ATGAGTTATT TTTTGATGAT
GCAATAGAGA CGTCTAGAAT ATTAAATATT GTATTAACTA AAAAGGGGAA TGTACCTATG
TGTGGTGTTC CCTTTCATAG TAGTGAATCT TATTTAAATA GATTGGTAAA ATTAGGTTAT
AAGATAGCAA TTTGTGAGCA GTTAGAAACG TCAGAAGAAG CTAAGAAAAG AGGATATAAG
GCTTTAGTAA AACGTGATGT TGTAAGAATA GTTACTCCAG GGACTATATT AGAGGATTCT
TTACTTGAAG CAAAAGAGAA TAATTATTTA TCTTGTATAG TTAATGTTGA CCATAATTAT
GCTATTGCAT GGTTGGAATT GTCTACTGGG TTATTTTATT ATCATACAAC AGAATTGCAT
AAGCTTGATA GTGATTTGTT CAGAATTAAT CCCAAGGAAG TTTTGATTTC TGATAAGTTA
GTGGAATTGG ATTCTATATA TTCTATTTTA AGGAAATACA AATTTTCGGT GACACAATAT
TCGGGTAGTT TTTTTGATGT GAGTAGATCC TATAATACTT TGTGTAATGT TTATGGAATA
TCTACTTTAA AAGGATTAGG TGATTTAAAA AATGAAGAGA TAGCAGTATG TGGTTCTTTG
TTGGAATATG TTAAAGCTAC GCAAAAAGGG AATCTACCTC AGTTGGAATT TCCAAAAGCT
TATTCAAAGG GTGATTTTAT GTTTATAGAT GCAGCAGCAT TAAGGAACCT TGAGTTATTT
TGTACACAAT CTGGAGATTT AGAAGGATCC TTAATTTCTT CTATAGATTA TACTATTACA
GCATGTGGTG GAAGATTATT AAAACGATGT TTGTCAGCTC CTTTAGCATG TTCTCATGCA
ATAAATCGTA GGTTAGATAT TGTTGAGTTT TTTGTAAATG ATAGAACATT GTGTAGGGGT
GTTAGGGAAA CATTACGTGG TATTGCAGAT ATAGAGCGTA TTTTAACAAG AATTAAAGTT
GGTAAATGTT CACCTAAGGA TTTATATGCT CTGAAGTTAA CTTTGGACAA AATTTTTGTA
TTATTAGATT TATTGCATAA GTTTGATTCT AGTGTTGTAG GTGATTTTTG TTCAAGGTTG
GGTAAATATG ATGATTTGTG TAAAACGCTT GATGATGTGT TAATACCGAA TAATGTTAAT
AATGTTAAAG ATGGGGGATT TATTAATCCT GACTATGATG CACAATTGTC AGAATATATA
TATATTCAAA GTTATAGTAA TGATTTAATT CAAGAATTAC GGGATAAGTA CCGTAATATT
ACTAATATTC AAAGTTTAAA AATATTGTAT AACAATATTT TAGGTTATTA TGTTGAAGTT
TCATCAAGCT ATTTGATTAG TGATAAAGAC TTTATTCATA GGCAAACTCT AGCAAATAGT
ATTAGATATA CGACAAGTGA ATTAAAAGCA TTGGAAAGTA AAATAATTTC TGCTAGGGAT
GCAGCGATTA ATTTGGAAGT AAAAATTTTT GGTCAATTAT GTACATGTAT TATTGAAGTT
GCAGATAAAA TCACTATGAC TGCACATGCT ATTGCTGAAA TTGATATGCT AACTTCTTTT
GCTGAGTTGG CAATACAATA TTCTTATACT AAACCTATAG TTGATGATAG TTATGAATTT
AACATAAAAA AAGGTAGGCA TCCTGTGGTT GAACGTAATG GGAAATTTGT AGCTAATGAT
ATTGACCTTT CATTAATGCA AAGAGTACAT TTAATCACTG GACCTAATAT GGCTGGTAAA
AGTACTTTCT TAAGACAGAA TGCATTGATA GGTATTTTAG CGCATATTGG ATCATTTGTT
CCTGCTCAAC ATGCTCATAT AGGAGTTATT GATAAAGTAT TTAGTAGAGT AGGGGCTTCT
GATAATATTG CATCTGGGCA TTCTACGTTT ATGGTAGAAA TGACAGAAAC TGCTGCAATA
ATCAATCAAG CCACAGATAA ATCTTTTGTA ATACTTGATG AAATTGGTAG GGGTACAGGA
ACATATGATG GATTATCAAT AGCATGGTCG GTTATTGAAC AAATTCATAA TGTTAACAAG
AGTAGAGCAA TTTTTGCAAC CCATTATCAT GAATTGTCAA AGTTAGATAG GTATTTAGAA
AATATAAAGT GTTTTTGTAT GAAAGTAGAA GAATGGAATG GAAAAGTAGT GTTCTTGCAT
GAAATTATAC CTGGATCAAC TAATAAATCT TATGGAATAC ATGTTGCAAA ATTAGCAGGA
TTCCCACAAT CAGTCCTAGA TAGGGCAGAA GATTTAATGA GTAAATTAAA AGCAAATGAG
GATTTATTAA CTTAG
 
Protein sequence
MNHDSKITPI MQQYMMLKSQ YKEYLLFYRL GDFYELFFDD AIETSRILNI VLTKKGNVPM 
CGVPFHSSES YLNRLVKLGY KIAICEQLET SEEAKKRGYK ALVKRDVVRI VTPGTILEDS
LLEAKENNYL SCIVNVDHNY AIAWLELSTG LFYYHTTELH KLDSDLFRIN PKEVLISDKL
VELDSIYSIL RKYKFSVTQY SGSFFDVSRS YNTLCNVYGI STLKGLGDLK NEEIAVCGSL
LEYVKATQKG NLPQLEFPKA YSKGDFMFID AAALRNLELF CTQSGDLEGS LISSIDYTIT
ACGGRLLKRC LSAPLACSHA INRRLDIVEF FVNDRTLCRG VRETLRGIAD IERILTRIKV
GKCSPKDLYA LKLTLDKIFV LLDLLHKFDS SVVGDFCSRL GKYDDLCKTL DDVLIPNNVN
NVKDGGFINP DYDAQLSEYI YIQSYSNDLI QELRDKYRNI TNIQSLKILY NNILGYYVEV
SSSYLISDKD FIHRQTLANS IRYTTSELKA LESKIISARD AAINLEVKIF GQLCTCIIEV
ADKITMTAHA IAEIDMLTSF AELAIQYSYT KPIVDDSYEF NIKKGRHPVV ERNGKFVAND
IDLSLMQRVH LITGPNMAGK STFLRQNALI GILAHIGSFV PAQHAHIGVI DKVFSRVGAS
DNIASGHSTF MVEMTETAAI INQATDKSFV ILDEIGRGTG TYDGLSIAWS VIEQIHNVNK
SRAIFATHYH ELSKLDRYLE NIKCFCMKVE EWNGKVVFLH EIIPGSTNKS YGIHVAKLAG
FPQSVLDRAE DLMSKLKANE DLLT
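
The sequences above are printed in blocks of 10 characters, so they need to be de-blocked before any analysis. The sketch below is a minimal, hedged example: the helper names (clean, codons, gc_fraction) are illustrative, and only the first and last lines of the gene sequence are pasted in. Note that the gene begins with GTG rather than ATG; under translation table 11, GTG can act as an initiation codon and is read as Met, which is consistent with the protein starting with M.

```python
def clean(seq_block):
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(seq_block.split()).upper()

def codons(seq):
    """Split a nucleotide string into complete in-frame triplets."""
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]

def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)

# First and last lines of the gene sequence, copied from the record.
first_line = "GTGAATCATG ATAGCAAGAT AACCCCTATA ATGCAGCAGT ACATGATGCT GAAGAGTCAA"
last_line = "GATTTATTAA CTTAG"

print(codons(clean(first_line))[:4])  # ['GTG', 'AAT', 'CAT', 'GAT']
print(codons(clean(last_line))[-1])   # TAG
```

Passing the full cleaned gene sequence to gc_fraction would give a value to compare against the GC content field.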