Gene NSE_0335 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNSE_0335 
SymbolmutS 
ID3931998 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNeorickettsia sennetsu str. Miyayama 
KingdomBacteria 
Replicon accessionNC_007798 
Strand
Start bp273806 
End bp276253 
Gene Length2448 bp 
Protein Length815 aa 
Translation table11 
GC content42% 
IMG OID637900491 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_506224 
Protein GI88608613 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.213975 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAGAAG AATTCCCGCC TGCGATGAAA CGGTATTTGG AAGTCCGATG CCAATACCCA 
GATGCGGTTG TTTTCTATAG AGTAGGTGAT TTCTACGAGA TGTTTTTCGA AGATGCACGC
GAGGTATCGC ATCTACTAGG GTTGCATCTT ACTCGAAGGG GTACGTACAA GGGGAAAGAT
ATTCCGATGT GTGGGGTACC AGTTTCTTCT TGTGAAGTTT ACATAAACAA ATTAGTAAAG
CTAGGTCGTA AGGTTGCTAT TTGTGAGCAA TTGGAAACAG CAGAGGAAGC AAAAAAACGT
GGTGCCACAG CTATAGTCAG AAGAGATGTA GTTCGGCTGG TTACTCCTGG GACACTTACT
GAGGATAATC TTCTAGTGAG TGGGGAGAAC AACTATTTAC TCTGTGTTGC TCCTGGGAAG
AATGAGATTG GTCTGGCATG GTTGGATATT TCTACAAAGA AGATTGTCTT CACAAGTGCC
AACCCAGCTT CTTTGGAAAG CTATCTCGCG AAAATTGAGC CCAAGGAGGT ATTACTTCCA
GATGCAATTG ATTCAGAACT GAGAAAAGTT ATAGAACAAC ACAACATCCA TATAACGAGA
CGTCCCAATA ACCTTTTTCA ATTTGATTAT GCCGCAAATG AATTGAGAGG GTTTTATAAT
GTTCTTCAAT TGGGTTTTAT GGATGCTAGA TCTCCATGCG AAATTGTTGC TTGTGGTGCT
CTGATTGCTT ATGCGCGTGC AACACAAATG GGGGAGCTAA AACGGTTAGA ATTTCCAAAA
CGATACGAGA AGGGCTACTA TCTTGCGCTT GATGCATCAA CTATTCGAGG TCTGGAGTTA
ATCGAATCGC AAACACCAGG TGAGAAGAAT AGTTTACTGC AAGTAATTGA CCAGACGTGT
ACAGCAGGAG GTAAAAGGCT TCTAAAGAGC TATATCGTTT CTCCGCTGAT ATCGGTCGAA
GAAATTCAAG CCCGTCAAGA CAAGGTAGAA TTTTTCTTTA TACAAGAAGA GTTGCGAAAA
AAGGTGCGTA CCGAACTTGC TAACATTCCA GATGCAGAGC GAGCACTGTC GCGCATTGCA
CTGAATCGTG GAGAACCAAT TGATTGTCTT GCTGTGCATT CCTGTATGAG GAGTTCGCTA
TTACTCGCTG AGTGTTTTTC TGCTTTCCTA GAGAATGGTT ATATTAGAAG CATATATGAC
AAATGTGCTC CAGATGATGA ATTGATGGAG ACTTTGCGCA CTGCGTTTTT ACCAACTTCT
AATAGAAAAG TGGATGGCCC GTTTCTAGAT CCTACACATC ATCCCAAACT TCTAGAGTTG
AATAGGTTAT CCACCAACGC TGATGTGGTG ATAAATGATT TGTTAAACAC ATACAAAAGG
AATACTGGGA TTAACTCTTT GAAATTGGGT AAGAACAACC TTATAGGCTA CTATGTTGAG
GTTCCTAAGT CCGCGCCGCT TCTTGATAGT GAAGTTTTTA TCCACAGACA ATCCTTGTTG
AACAATATAC GCTATACAAC TCTTGAGTTG CAGAATTTGG AGGCACAGAT AGCAAAAGCA
AACGAGAACT ACAGAAAGTT AGAATTGGAA CTTTTCAGGG AACTGTGTGG AAAAATTCTT
GCATCTGAGG GTCCACTGAA AGAAATGATC GCAGCAATAG CAGAACTGGA TGTTATAGCT
TCCTTTGCTG AGATTGCTGT TCAAAGAAAA TATGTGCGTC CACAGGTTGA TAATAGTAAC
GAACTGCGCA TTTCTGGGGG TAGACACCCA TTTGTAGAAC AGGTGAATGC ATTTGTGCCA
AATGATCTAG CTTTTACCTC CGCAGAGCGT GTGTGTGTTT TAACTGGGCC TAATATGGCT
GGAAAAAGTA CTTACCTGCG TCAAAATGCA TTGATAACTA TACTTGCTCA AATGGGTTCG
TTTGTGCCAG CTGATTCTGC TCACATCGGT GTTGTAGATA GGGTTTTTAG TCGCATTGGC
GCATCTGATA ATATTGCCAT GGGCAAGTCA ACGTTTATGG TGGAAATGAT GGAGACAGCA
AATATAGTTA ATAATGCGAC ATGCAGATCT CTCGTAATCT TAGATGAGGT TGGGAGAGGT
ACGTCTACTC TAGACGGTAT CTCAATCGCA CAAGCTGTTC TTGAATATTT GCATGACTCA
GTGAACTGTA AGACTATTTT TGCAACTCAT TACAACGAGC TTTGTGATCT GGAAAGTAAA
CTCCCACGGA TGAAATGTTA CTCAATTGAA GTAAAGCGCT GGCGAGATGA GGTTCTTCTA
ATGTATAAAA TTGTTCCTGG GCGAGGTGAT AATTCGTATG GAATACATAC AGCAATGCTT
TCTGGTATTC CAGAAGCGAT TATCCGTCGC GCAACCGAAA TAGCGAAGGA AAAGAATCTC
AGCATTGAGA ATTCCCTTTC TAATGAAAGG ATCAGAGTGA AACACTAG
 
Protein sequence
MSEEFPPAMK RYLEVRCQYP DAVVFYRVGD FYEMFFEDAR EVSHLLGLHL TRRGTYKGKD 
IPMCGVPVSS CEVYINKLVK LGRKVAICEQ LETAEEAKKR GATAIVRRDV VRLVTPGTLT
EDNLLVSGEN NYLLCVAPGK NEIGLAWLDI STKKIVFTSA NPASLESYLA KIEPKEVLLP
DAIDSELRKV IEQHNIHITR RPNNLFQFDY AANELRGFYN VLQLGFMDAR SPCEIVACGA
LIAYARATQM GELKRLEFPK RYEKGYYLAL DASTIRGLEL IESQTPGEKN SLLQVIDQTC
TAGGKRLLKS YIVSPLISVE EIQARQDKVE FFFIQEELRK KVRTELANIP DAERALSRIA
LNRGEPIDCL AVHSCMRSSL LLAECFSAFL ENGYIRSIYD KCAPDDELME TLRTAFLPTS
NRKVDGPFLD PTHHPKLLEL NRLSTNADVV INDLLNTYKR NTGINSLKLG KNNLIGYYVE
VPKSAPLLDS EVFIHRQSLL NNIRYTTLEL QNLEAQIAKA NENYRKLELE LFRELCGKIL
ASEGPLKEMI AAIAELDVIA SFAEIAVQRK YVRPQVDNSN ELRISGGRHP FVEQVNAFVP
NDLAFTSAER VCVLTGPNMA GKSTYLRQNA LITILAQMGS FVPADSAHIG VVDRVFSRIG
ASDNIAMGKS TFMVEMMETA NIVNNATCRS LVILDEVGRG TSTLDGISIA QAVLEYLHDS
VNCKTIFATH YNELCDLESK LPRMKCYSIE VKRWRDEVLL MYKIVPGRGD NSYGIHTAML
SGIPEAIIRR ATEIAKEKNL SIENSLSNER IRVKH