Gene COXBURSA331_A0879 details

Gene Information

Locus tag: COXBURSA331_A0879
Symbol: mutS
ID: 5794719
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Coxiella burnetii RSA 331
Kingdom: Bacteria
Replicon accession: NC_010117
Strand:
Start bp: 781862
End bp: 784441
Gene Length: 2580 bp
Protein Length: 859 aa
Translation table: 11
GC content: 44%
IMG OID: 641330363
Product: DNA mismatch repair protein MutS
Protein accession: YP_001596673
Protein GI: 161831167
COG category: [L] Replication, recombination and repair
COG ID: [COG0249] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01070] DNA mismatch repair protein MutS
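
The coordinates and lengths in the record above are mutually consistent, and this can be checked with a little arithmetic. The Python sketch below is only an illustration: the variable names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
# Values copied from the gene record above.
start_bp = 781862
end_bp = 784441
gene_length_bp = 2580
protein_length_aa = 859

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and encodes no residue.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, span_bp % 3, expected_protein_aa)  # prints: 2580 0 859
```

Under these assumptions the span equals the listed gene length, is a whole number of codons, and gives the listed protein length.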


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAACTTC AAACAAAATC GCCAACGACG CAAAATGATT TTAGTCAGCA CACGCCCATG 
ATGCGTCAAT ACTTACGCAT TAAAGCCGAG TACCCCGATC TTTTAGTTTT TTACCGCATG
GGTGATTTCT ATGAATTATT TTATGATGAC GCTAAAAAAG CGGCGAAATT ATTAAATATT
ACGCTAACCG CTCGAGGTCA ATCGGCCGGT CATGCCATTC CGATGGCCGG TGTTCCCTAC
CATGCCGTCG AAAATTATTT AACAAAACTA GTTCGACTCG GCGAATCCGT TGTCATTTGC
GAGCAAATTG GAGACCCCGC AACCAGTAAA GGTCCTGTCG CTCGCGAAGT CACCCGTATT
ATCACTCCAG GTACCGTTAG CGACGAAGCA CTGCTTGATG AACATCGCGA CAACACCTTA
ATGGTAATTC ATCAAGAGAA AGATCGTTTC GGTATCGCGA CATTAGATAT TACCAGCGGG
CGTTTTTTGA TTCAGGAAAT CATTTCTGAA AATGCACTAT TCGCTGAAAT TGAACGCATT
CGCCCGGCAG AACTTTTAAT CAGTGAAGAA AATAGCGTTC ATCCGCTAAA AGCCGACTCC
ATCAAGCGCC GACCTCCGTG GGAATTTGAT CATGCCACTG CGCTCACTTT ATTGTGCCAA
CAATTTCAGA CAAAAAGTTT GGATGGATTT GGTATTACTC ATTTGCCGCT GGCCATTACG
GCGGCAGGAT GTTTGCTGCA ATATGTTAAT TACACACAAA AAAGCGCCCT GCCTCATATT
CACTCCATTC AAGCCGAACA AAATGAGGAA GCCCTATTTA TCGACGCTAA CACGCGCCGT
AACCTTGAAT TAATCACCAA TTTACAAGGC GAAGAAGTTC ACTCTCTGGC TTGGCTGCTC
GATCACACAG CCACCCCCAT GGGAAGCCGA TTATTACGCC GCTGGATTAA TCGGCCCTTG
CGCGACCAAA TTTTATTGCA ACAAAGGCAA AACGCCGTTT CTACCCTTCT CGAAAAAAGA
AATTATTCGG AAATCTACGA AAATTTACGC CATATTGGCG ATCTCGAAAG GATAGTGGCA
CGGATCGCAC TTCGCTCTGC TCGGCCCCGA GATTTAATGC AACTGCGCCA GGCGCTCGGT
GTATTGCCAA CCCTTCACCA ACAGCTCACC AACTTGCCGC TTAATAAACA ATTACAAGAA
ATTAAAAATA ACCTTGGTCT TTTCGATGAA TTATTTCGAT TATTACAAAA AGCGATTATC
GAAAACCCAC CGATCGTTAT TCGCGATGGC GGCGTGATTG CCGATGGTTA TGACGCGCCA
TTGGATGAAC TGCGCAATAT GAGCACCAAC AGTCACCAAT TTTTAATTGA CTTAGAACAA
CAAGAGCGCG AACGCACCAA AATAAATACG GTTAAAGTTG GCTATAATCG CATCCATGGT
TATTACATTG AAATTTCACG CGCCCAAGCC AAACAAGCCC CTACGGAATA TATTCGGCGA
CAAACCCTAA AAAACGTGGA ACGTTACATC ACTCCCGAAT TAAAAATTTT TGAAGACAAA
GTGTTAAGCA GCCGCTCCCG CGCACTGGCC CGCGAAAAAG AACTTTACGA ACAATTATTG
GATACTTTAA TCGAAAAATT AATTCCTTTA CAACAATGCG CAAGTGCAAT TGCAAACCTG
GACGTTTTAA ATACCCTCGC TGAACGGGCT GACACACTTA ATTTTAACGC GCCACAGTTT
TGCGATTACC CTATTATTAA AATTGAGGCT GGGCGTCACC CTATTGTGGA AAATGTAATG
ACCGATCCGT TTATGCCCAA TGACACTCAT CTCGACGAAA AGCGCCGAAT GCTTATCATT
ACTGGCCCGA ATATGGGTGG TAAATCCACT TACATGCGGC AAACGGCTTT AATCACTTTA
CTCGCCTATA TTGGCAGTTT CGTACCCGCT AAAAATGCGC AACTTGGACC CATTGATCGC
ATCTTCACTC GGATCGGTGC GGCAGATGAT TTAGCCAGTG GCCGATCAAC CTTTATGGTA
GAAATGACTG AGACTGCAGC GATTTTGCAT AATGCGACAG AAGAAAGTTT AGTGCTAATG
GATGAAGTTG GCCGCGGTAC CAGTACGTTT GATGGGCTTT CGTTGGCTTA CGCTTGCGCT
TCTTATTTAG CAACAAAATT AAAAGCTTTT GCCTTATTTG CAACCCATTA TTTTGAATTG
ACAGCTTTAG CTTCGACACT ACAGGCGGTT AAAAATGTTC ACTTAGACGC TGTGGAGCAC
GAAGAAAAAA TTATTTTTTT ACACGCTCTG AGGGAGGGAC CGGCGAATAA AAGTTATGGA
TTGCAAGTAG CGCAGCTAGC AGGAATTCCT CGCTCAGTAA TTCAGCATGC CCGGCAAAAA
TTAGAAGAGT TGGAAAATCC CGTTATTTCG GAAACCCAAC AACCCCAACA AAATGAACTT
TTTCTTCCCA TAGAAAATCC TGTTTTAACG CAATTAGACA AACTCAATCC CGATAACCTA
ACGCCTAAAC AAGCTTTAGA TATCCTTTAT CAATTAATTC AATTACGTCA GCAAAAATGA
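
The gene sequence above is laid out in blocks of 10 bases, so the spaces and line breaks need to be removed before any computation such as GC content. The following is a minimal Python sketch of that step; the function names `clean_sequence` and `gc_percent` are hypothetical helpers, not part of any database tool, and the demo input is only the first line of the sequence, so its result is not the whole-gene figure listed in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only as a short demo input.
first_line = "ATGGAACTTC AAACAAAATC GCCAACGACG CAAAATGATT TTAGTCAGCA CACGCCCATG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

To reproduce the GC content for the whole gene, the full sequence text would be passed through `clean_sequence` in the same way.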
 
Protein sequence
MELQTKSPTT QNDFSQHTPM MRQYLRIKAE YPDLLVFYRM GDFYELFYDD AKKAAKLLNI 
TLTARGQSAG HAIPMAGVPY HAVENYLTKL VRLGESVVIC EQIGDPATSK GPVAREVTRI
ITPGTVSDEA LLDEHRDNTL MVIHQEKDRF GIATLDITSG RFLIQEIISE NALFAEIERI
RPAELLISEE NSVHPLKADS IKRRPPWEFD HATALTLLCQ QFQTKSLDGF GITHLPLAIT
AAGCLLQYVN YTQKSALPHI HSIQAEQNEE ALFIDANTRR NLELITNLQG EEVHSLAWLL
DHTATPMGSR LLRRWINRPL RDQILLQQRQ NAVSTLLEKR NYSEIYENLR HIGDLERIVA
RIALRSARPR DLMQLRQALG VLPTLHQQLT NLPLNKQLQE IKNNLGLFDE LFRLLQKAII
ENPPIVIRDG GVIADGYDAP LDELRNMSTN SHQFLIDLEQ QERERTKINT VKVGYNRIHG
YYIEISRAQA KQAPTEYIRR QTLKNVERYI TPELKIFEDK VLSSRSRALA REKELYEQLL
DTLIEKLIPL QQCASAIANL DVLNTLAERA DTLNFNAPQF CDYPIIKIEA GRHPIVENVM
TDPFMPNDTH LDEKRRMLII TGPNMGGKST YMRQTALITL LAYIGSFVPA KNAQLGPIDR
IFTRIGAADD LASGRSTFMV EMTETAAILH NATEESLVLM DEVGRGTSTF DGLSLAYACA
SYLATKLKAF ALFATHYFEL TALASTLQAV KNVHLDAVEH EEKIIFLHAL REGPANKSYG
LQVAQLAGIP RSVIQHARQK LEELENPVIS ETQQPQQNEL FLPIENPVLT QLDKLNPDNL
TPKQALDILY QLIQLRQQK
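
The protein sequence above should correspond to the gene sequence read codon by codon under translation table 11. The sketch below shows one way to check this in plain Python. It is an illustration under stated assumptions: `translate` is a hypothetical helper, the codon-to-residue assignments are those of the NCBI bacterial table (which match the standard code for elongation), alternative start codons are not handled because this gene begins with ATG, and ambiguous bases such as N are not supported.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from the record, as a short demo input.
first_line = "ATGGAACTTC AAACAAAATC GCCAACGACG CAAAATGATT TTAGTCAGCA CACGCCCATG"
print(translate(first_line))  # prints: MELQTKSPTTQNDFSQHTPM
```

The printed residues match the first 20 residues of the protein sequence listed above (MELQTKSPTT QNDFSQHTPM).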