Gene BURPS668_3264 details


Gene Information

Locus tag: BURPS668_3264
Symbol: mutL
ID: 4884520
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 3198794
End bp: 3200839
Gene Length: 2046 bp
Protein Length: 681 aa
Translation table: 11
GC content: 72%
IMG OID: 640129192
Product: DNA mismatch repair protein
Protein accession: YP_001060275
Protein GI: 126441610
COG category: [L] Replication, recombination and repair
COG ID: [COG0323] DNA mismatch repair enzyme (predicted ATPase)
TIGRFAM ID: [TIGR00585] DNA mismatch repair protein MutL
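
The coordinate and length fields in this record are related by simple arithmetic, which the following Python sketch checks. It assumes that the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. Neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(3198794, 3200839)
print(length)                  # 2046
print(protein_length(length))  # 681
```

Both values agree with the Gene Length and Protein Length fields of the record.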


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.815986
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCGAAT TCACCGATTC CGCCGCGGGC CGCTCGAGCA CGCCGCCCGC CGACGCGCCG 
TCGCCCGCCT CGCGCCCGCT GCGCGCGATC CAGCCGCTGC CCGACCAGTT GATCAGCCAG
ATCGCGGCGG GCGAAGTGGT CGAGCGGCCC GCCTCCGTCG TCAAGGAGCT CGTCGAGAAC
GCGCTCGACG CCGGCGCGGG CACGCTGCGC ATCCTGCTCG ACGAAGGCGG CGTCAAGCGC
ATTTCGATCA CCGACGACGG CTGCGGGATT CCCGCCGACG AGCTGCCGCT CGCGCTGATG
CGCCACGCGA CGAGCAAGAT CCGCTCGCTC GCCGAGCTCG AGGCGGTCGC GACGCTCGGG
TTCCGCGGCG AAGCGCTCGC ATCGATCGCG TCGGTCGCCG AGATGTTCAT CACGAGCCGC
ACCGAGGACG CCGCGCACGC GACGCGCATC GACGCGCAGA CGGGCGTCGT CGGGCCCGCG
GCCGGCACGC GCGGCACGAC GATCGAAGTG CGCGAGCTGT ACTTCAGCAC GCCCGCGCGC
CGCAAGTTCC TGAAGAGCGA GCAGACCGAG TTCGGCCATT GCCTCGAGAT GATTCGCCGC
GCGGCGCTCG CGCGGCCGGA CGTCGCGATC TCGGTGCTGC ACAACGGCCG CGCGGTCGAG
CACTGGAACG CGAGCGAGCC CGCCGCGCGC GTCGCGAAGA TCCTCGGCGA CGGTTTCGCC
ACCGCCCATC TGCCGCTCGA CGAGCGCGCC GGCCCGCTCG CCGTCTACGG CTGCGCGGGG
CTGCCGACCG CGAGCCGCGG CCGCGCGGAC CAGCAATACT TCTTCGTCAA CGGCCGCTTC
GTGCGCGACA AACTGCTCAC GCACGCGGTG CGCGCCGCGT ACGAGGACGT GCTGCACGGC
GATCGCTATC CGTCGTACGT GCTGTTCCTC GACCTGCCGC CGGAAGCCGT CGACGTGAAC
GTCCATCCGT CGAAGATCGA GGTGCGCTTC CGCGATTCGC GCTCGATCCA CCAGTTCGTG
TTCCACGCGG TGCAGCGCGC GCTCGCGCGG CACGCGGGCG CGTCGCCCGA GACGACGGCG
GGCGGCCACG CCGCGCATCT TGCGCCCATC GTGCCGGCAT CGGCCGACTC GGCGGCCGCG
CCGGGCGCCT CGTTTGTCCG CTCGGGCCCG ACGGCGGGCG CGGGCGTCGG TCAGCCCGCG
TCCGGCAATA CATGGCTGCG CCAGTCGCGG ATGACGCAGG GCACGCTGCC CGTCGCGCAG
CCGCTCGCGC TGTACGACGC GCTGTTCGGC CGCAAGGACA CGGGCGCGGG CATCCCGCGC
GGCGCGACGC TCGCGCTCGA AGCGCATGAC GCGCCGGACG GGGCGAATGC GCCGGGCGCG
CCGCTCTACG CGACCATGCC GGGCGGCGAC GCGACGCCCG CGTTCTCCCC GGCGGGTGCA
GCCGGTCTTC CGATGCACGA CGAGCAGCCG CTCGGCTTCG CGGTCGGCCA GATCCACGGC
ATTTACGTGC TCGCGCAGAA CGCGCGCGGG CTCGTGATCG TCGACATGCA CGCCGCGCAC
GAGCGGATCC TGTACGAGCA GTTCAAGCGC GCGCTCGCCG ATCGCACGAT CGCCGTGCAG
ACGCTGCTGA TTCCGGTGTC GATGACGGCG ACGCCCGTCG AGGTCGGCAC CGCGGAGGAG
GAGCGCGAGA CGCTCGACGC GCTCGGCTTC GATCTCGCGG TGCTGTCACC GACGACGCTC
GCGATCCGCG CGGTGCCCGC CTTGCTGAAG GACGCCGACC TGCAAGCGCT CGCGCGCGCG
GTGCTCGCGG ATCTGCATGC GTTCGGCGGC TCGCGGGTGC TGACCGAGCG CCAGCACGAG
CTGCTCGGCA CGCTCGCGTG CCATCACGCG GTGCGCGCGA ACCGGCGTCT CACGCTCGAC
GAGATGAACG CGCTGCTGCG GCAGATGGAG GCGACCGAGC GCGCGGATCA GTGCAACCAT
GGCCGGCCGA CCTGGTATCA ACTGACGCTC GGCGATCTCG ACAAGCTTTT CATGCGCGGC
CAATGA
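
The gene sequence is displayed in blocks of ten bases, so it has to be joined before any computation. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the page does not say how its own GC figure was rounded.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence, used here as a short example.
first_line = "ATGTCCGAAT TCACCGATTC CGCCGCGGGC CGCTCGAGCA CGCCGCCCGC CGACGCGCCG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full sequence block to `clean_sequence` and then to `gc_percent` gives the figure for the whole gene.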
 
Protein sequence
MSEFTDSAAG RSSTPPADAP SPASRPLRAI QPLPDQLISQ IAAGEVVERP ASVVKELVEN 
ALDAGAGTLR ILLDEGGVKR ISITDDGCGI PADELPLALM RHATSKIRSL AELEAVATLG
FRGEALASIA SVAEMFITSR TEDAAHATRI DAQTGVVGPA AGTRGTTIEV RELYFSTPAR
RKFLKSEQTE FGHCLEMIRR AALARPDVAI SVLHNGRAVE HWNASEPAAR VAKILGDGFA
TAHLPLDERA GPLAVYGCAG LPTASRGRAD QQYFFVNGRF VRDKLLTHAV RAAYEDVLHG
DRYPSYVLFL DLPPEAVDVN VHPSKIEVRF RDSRSIHQFV FHAVQRALAR HAGASPETTA
GGHAAHLAPI VPASADSAAA PGASFVRSGP TAGAGVGQPA SGNTWLRQSR MTQGTLPVAQ
PLALYDALFG RKDTGAGIPR GATLALEAHD APDGANAPGA PLYATMPGGD ATPAFSPAGA
AGLPMHDEQP LGFAVGQIHG IYVLAQNARG LVIVDMHAAH ERILYEQFKR ALADRTIAVQ
TLLIPVSMTA TPVEVGTAEE ERETLDALGF DLAVLSPTTL AIRAVPALLK DADLQALARA
VLADLHAFGG SRVLTERQHE LLGTLACHHA VRANRRLTLD EMNALLRQME ATERADQCNH
GRPTWYQLTL GDLDKLFMRG Q
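
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The following is a minimal sketch, not the tool used to build this record. It relies on the fact that translation table 11 shares its codon assignments with the standard code and differs only in permitted start codons, which this sketch ignores. The `translate` helper is illustrative and raises `KeyError` on codons containing characters other than A, C, G, or T.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (NCBI layout).
# Table 11 uses the same codon assignments as the standard code;
# alternative start codons are not handled in this sketch.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 6 bases of the gene sequence.
print(translate("ATGTCCGAAT TCACCGATTC CGCCGCGGGC"))  # MSEFTDSAAG
print(translate("CAATGA"))                            # Q
```

The two outputs match the first ten residues and the final residue of the protein sequence.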