Gene Dole_0149 details

Gene Information

Locus tag: Dole_0149
Symbol:
ID: 5692964
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 162933
End bp: 164750
Gene Length: 1818 bp
Protein Length: 605 aa
Translation table: 11
GC content: 64%
IMG OID: 641262726
Product: DNA mismatch repair protein MutL
Protein accession: YP_001528036
Protein GI: 158520166
COG category: [L] Replication, recombination and repair
COG ID: [COG0323] DNA mismatch repair enzyme (predicted ATPase)
TIGRFAM ID: [TIGR00585] DNA mismatch repair protein MutL
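
The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes that the start and end positions are inclusive and that the gene length includes one stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information table.
start_bp = 162933
end_bp = 164750
gene_length_bp = 1818
protein_length_aa = 605

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that is not part of the protein.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp == gene_length_bp)                 # True
print(gene_length_bp % 3 == 0)                   # True
print(expected_protein_aa == protein_length_aa)  # True
```

Under those assumptions the three values agree: 164750 - 162933 + 1 = 1818 bp, and 1818 / 3 - 1 = 605 aa.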


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCGTGA TTCGGATTCT TCCGGAACAC CTTTCCAACA AGATCGCTGC CGGCGAGGTG 
GTGGAACGGC CCGCCTCGGT GGTCAAGGAG CTGGTGGAAA ACGCCATCGA CGCCGGAGCC
TCGGCCATCT TCGTGGAAAT CCAGAACGGG GGCCGCTCCC TGGTCCGGGT AACCGACAAC
GGGGCGGGCA TGGGCAAAGA CGACGCCCTG CTCTGCCTGG AGCGGTACGC CACCAGCAAA
ATCGCCGACG AAAAAAGCCT GTTTGCCATC TCCACCCTGG GGTTCCGGGG AGAGGCCATT
CCCAGCATTG CCTCGGTATC CGAGTTCGTT CTGACCACCC GGCCGGCCGA TGCCGATGCC
GGCACCCGCA TTCGGGTCTC CGGCGGCACC ATCACCGACG TGGCCGACAC AGGCGCGCCC
CCGGGCACCA CAGTGGAGGT GGGCCGCCTC TTTTTCAACA CCCCGGCCCG GCGCAAGTTT
CTAAAGGCCG TGGCCACGGA AACCGGCCAC ATTGCCGACA CCCTGGCCGC CTTTGCCCTG
TGCCGGCCCG ACATTCACTT CAAGCTGGTC CAGGACGGCC GGACCGTGAA AAACTGGCCC
CGAACCGCCG ATCCCCTGGA ACGGATCACG GACGTGCTGG GCAGCCAGAC CCGGGGCCAC
ATGGCGCCGA TTTCCTGTGC TGACGACACG GTTACCATAT CCGGCTGGAC CTCCTCTCCG
GCGGTGACGC GCAGCACCTC CCAGAAGATT CACCTGTTTG TAAACGGCCG GATCGTAAAA
GACCGGGGCC TTCAGTACGC CCTGTTTGAA GGCTACAGGG GGCGGCTGGT CAAAGGCGCC
TTTCCCGTGG CCGCCATCTT TATCAACATT CCCTTTGACC GGGTGGATGT CAATGTTCAC
CCCACCAAAA ACGAGGTGCG GTTTGCCGAC CAGCGCCGGG TCTACCAGGC CCTGAAAACC
GCGGTGGCCG GTGCCTGGGC CATAAACGCA GCCCCACCCT GGAGCGACGG ACAAAAGCCA
TCCCGGCCGG CGCCGGCTTT TTCTTTGCCG GGCGTCAGGC CCGCGCCCAA AGTCAGCGAG
TCTGTGTTTC AATATGCCAC ACCCCTTCCC CTGCCCTCCT TTGCGAATGC GGGGGATGAA
CCGCCGGCCC CGGCCAGGCC CCGGCTTTCC GGGGATGCAG AGCCCCGGCC GCTCGAAACG
CCCGTTTCTT CGGACGGTCA TCAGTCGTTT TTCCGCTTTT CCGACCTGGC CGTGGTGGGA
CAGGTGTTTA ACACCTATAT TGTCTGCCAG GCCGACGCCC AGGTGGTGCT GATCGACCAG
CACGCGGCCC ATGAACGCAT CCTGTTTGAA GCCTTGAAGC AGCGGGGCCA AAACCGCCCG
CTCCCCGGCC AGAACCTGCT GGTGCCGGAA ACCGTGGAAC TGACCCACAA AAGCGCGGCC
GCCATTGAGC CCCTGCTGGA CGCCTTTGCC GCCATGGGCC TTGAGATCGA GCCCTTTGGC
CCCACCGCCT TTGTAATCAA GGCCGTGCCC GCCGTCCTGG CCGACACGGC CGTTGGCCCC
ATGGTGGCCG AAATCGCGGA AAAGCAGGCA GACACCGGGT TTGCCCCGGA CCCCGACACC
CTCACCAACG ACGTGCTGCA CGTGATGGCC TGCCACGCCG CCATCCGCGC CCACCAGCGG
CTTTCCGAGG CGGAAATAAA GGCCCTGCTG GCCCGGCTGG ACGGCTGCGA CAACCCCAGG
CACTGCCCCC ACGGCCGGCC CACCGCGATT TTCTGGTCCA CAAGTGAGAT TGAAAAGGCT
TTCAAGCGCA TTGTCTGA
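
The gene sequence is laid out in space-separated blocks of ten bases, so any analysis has to drop the whitespace first. As a minimal sketch, the hypothetical helper `gc_percent` below computes the percentage of G and C bases in such a block-formatted string. The sample input is only the first line of the sequence, so pasting in all lines is needed before comparing against the GC content reported in the Gene Information table.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C among the A/C/G/T characters in seq.

    Whitespace and any other characters are ignored, so the block-formatted
    sequence can be pasted in as-is.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence only (60 bases), used as sample input.
first_line = "ATGGCCGTGA TTCGGATTCT TCCGGAACAC CTTTCCAACA AGATCGCTGC CGGCGAGGTG"
print(f"{gc_percent(first_line):.1f}% GC in the first 60 bases")
```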
 
Protein sequence
MAVIRILPEH LSNKIAAGEV VERPASVVKE LVENAIDAGA SAIFVEIQNG GRSLVRVTDN 
GAGMGKDDAL LCLERYATSK IADEKSLFAI STLGFRGEAI PSIASVSEFV LTTRPADADA
GTRIRVSGGT ITDVADTGAP PGTTVEVGRL FFNTPARRKF LKAVATETGH IADTLAAFAL
CRPDIHFKLV QDGRTVKNWP RTADPLERIT DVLGSQTRGH MAPISCADDT VTISGWTSSP
AVTRSTSQKI HLFVNGRIVK DRGLQYALFE GYRGRLVKGA FPVAAIFINI PFDRVDVNVH
PTKNEVRFAD QRRVYQALKT AVAGAWAINA APPWSDGQKP SRPAPAFSLP GVRPAPKVSE
SVFQYATPLP LPSFANAGDE PPAPARPRLS GDAEPRPLET PVSSDGHQSF FRFSDLAVVG
QVFNTYIVCQ ADAQVVLIDQ HAAHERILFE ALKQRGQNRP LPGQNLLVPE TVELTHKSAA
AIEPLLDAFA AMGLEIEPFG PTAFVIKAVP AVLADTAVGP MVAEIAEKQA DTGFAPDPDT
LTNDVLHVMA CHAAIRAHQR LSEAEIKALL ARLDGCDNPR HCPHGRPTAI FWSTSEIEKA
FKRIV
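
The protein sequence should follow from the gene sequence under translation table 11. The sketch below is one way to check that, not the method used to generate this record: `translate` and the constant names are hypothetical, and the amino-acid string is the translation table 11 codon assignment in T, C, A, G codon order. Table 11 also permits alternative start codons that are read as M at the first position; that case is not handled here because this gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a block-formatted DNA string, dropping one trailing stop."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    # Remove the terminal stop codon symbol, if present.
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases and last 18 bases of the gene sequence.
print(translate("ATGGCCGTGA TTCGGATTCT TCCGGAACAC"))  # MAVIRILPEH
print(translate("TTCAAGCGCA TTGTCTGA"))               # FKRIV
```

Both fragments match the corresponding ends of the protein sequence. Translating the full gene sequence the same way gives a direct comparison against all 605 residues.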