Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_0363 |
Symbol | mutL |
ID | 5711272 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 346074 |
End bp | 347939 |
Gene Length | 1866 bp |
Protein Length | 621 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641266261 |
Product | DNA mismatch repair protein |
Protein accession | YP_001531713 |
Protein GI | 159042919 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0323] DNA mismatch repair enzyme (predicted ATPase) |
TIGRFAM ID | [TIGR00585] DNA mismatch repair protein MutL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.127252 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGCGC CCAACCCCAA TATCATGCCT TCGCACCCGG ATCTTGCGGC GCGCCCGGTG ATCCGGCAGC TCGACGAGGC CGCCATCAAC CGCATCGCCG CGGGCGAGGT GATCGAACGC CCCGCCTCCG CCGTCAAGGA GTTGGTGGAG AACGCGCTCG ATGCCCAGGC CCGGCAGATC GAGATTGCCT ATGCGGACGG GGGCAAGACC CTGCTGCGGG TGACCGATGA CGGCATCGGG ATCGCCGCCG GGGATCTGCC GCTGGCGCTG GCGCGGCACG CTACGTCCAA GATCGACGGC GCGGATCTTC TGAACATCCA TACCTTCGGT TTCCGGGGCG AGGCGCTGCC GTCGCTCGGC GCGGTCGGGC GGCTGACCAT CGCCTCGCGC GCGGCGGGTG CGGAGGCGGC GGAGGTCACG GTGGAGGGCG GGCGCATGGG CCCGGTCCGG CCCGCGGCGC TGAACCGCGG CACGGTCGTC ACCTTGCGCG ATCTGTTTTC CGCCACGCCC GCGCGGCTGA AATTCCTGCG CTCGGACCGG GCGGAGGTGC AGGCGATCGG CGAGGTCGTG CGCCGCCTCG CCATGGCCGA GCCGTCCGTG GGCTTCACCC TGCGCGATGT CTCCGGGGGC GGCGAGGGGC GGGTGACCTT CCGCGTGGCG CCGGAGCAGG GTGATTTGTT CGACGCGCTG CACCGGCGGC TGGGCAGGGT TCTGGGTTCG GAGTTCGCCG AGAATGCGCT GAGGATCGAG GGGGAGCGGG AGGGGCTGCG CCTGTCCGGA TATGCGGCCC TGCCGACCTA TTCGCGCGGC GCGGCGGTGG CGCAGTTTCT CTTCGTCAAC GGGCGGCCCG TGCGCGACAA GCTGCTGGTG GGCGCGCTGC GCGGGGCTTA TGCGGATTTC CTGAGCCGGG ACCGGCATCC GGCGGCGGTG CTCTTTGTCG ACTGCCCGCC GGAGCGGGTG GATGTGAACG TCCATCCCGC CAAGTCCGAG GTGCGCTTCC GCGAGCCCGG GGTGGCGCGC GGGCTGATCG TCACCGCCCT GCGCCATGCC CTGGCCGAGG CCGGGCACCG GGCGTCGAGC ACCGTGGCCG ATGCCACCCT GGGCGCGTTC CGGGCGCCGG ACGCGGTCGG GAGCGGCGCG CGGATCTACC AGATGGACCG ACCCTCTGCG GCGGCCCTGG GGCGCAGCAC CGCCTGGCAG GCCCCGGAGA CCGCGGCGCA GGGCTTCGGA TTTGCCGAGG CGCCGTCGGC CCGGGTGGAG CCTGCCGAGA CCGTCGAGGC CATCGCGCGG CCCCTCGGGG CGGCGCGTGC GCAGCTCCAC GAGAATTACA TCGTGGCCCA GACCGAGACC GGCATGGTGC TGGTCGATCA GCACGCGGCC CATGAGCGGC TGGTCTATGA GCGGCTCAAG GCGCTGATGG CGGAGAACGG CGTGCCGTCC CAGGCCCTGC TGATCCCCGA GATCGTCGAG ATGTCGGAGG CCGACGCGCG GACCCTGCTG GACCGTTCCG AAGAACTTGC CGCGCTGGGC TTGCGGATCG AGCCCTTCGG GCCCGGTGCG GTGGCCGTGC GTGAGACACC GGCGCTGCTG GGGCCGGTGA AGGCCGAGGC GTTGTTGCGC GATATCCTGG ACGAGCTTTC GGACCTGGGC CAGACCGATG CGCTGCAGGC GCGGATCGAG GCGATCCTGT CGCGCATGGC GTGCCACGGT TCGGTCCGGT CCGGGCGGCG GATGAGCGGG GAGGAGATGA ACGCGCTTTT GCGCCAGATG GAGGCGACGC CCCATTCGGG CCAGTGCAAC CACGGGCGGC CCACCTATGT GGAGCTGAAA CTGGCCGATA TCGAGCGGCT CTTCGGGCGC ACATGA
|
Protein sequence | MSAPNPNIMP SHPDLAARPV IRQLDEAAIN RIAAGEVIER PASAVKELVE NALDAQARQI EIAYADGGKT LLRVTDDGIG IAAGDLPLAL ARHATSKIDG ADLLNIHTFG FRGEALPSLG AVGRLTIASR AAGAEAAEVT VEGGRMGPVR PAALNRGTVV TLRDLFSATP ARLKFLRSDR AEVQAIGEVV RRLAMAEPSV GFTLRDVSGG GEGRVTFRVA PEQGDLFDAL HRRLGRVLGS EFAENALRIE GEREGLRLSG YAALPTYSRG AAVAQFLFVN GRPVRDKLLV GALRGAYADF LSRDRHPAAV LFVDCPPERV DVNVHPAKSE VRFREPGVAR GLIVTALRHA LAEAGHRASS TVADATLGAF RAPDAVGSGA RIYQMDRPSA AALGRSTAWQ APETAAQGFG FAEAPSARVE PAETVEAIAR PLGAARAQLH ENYIVAQTET GMVLVDQHAA HERLVYERLK ALMAENGVPS QALLIPEIVE MSEADARTLL DRSEELAALG LRIEPFGPGA VAVRETPALL GPVKAEALLR DILDELSDLG QTDALQARIE AILSRMACHG SVRSGRRMSG EEMNALLRQM EATPHSGQCN HGRPTYVELK LADIERLFGR T
|
| |