Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0668 |
Symbol | |
ID | 4710062 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 750168 |
End bp | 752030 |
Gene Length | 1863 bp |
Protein Length | 620 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 639855130 |
Product | DNA mismatch repair protein MutL |
Protein accession | YP_001002252 |
Protein GI | 121997465 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0323] DNA mismatch repair enzyme (predicted ATPase) |
TIGRFAM ID | [TIGR00585] DNA mismatch repair protein MutL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.422126 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGACGC GCGGTATTCG TCCCCTGCCG GATAACCTGA TCGACCAGAT CGCCGCTGGC GAGGTGGTCG AGCGCCCCGG TTCGGTGGTC AAGGAACTGG TGGAGAATAG CCTCGACGCT GGCGCCGGGC GTGTCGAGGT GCAGATTGAG CGCGGCGGCA AGCAGCGTAT CCGCATCGCC GATGACGGCG ACGGCATCCC GCCCGAGGAG CTGGAGTTGG CGCTGCGCCG GCACGCCACC AGCAAGCTCA CCGGCCTCGA GGAGCTGGAG CGCATCGCCA GTCTGGGGTT CCGGGGGGAG GCCCTGCCGA GCATCGCTGC CGTTTCGCGG CTGACGCTGG CCTCGCGCAC CGCCGAGGCG GAGCTGGGCC ATCAGCTGCG CTGTGACGGC GGCGCACTGG GTGCGCCGGA ACCGGTGGCT CACCCCCCGG GGACCACGGT CACCGTCGAC GACCTGTTCT ACAACACGCC AGGGCGTCGC AAGTTCCTGC GCACCGAGCG CACCGAGCTC TACCACGTCC AGGAGGCGTT GCGCCGGCTG GCGCTGAGCC GCTTCGACGT CGGTTTCTCG CTCGTCCATC AGGGGCGGCG GCTCTGGTCG GTGCCCCGGG CGGAGAGCGA GACGGAGCGG CACGAGCGTC TGGCGGAGCT GCTCGGTCGT GCCTTTGCCG ATCACGCCCT GGCGGTGGAG TTGGAGGGGG CCGGGCTGCA GCTGCGGGGC TGGCTGGGGC TGCCTACGGC GGCGCGGCGC CAGGGGGATC TTCAGTACTT GTTCGTCAAC GGCCGGCTGG TGCGCGACCG GGGGGCGGCC CACGGCATCC GCCAGGCTTA CAGCGACTGC CTCTACCGCG ATCACTACCC GGCCTACGTC CTCTTTCTGG AGATGGATCC GGCTCGGGTG GATGTCAACG TCCACCCCAT GAAGCATGAG GTGCGTTTTC GCGACGGGCG GACGGTGCAC GACTTCCTCG CCCGGCGCAT CGCCGACGCC CTGGCCACCG CCGAGCCGGC CGGTGCCGCG GCACCGCCTG CGGCGGAGCG GCCGCCGTCG GGCGCGCCGA CCGGCCCCGG GCAACCGGCG GCGGCCGAGC GGACGGGGCC GGCCTCCGAA GGGCTCGGCG GCACGGCGGA GCTGGGGCTG CCGCTGGCCG AGGCGCGGCA GCTCTATGGG GGCGCCGACG CCGCTGCCGA GGGCCCTGGC GGCGCGGCTG AAGGGGCGTC GTCGGCCATT GCCGGATCCC CCGGGCCGTC GCAGACCCGG GACCGGGAGA GTGAGGCGAC CCCGGAGCTC GGCTACGCCG TCGGGCAGAT CCGCGACGCC TATATTCTGG CCGAGTCGCA GCGGGGGCTG GTGGTGGTGG ACATGCACGC CGCCCACGAG CGGGTGGTCT ACGAGCGGAT GAAGGCGCAG CTGGTGGCGT CGGGGATCGC CACGCAGTCG TTGCTGGTGC CGGTGAGTGT GCCGGTGACC CCGGCGGAGG CCGAGCGGGT GGAGCTGCAC GCGGCTACGC TGGCCCGGGC GGGTCTGGAG GTGGACCGCG CCGGCCCCGA GTCGGTGCGT GTTCACCGAG TACCGGCGCT GCTCGCCGAG GCCGACGCCG CGGCCCTGGT CCGCGACGCG GTGGCGGCGC TGGAGTCCGA GGGGACCGGT GGGCGGGTCG AGGACCGGGT CCACGCGCTG CTGGCGCAGA TGGCCTGCCA CGGGTCGGTC CGCGCCGGAC GGCGTCTGGA GCGCGCCGAG ATGGACGCCC TGCTGCGGGA TATCGAGCGC ACCCCGCGGG CGGCGCAGTG CAACCACGGG CGGCCGACCT ACACCGTGCT CGACGACGAG GCGCTGGCCC GGCTGTTCAT GCGGGGGCGG TGA
|
Protein sequence | MTTRGIRPLP DNLIDQIAAG EVVERPGSVV KELVENSLDA GAGRVEVQIE RGGKQRIRIA DDGDGIPPEE LELALRRHAT SKLTGLEELE RIASLGFRGE ALPSIAAVSR LTLASRTAEA ELGHQLRCDG GALGAPEPVA HPPGTTVTVD DLFYNTPGRR KFLRTERTEL YHVQEALRRL ALSRFDVGFS LVHQGRRLWS VPRAESETER HERLAELLGR AFADHALAVE LEGAGLQLRG WLGLPTAARR QGDLQYLFVN GRLVRDRGAA HGIRQAYSDC LYRDHYPAYV LFLEMDPARV DVNVHPMKHE VRFRDGRTVH DFLARRIADA LATAEPAGAA APPAAERPPS GAPTGPGQPA AAERTGPASE GLGGTAELGL PLAEARQLYG GADAAAEGPG GAAEGASSAI AGSPGPSQTR DRESEATPEL GYAVGQIRDA YILAESQRGL VVVDMHAAHE RVVYERMKAQ LVASGIATQS LLVPVSVPVT PAEAERVELH AATLARAGLE VDRAGPESVR VHRVPALLAE ADAAALVRDA VAALESEGTG GRVEDRVHAL LAQMACHGSV RAGRRLERAE MDALLRDIER TPRAAQCNHG RPTYTVLDDE ALARLFMRGR
|
| |