Gene Information |
Locus tag | Htur_3218 |
Symbol | |
ID | 8743838 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | - |
Start bp | 3313863 |
End bp | 3316079 |
Gene Length | 2217 bp |
Protein Length | 738 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 646513802 |
Product | DNA mismatch repair protein MutL |
Protein accession | YP_003404756 |
Protein GI | 284166477 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0323] DNA mismatch repair enzyme (predicted ATPase) |
TIGRFAM ID | [TIGR00585] DNA mismatch repair protein MutL |
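The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes 1-based inclusive coordinates and a coding sequence whose final codon is a stop codon; the function names are illustrative and not part of any database API. The `gc_percent` helper shows how a GC percentage could be computed from a space-grouped sequence string like the one on this page; no result for this gene is asserted here.

```python
# Values copied from the Gene Information table.
START_BP = 3313863
END_BP = 3316079


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # gene length in bp
    print(protein_length(length))  # protein length in aa
```

With the coordinates listed here, the two printed values should match the Gene Length and Protein Length fields (2217 bp and 738 aa).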
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGACG AGCCACCACA GGAGACCGAG ATCCGCCAGC TAGATGAGGA CACCGTCGCC CGCATCGCCG CCGGCGAGGT CGTCGAACGC CCCGCCAGCG CGGTGAAGGA ACTCGTCGAG AACAGCCTCG ACGCGAACGC CGACAGCGTC GACGTCACCG TCGAAGCGGG CGGCACCGAA CTGATCCGGG TCGCCGACGA CGGCCGCGGC ATGGGCGAGG CCGACGTCCG CGCGGCCGTC CGCGAGCACA CGACGAGCAA GATCGAGGGG CTAGAGGACC TCGAATCGGG CGTCGCCACG CTGGGCTTCC GCGGCGAGGC CTTACACACC ATCGGCTCCG TCTCGCGAGT GACGATCGAA TCGCGCCCAC GCGACGGGGA CGAGTCCGGC GGTGCGGGAA CGAAGCTGAC CTACGAGGGC GGCGAGGTGA CGGGCGTCGA GCCGACGGGC TGTCCCGAGG GGACGATCGT CGAGATCGAG GATCTGTTCT ACAACACGCC GGCCCGCCGG AAGTTCCTCA AGACGACGGC GACGGAGTTC GCCCACGTCA ACCGCGTCGT CACGCGCTAC GCGCTGGCGA ACCCCGACGT CGCGGTCTCG CTGACTCACG ACGGTCGCGA GGTGTTCGCG ACGACGGGAC AGGGAGACCT CCAAGCGGCC GTCATGGCGG TCTACGGCCG CGAGGTCGCC TCGGCGATGA TCGCCGTCGA GGCCGACGGC GACGACCTCC CGCCGGGGCC GCTCGAGTCG GTCTCCGGGC TGGTCTCTCA CCCCGAAACC AACCGCTCGA GTCCGGAGTA CCTCGCGACC TACGTCAACG GTCGGTCGGT CACCGCCGAC GCCGTCCGCG AGGGGATCAT GGGCGCCTAC GGCGCCCAGC TGGGCGGCGA TCGCTACCCG TTCGTGACGC TCTTTCTCGA GGTGCCCGGC GAGGCCGTCG ACGTCAACGT CCACCCCCGG AAGCGGGAGG TGCGGTTCGA CGACGATGAT TCCGTCCGCC GGCAGGTCGA CGCCGCCGTC GAGTCGGCGC TGCTGGAGCA CGGCCTGCTG CGCTCGCGGG CGCCCCGCGG CCGGTCGGCA CCCGGCGAGG CCAGCGTCGC TCCCGGCGAC GCCCCTCGCG GGACGAGCGA CGGGGAGCCG ACGACCGAGG ACCTCCCGGC CGCGCTCGAG GGCGACGGTG AGTCGGAATC GGGATCCGAG TCGCCATCCG AACCGACGGC CGACGATACC GCAGCGGCCT CGTCGCGGTC GCCGTCGACA GTCGACGCCG AATCCGCCGA CAGCGGTTCG ACGACGGGTT CGGCCGGATC ACCACCGGCC GGCGGCGCCT CGAGCGGCCC GTCGGGAACC GCTTCGAGCG GTCCGTCCGG CAGCGACTCG AGTGGTACGT CTGAAACGCC GTCGGGAACC GCCTCGAGCG AGTCGACGCG ACGCGCCTCG AGCGACGCTC CGACGACCGA GTCGACGGCG TCTCGAGGAG ACTCGAGCGT CGATCACAGA GACGACGATT CCGATCGGAG CGACCGCCAG CCGGAGCGCA AGTTCGCCGC GGCCACCGAA CAGCGGACGC TCTCCGGCGA GCCCGCGACC GGAGACGAGA CCGACTTCGA CTCGCTGCCC GCCCTGCGCG TGCTGGGACA GCTTGACGAC ACCTACCTCG TCTGCGAGAC CGACGACGGC CTCGTCCTGA TCGACCAGCA CGCCGCCGAC GAGCGGGTCA ACTACGAGCG CCTGCAGCAG GCGTTCGCCG ATGATCCCGC CGCGCAGGCG CTGGCGGAGC CCGTGGAACT CGAGTTGACC 
GCCGCTGAGG CAGAGGCTTT CGAGGGCTAC CGCGAGGCGC TCTCGCGGCT GGGCTTCTAC GCCGACCGGA CCGACGACCG GACGGTGGCC GTGACGACGG TACCTGCGGT ACTCGAGGAG ACCATCGCGC CCGAGCGGCT GCGGGACGTC CTCGCGTCGT TCGTCGAGGG CGACCGCGAG GCGGGCGCGG AGACGGTCGA CGCGCTGGCC GACGAGTTCC TCGGGGATCT GGCCTGCTAC CCGTCGATCA CGGGGAACAC GTCGCTGACC GAGGGGTCAG TGGTCGACCT CCTCGAGGCC TTAGACGACT GTGAGAATCC GTACGCCTGT CCGCACGGGC GACCGGTGAT CGTTCGGTTC GACGAGCGCG AGATAGAGGA TCGATTCGAG CGGGATTACC CGGGCCACCA GGGCTGA
|
Protein sequence | MTDEPPQETE IRQLDEDTVA RIAAGEVVER PASAVKELVE NSLDANADSV DVTVEAGGTE LIRVADDGRG MGEADVRAAV REHTTSKIEG LEDLESGVAT LGFRGEALHT IGSVSRVTIE SRPRDGDESG GAGTKLTYEG GEVTGVEPTG CPEGTIVEIE DLFYNTPARR KFLKTTATEF AHVNRVVTRY ALANPDVAVS LTHDGREVFA TTGQGDLQAA VMAVYGREVA SAMIAVEADG DDLPPGPLES VSGLVSHPET NRSSPEYLAT YVNGRSVTAD AVREGIMGAY GAQLGGDRYP FVTLFLEVPG EAVDVNVHPR KREVRFDDDD SVRRQVDAAV ESALLEHGLL RSRAPRGRSA PGEASVAPGD APRGTSDGEP TTEDLPAALE GDGESESGSE SPSEPTADDT AAASSRSPST VDAESADSGS TTGSAGSPPA GGASSGPSGT ASSGPSGSDS SGTSETPSGT ASSESTRRAS SDAPTTESTA SRGDSSVDHR DDDSDRSDRQ PERKFAAATE QRTLSGEPAT GDETDFDSLP ALRVLGQLDD TYLVCETDDG LVLIDQHAAD ERVNYERLQQ AFADDPAAQA LAEPVELELT AAEAEAFEGY REALSRLGFY ADRTDDRTVA VTTVPAVLEE TIAPERLRDV LASFVEGDRE AGAETVDALA DEFLGDLACY PSITGNTSLT EGSVVDLLEA LDDCENPYAC PHGRPVIVRF DEREIEDRFE RDYPGHQG