Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_1501 |
Symbol | |
ID | 7400329 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 1508221 |
End bp | 1510077 |
Gene Length | 1857 bp |
Protein Length | 618 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643708563 |
Product | DNA mismatch repair protein MutS domain protein |
Protein accession | YP_002566159 |
Protein GI | 222479922 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1193] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGACTGG AGGACTACTG GGGGATCGGC CCGAAGACGA GCGAGCGGCT CACGGAGTCG CTCGGGACCG AGCGGGCGAT CGAGGCGATC GAGGCGGCCG ACGTCCGGGC GCTCGTCGAC GCCGGGCTCC ACCGCGGGCG AGCCACCCGG ATCCTCCGCC GCGCGAACGG CGAGGCCGGC ATGGGTGTCC TCGCGACTGG CGACGCACGC TCGGTGTACG ACGACCTCCT CACGGTAGCG GCCGGCCACG CGCTGACGGA CCACGCCGCC GACCGAATCC GGGTGTTGAC GCCCCTCACC GAGCGGAGCG CGATTGAGTC GCGGCTCGAT GAGGTGGTCG CCGCTCGCGA CGCGTGGGCA GCACTCGACG ACGGCGAGCG CGACCGCGTC GTCGCCGCGT TCGACGACTA CGACGCGGCC GAGGGCTCCG ATCTGGCGGC CGTCGAGACC GCGGTCGCGC TGCGCGATGT GGGTCTCACG GAGACGCCCT TCGAGGATAT CGGGGCGCTG GATGGCGACA GCCTGCGCGA CGCCGCCGAC GCCCTCGCCG ACGTGCGGGG TGCGATCGAC CCGACGGGCG TCGACGGCGA CGGCGAGATC GAGGTCGCGC GCGGTGCGGA CGACGAGCTT GACCGCCTGC GTGAGCAGTT CGACGCGGCG GAGGAGCTGG CGAACTCCGC GTTCGACGTG CTTGATACGG TTCGGGACGG CTCCCTGCGC GACTTCGAGG CGCTGGAGGC AGCGACGATC GACCACGTCG CTCGCGAGAC CGGCGTCGAT CCGGCGACGG TGCGCTCGGT CGCGCCGGAC GACGCGATCG ACGCCGCCGA CTTCGTCTCC GCCACGCTCC GCGATCTGGT GACGGAACTG GAGGCGGCGG TCGCGGAGCG CGAGGAGACC GTCGCGGCCG ACATCCGCGA GCGGATCGGC GGGATGCGAG TTGGGGATGA GGGAGACAAA GGTGAGAAAG ATGACGAAGC TGACGAAGCG GCGACCGGAA CCGTCGCCGG CGCGGTCGCG GCCGTCTCCG ACGCCGCGTT CCTGCTGTCG CTCGCGCGGT TCGCGGTCGC GTACGATCTG ACTCGACCGA CCCTCGTCGA CGACGGCGTC GCGGTGCGAA ACGCTCGCAA CCTCTTCATC GACGGCGAGG TCCAGCCGGT GTCGTACGCG ATCGGCTCAC ACTCGCTTGC GGGCGAACCC GGCGTCGCGA GCGTCGACGC GCCGCCGACC GGCGACCGCG TGAGCGTCCT CACGGGGGCG AACTCGGGCG GGAAAACCAC CCTGTTGGAG ACGCTGTGTG CGGTGGCACT GCTGGCGTCG ATGGGGCTTC CGGTGCCGGC CGAGGAGGCG GAGGTCGGTG CGTTCGATCG GATCGTGTTC CACCGACGGC ACGCCTCCTT CAACGCCGGT GTGTTGGAGT CGACGCTGAA GTCGGTCGTC CCGCCGCTGG TCGAGGACGG GCGGACGCTG ATGCTCGTCG ACGAGTTCGA GGCGATCACG GAGCCGGGCC GAGCCGCCAA CCTGCTGAAC GGGCTCGTGA CGCTCACCGT GGACCGCGGC GCCCTCGGCG TGTACGTCAC GCACCTCGCG GAGGACTTGA GCCCGCTGCC CGAGGCCGCC CGGATCGACG GTATCTTCGC CGAGGGACTC ACGAACGACT TGGACCTCCG CGTCGACTAC CAGCCGCGGT TCGGTACCGT CGGGAAGTCG ACGCCGGAGT TCATCGTCTC GCGGCTCGTG GCGAACGCGA AAGACCGCGG CGTCCGCGCC GGGTTCGAGC ACCTCGCCGG CGCGGTCGGC GAAGAGGCGG TCCAGCGCAC CCTCTCGGAC GTGGAGTGGT CGGAAGGCGA TGACTGA
|
Protein sequence | MRLEDYWGIG PKTSERLTES LGTERAIEAI EAADVRALVD AGLHRGRATR ILRRANGEAG MGVLATGDAR SVYDDLLTVA AGHALTDHAA DRIRVLTPLT ERSAIESRLD EVVAARDAWA ALDDGERDRV VAAFDDYDAA EGSDLAAVET AVALRDVGLT ETPFEDIGAL DGDSLRDAAD ALADVRGAID PTGVDGDGEI EVARGADDEL DRLREQFDAA EELANSAFDV LDTVRDGSLR DFEALEAATI DHVARETGVD PATVRSVAPD DAIDAADFVS ATLRDLVTEL EAAVAEREET VAADIRERIG GMRVGDEGDK GEKDDEADEA ATGTVAGAVA AVSDAAFLLS LARFAVAYDL TRPTLVDDGV AVRNARNLFI DGEVQPVSYA IGSHSLAGEP GVASVDAPPT GDRVSVLTGA NSGGKTTLLE TLCAVALLAS MGLPVPAEEA EVGAFDRIVF HRRHASFNAG VLESTLKSVV PPLVEDGRTL MLVDEFEAIT EPGRAANLLN GLVTLTVDRG ALGVYVTHLA EDLSPLPEAA RIDGIFAEGL TNDLDLRVDY QPRFGTVGKS TPEFIVSRLV ANAKDRGVRA GFEHLAGAVG EEAVQRTLSD VEWSEGDD
|
| |