Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_2604 |
Symbol | |
ID | 7399830 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 2581499 |
End bp | 2583496 |
Gene Length | 1998 bp |
Protein Length | 665 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643709676 |
Product | DNA mismatch repair protein MutS domain protein |
Protein accession | YP_002567245 |
Protein GI | 222481008 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1193] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0194376 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.689485 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGTTGG AGGCGATCCC CGGCGTCGGC GCGAAGACGG CCGCGGCGTT GCACGAGCTC GACGACCCGG TCGCGACCGT CGAGTCCGGC GACGTCGCCG CGATCGCCCG CGCGCCCGGT GTCAACGAGG CGCGCGCGGC CCGCATCGCT CGCGGAGCGA TCCGTCGCCG ACACGACGAC GACGGGCGCG TGCTGGCGAC CGACCGCGCC CGCGAGGTGT ACCGCTCGGC GATCGACCTG CTGCGCGAGC GCACCGTCAC CGACTACGCC GCCAAGCGGC TGGAGACGTT CTACCCGAGC GAATCGACCT CGCGGATCGC GGAGGCGCAG GCGTTCGTCG AGGGGGCGAT GGAGCGCGAG CCCGATCCCG CCGTCCACGA GGCGCTCGCC GGGGTCAAAC CCCTGATCGA CCCGCCGACG GTTCGAGTGC GCGACCGCTG TCTCGCGACG GCCGACGCCG AGGCGCTCGC CCGCGCCGAG TCGGCGGTGC CGGAGCTGTC GGTCGAGACC GTCGAGAACG CCCGTGACAT CTCGGAACTC GCGCGGTCGT ACGCGACCGT GATCGTCTTA GACGAATCGT TCGCCGGACT CGACGTCGAG GGTGACGTAC ACGTCCGCCC GGATGCGTTG GACAAGCCCG CGGAGACCGT CCCCGAGCGT CTGCTCGCCT TCTTCGCGGC GAACCGCGAG CGACTGGAAG CGGCCGCGGC GGTCCACGAG ACGGCGAACC TCTCCCCCGC GGCCGACCTC GACCGCCTCC GCGACGCCCT CGCGCGGCTC GACGACGACG GGACGGTCGT CGGCGACGGG GAGCTCGAAC GACTGACCGC CGCCGTCGAC GACCTCGATG CCGCGGTGTC GACGGCGGAG TCAGTGGCCG ACGACCGGCT CAGAGAGGCG ATCCGCGAGC GCGACGTGAC CATCGAGGGG ACCGACTTCC TCTCGCTGGT CGAGCAGGGC GCCCGGGTCG ACTCGCTCTT AGACCGCGAG CTGGCCGACG AGTACGACGC GGCGATGGCC CGCGCTCGCG AGCACCTCGC GGACGCGCTC CGCTTGGAGC CCGAGGAGGC GGAACTCGCC GACCGGGTGT TCGGCGACGA CCCCTCTTTC CCGGTCGAGC ACGATGAGAG CGCGGTCTCG CGGCTCCGCA CCGAGCTTGC GGCGGCGCGC GACCGCCGGG CTGCCCGCCT GAAGGCGGAA CTCGCGAGCG ACCTCGGCGA CCTGCGGGAG CCCGTGGAGG AACTCGTCCG GGACGCCCTC GAACTCGACG TGGAACTGGC GATCGCGCGA TTCGCGCGCG ACTTCGACTG CGTCATGCCC GAGGTGGTTG ATTCGGACGG GGACGGCTCT CCCGGACCCT CCGGCTTTCG GATTGTGGGC GGCCGCTCCT CACTTCTCGA CGTTGATTTC GAGAACGTGG AGCCGATCGA CTACGCGGTG TCGGGCGCGA CGCTCCTCTC GGGAGTCAAC TCCGGCGGGA AGACCTCGAC CCTCGATCTC GTGGCGCTCG TCGTCGTCCT CGCGCAGATG GGGATGCCCG TCCCCGCCGA GTCCGCCACC GTCGAGCGCT TCGAGGAGAT CCACTACTAC GCCAAATCGC AGGGAACCCT CGACGCGGGC GCGTTCGAGG CGACCCTGCG GGACTTCGGC GACCTCGTCG AGGGCGCGGA CGGGCGGCTC GTCTTGGTCG ACGAGCTTGA GTCGATCACG GAGCCGGGCG CCTCCGCGAA GATCATCGCG GGCATCCTCG AAGCGCTCGA CGAGCAGGAC GCCACCGCCG TCTTCGTCTC CCACCTGGCC CGCGAGATCC GGGACGCGGC CGACTTCGAA GTCGCCGTCG ACGGAATCGA GGCCGCCGGG CTCGTCGACG GCGAGCTACG GGTGAATCGC TCACCGCGGA AGGGTCACCT CGCGCGGTCG ACCCCGGAGC TCATCGTCGA GAAGCTCGCG GGCGACCGCG ACACCGACTT CTACGGGGAC TTACTGGAGA AGTTCTGA
|
Protein sequence | MELEAIPGVG AKTAAALHEL DDPVATVESG DVAAIARAPG VNEARAARIA RGAIRRRHDD DGRVLATDRA REVYRSAIDL LRERTVTDYA AKRLETFYPS ESTSRIAEAQ AFVEGAMERE PDPAVHEALA GVKPLIDPPT VRVRDRCLAT ADAEALARAE SAVPELSVET VENARDISEL ARSYATVIVL DESFAGLDVE GDVHVRPDAL DKPAETVPER LLAFFAANRE RLEAAAAVHE TANLSPAADL DRLRDALARL DDDGTVVGDG ELERLTAAVD DLDAAVSTAE SVADDRLREA IRERDVTIEG TDFLSLVEQG ARVDSLLDRE LADEYDAAMA RAREHLADAL RLEPEEAELA DRVFGDDPSF PVEHDESAVS RLRTELAAAR DRRAARLKAE LASDLGDLRE PVEELVRDAL ELDVELAIAR FARDFDCVMP EVVDSDGDGS PGPSGFRIVG GRSSLLDVDF ENVEPIDYAV SGATLLSGVN SGGKTSTLDL VALVVVLAQM GMPVPAESAT VERFEEIHYY AKSQGTLDAG AFEATLRDFG DLVEGADGRL VLVDELESIT EPGASAKIIA GILEALDEQD ATAVFVSHLA REIRDAADFE VAVDGIEAAG LVDGELRVNR SPRKGHLARS TPELIVEKLA GDRDTDFYGD LLEKF
|
| |