Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_2660 |
Symbol | |
ID | 8743274 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | + |
Start bp | 2732035 |
End bp | 2733822 |
Gene Length | 1788 bp |
Protein Length | 595 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 646513249 |
Product | DNA mismatch repair protein MutS domain protein |
Protein accession | YP_003404209 |
Protein GI | 284165930 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1193] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGACTCG AGGAGTACTG GGGCGTCGGG CCGAAGACGC GGGCGACGCT GGTCGAGGAA CTGGGACGGG ACCGCGCGAT CGAGGCGATC GAGAGCGGCG ACGTGCGGGC GCTCGCGGAC GCCGGCCTCG CTCGCGGCCG GGCGACGCGA ATCCTGCGCC GAGCGACGGG CGGCGACGGG ATGGACGTGC TGGCGACGAG TGACGCCCGA TCGGCGTACA AGGAACTGCT CGATCTGGCG GTCGAACACG CCGTCACGCA GCGCGCGGCC GACCGTATTC GCGTGCTGAC GCCGCTGACC GACCGCGAGG CAATGGAGGA CCGGTTGGAC GACGTCCTCG CGGCCCGGGA CGCCTGGGCC GCACTCGAGA CCGACGACCG CGAGGCCGTT CTCGCGGCCT ACGACCGCTA CGACGAGCGC GCGGGGAGCG AGCGCGCCGC CGTCGAGGCC GCGCTCGCCC TGCTCGAGGC CGGCGTCGAC TCGGGACCGT TCGCCGCCAT CGCGGATCTC GAGCGGGACC GACTCGCGGA GGCCGCCGAG GCGCTCGCGG CGCTGGACGG CGACCGCGGG CGAGTGCGGG AGGGCGCCGA CGAGGACCTC GACCGCTTGC GCGAGGCGCT CGGCGCGGTC GAGGACATGG ACGCCAACGC CCTCGAGTTG ATCGAGGAGT TGCGGTCGGA CGGGGTTCGC GACGTCGGCC AGTTCCGCGA GGCCTTCGAG GACCACCTGC TGACCGAGAC GGCGGTGACC GCCGACCGGG TTCGCGACGC GATGCCGACG GACGCGACCG ACGCGACGGA CTTCGTCGGT GCCACGCTGC GAACCCTCCG GAGCGACCTC ACCGACGCGA TCGACGAGCA CGAGGAAACC GTCGCGAGCG ACCTCGAGGC GACCCTCGAG GAGACGAGCG ATGCCGTCGA GCGGGCCGTG TCCGCCGTCG ACGACATCGC CTTGCACCTC TCGCTGGCGC GGTTCGCCCT CGAGTACGAC TGTACCCGGC CGACGTTCAT CGAGGCCCCC GAGGCCGCCG TTTCCGTCGT CAACGCCCGA AACCTCACCC TCGCCGCCGT GGACGACGAG TCCGTTCAGC CGGTCACCTA CGCGCTCGGC GACCACGGGG TGACTGAGGT ACCAACAGAT GTGAACGCGG TTCCCGGCGA GGAGCGCGTC TCCGTGCTCA CCGGAGCCAA CAGCGGCGGG AAGACGACGC TGCTCGAGAC CCTCTGCCAA GTGGTCCTGC TGGCGATGAT GGGGCTGCCC GTCCCCGCCG ATCGGGCCGA GGTGACGCCC GTCGACGCGC TGGTCTTCCA CCGTCGCCAC GCGAGTTTCA ACGCGGGCGT CCTCGAGTCG ACGCTGCGCT CGATCGTACC GCCGCTGTCG GCCGGCGGGC GCACCCTGAT GCTGGTCGAC GAGTTCGAGG CGATCACCGA ACCCGGCAGC GCGGCCGACC TCCTACACGG GCTGGTGACG CTGTCCGTCG ACCGCGACGC GCTGGGCGTG TTCGTCACCC ACCTCGCGGA CGACTTAGAG CCGTTGCCGC CGGAAGCGCG GGTCGACGGC ATCTTCGCGG AGGGGCTGAA CCCCGACCTC GAGTTGCTGG TCGACTACCA GCCCCGCTTC GATACGGTCG GGCGGTCGAC GCCGGAGTTC ATCGTCTCGC GACTCGTCGC GAACGCCGAC GACCGCGGCG AGCGCGCGGG CTTCGAGACG CTCGCGGAGG CGGTCGGCAA CGACGTCGTC CAGCGAACGC TGGCCGACGC CCGCTGGACG GAGACGAAGA GCGACTAG
|
Protein sequence | MRLEEYWGVG PKTRATLVEE LGRDRAIEAI ESGDVRALAD AGLARGRATR ILRRATGGDG MDVLATSDAR SAYKELLDLA VEHAVTQRAA DRIRVLTPLT DREAMEDRLD DVLAARDAWA ALETDDREAV LAAYDRYDER AGSERAAVEA ALALLEAGVD SGPFAAIADL ERDRLAEAAE ALAALDGDRG RVREGADEDL DRLREALGAV EDMDANALEL IEELRSDGVR DVGQFREAFE DHLLTETAVT ADRVRDAMPT DATDATDFVG ATLRTLRSDL TDAIDEHEET VASDLEATLE ETSDAVERAV SAVDDIALHL SLARFALEYD CTRPTFIEAP EAAVSVVNAR NLTLAAVDDE SVQPVTYALG DHGVTEVPTD VNAVPGEERV SVLTGANSGG KTTLLETLCQ VVLLAMMGLP VPADRAEVTP VDALVFHRRH ASFNAGVLES TLRSIVPPLS AGGRTLMLVD EFEAITEPGS AADLLHGLVT LSVDRDALGV FVTHLADDLE PLPPEARVDG IFAEGLNPDL ELLVDYQPRF DTVGRSTPEF IVSRLVANAD DRGERAGFET LAEAVGNDVV QRTLADARWT ETKSD
|
| |