Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_2159 |
Symbol | |
ID | 7318263 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | - |
Start bp | 2289560 |
End bp | 2291128 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643617054 |
Product | DNA mismatch repair protein MutS domain protein |
Protein accession | YP_002514226 |
Protein GI | 220935327 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1193] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAGCCG ATCTGAAACC CCTGGAGTTC GACTCCATCC GGCGATTGCT GGAACGCCTG ACCCACACCC CCTACGGGGC CGATGCCGCG CGGGCCCTGG AGCCGGCGCC GACCCTGGCG GTGGCCCGTG ACATGCAGCG GGCGGTCACG GCGGCCCGCA CCCGCATCGA TGCGGGGCGC ATGCCGCTCA TGGGACAGCT GCCGGACATC CGTGCGGCCC TGCGTCAGGC GTCTTCTCCG GGCGCGCGGC TCTCCACCCA GGCCATGCAC AACATCCAGG TGGTGATGCG TGCCGCCGGC CAGCTGGCCG GGGCGCTGGA CGAAACGCCG GATATCTATC CTGGCAGCCT GGATGAACTG AAGCCGCCCC AGGCGCTGGT GGAACTGCTG GACAAGAGCC TCGTGGGTGC AGGTTCCCTG CGCGAGGACG CCAGTACCGA GCTGGAAGAG GCCTTTGCTG AGCGGTCGAA GCTGCGTGCC GAGGTGGAGC AGGTGGTGCT CGGGCGCATG GGCCGTGACG ACATCCGCGA CTGCCTCGAT GACCACCGCA AGGTGCAGTG GAACAGCGAG CGCGCGGTGA TCGTGATCCG CGGCACCGAG GCGGACAAGG TCAAGGGGGT GCGCCGGGGC TCGGCCATGG GCGGTCGCGA CCAGATCGTG GAACCCATGG AGGCGGTGCC CCTGAACAAC CGCCTGGACA CCCTGAACGG GCGCATCAAC GCCGAGCAGC AGCGGGTGCT GCGGGAGCTG ACGGCCGGTA TCGCCGAGCA CATGGATGCC CTGAACGGCA TGCTGGATGC ACTCACCTGG GTGGACCTGG CCTTCGCCGC CGGGCAGCTT TCCCACCACA TGAATGCCCA CGCCCCCACC CTGGTGGAAG GGCCGCGGGT GCGCCTCGCG GAGGCCTACC ACCCCCTGCT GCTGATCCAG TTTGCCGACG GCAGTGGACC CCGGCCCGTG CCCCTGTCCA TTCACCTGGA CGGCAACGAC GTGCTGCTGC TGGTCACCGG TCCCAACACC GGCGGCAAGA CCGTGGCGCT CAAGACCCTG GGCCTGATCA CGGTGATGGC CTGGTGCGGC CTGCACGTGC CCGCGGAACA GGACTGCGAG ATCGGCAGCT ATGCCCGGGT GATCGTGGAC GTGGGCGACC ACCAGAGCCT GTTTCATCAC CTGTCCACCT TCGCCGGCCA CGTGGAGGTG CTCAAGCGCA TCCTGGACGA GGCGGACGGC GAGACCCTGG TGCTGCTGGA CGAGTTGGGC ACCGGCACGG ACCCGGAGGA GGGCGCGGCC CTGGCCATGG CGGTGCTGGA CGAGCTGCTG TCCCGCAAGG TCCAGGGCAT CGTCAATACC CACCTCTCGC CGCTCAAGGA CTATGCCGCC AGGCACCCGG GCATCCGCAA CGCCTCCATG CAGTTCGACC ACCAGCGCCT GGCGCCCACC TACCGGCTGA TCATCGGCGA GCCGGGGGTG TCCCTGGGGC TCACCATCGC CCAGAAGAAT GGCCTGCCCG AGGCCCTGGT GGAGCGGGCG AGGGGGCATC TGGCGGCCAT CATCGGGGAG GGGCGTTAG
|
Protein sequence | MQADLKPLEF DSIRRLLERL THTPYGADAA RALEPAPTLA VARDMQRAVT AARTRIDAGR MPLMGQLPDI RAALRQASSP GARLSTQAMH NIQVVMRAAG QLAGALDETP DIYPGSLDEL KPPQALVELL DKSLVGAGSL REDASTELEE AFAERSKLRA EVEQVVLGRM GRDDIRDCLD DHRKVQWNSE RAVIVIRGTE ADKVKGVRRG SAMGGRDQIV EPMEAVPLNN RLDTLNGRIN AEQQRVLREL TAGIAEHMDA LNGMLDALTW VDLAFAAGQL SHHMNAHAPT LVEGPRVRLA EAYHPLLLIQ FADGSGPRPV PLSIHLDGND VLLLVTGPNT GGKTVALKTL GLITVMAWCG LHVPAEQDCE IGSYARVIVD VGDHQSLFHH LSTFAGHVEV LKRILDEADG ETLVLLDELG TGTDPEEGAA LAMAVLDELL SRKVQGIVNT HLSPLKDYAA RHPGIRNASM QFDHQRLAPT YRLIIGEPGV SLGLTIAQKN GLPEALVERA RGHLAAIIGE GR
|
| |