Gene Information |
Locus tag | TK90_0784 |
Symbol | |
ID | 8806537 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | + |
Start bp | 836939 |
End bp | 838528 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | DNA mismatch repair protein MutS domain protein |
Protein accession | YP_003460035 |
Protein GI | 289207969 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
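The coordinate and length fields in the table above agree with one another, and a few lines of arithmetic can confirm this. The sketch below assumes that the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. Neither assumption is stated in the record, and the variable names are illustrative.

```python
# Values copied from the Gene Information table above.
start_bp = 836939
end_bp = 838528

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as an amino acid.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp)     # 1590
print(protein_length_aa)  # 529
```

Both results match the Gene Length and Protein Length rows.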
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGTTG ATCTGAAGGC CCTCGAATTC CCCGCGATCC AGCGTCTGCT GGAGCGCCTG ACCGCGACGC CCTACGGCGC GGATGCCGCG CGCGGGCTGG AACCGGCACC GAATCTGGAT GCGGCCCGCG CACTGCAGAC CGCGGTGACG GTCGCGCGCC GACGTCTGGA TGCCGGCACC CTGCCGCGGC TGGGGCAGCT GCCGGATGTG CGCGCGGCCC TGCGTCAGGC CTCCAATCCG GGCTCGGCGC TGCCGGTGCA GGCCCTGCAC AACCTGCAGA CCATCATGCG CCAAGCCCGC GAGCTGGCGG ATCAGCTGGC GGATACGCCC GAGATCTATC CGGCGGACCT GAACAAGCTG TACCCACCCG AGGGGCTGGA AGAACGACTG TCGGCATGCC TGAACCCCGG CGGGTCGCTG CGCGAGGACG CGAGCCCCTC GCTGATCGAG GCCTTCGAGC AGCGCGGACG CCTGCGCGAG GAGGTCGAGG CGGTGGTGAA AAAGCGCCTG GCGGCCTCCG ATGTCGCGCA AAAGGGCGAG GATGCCTTGA AGGTGCAGTG GCACCAGGAG CGCGCGGTGA TGGTGCTGCG TGGCGAGGCG GCGAATGCGG TCAAGGGCGT GCGTCGTGGC ACCGCGATGG GCGGGCGTGA CCAGATCGTC GAGCCGATCG AGGCGGTGCC GCTGAACAAC CAGCTGGATA CGGTCAACGG CCAGATCAAC ACCGAACAGC AGCGCCTGCT GCGCGAGCTG ACCGACGTGG TGCGCCAGTA TGGCGAGCCG CTGGAACTGA TGCTGACCGC GTTGACCTGG ATCGACCTGG CGTCCGCCGC CGCGCAGCTC TCGGCACAGA TGAATGCCCA TGCCCCGCGG CTGGAGGCGG AGGCCGGGGT GGAACTGATC GAGGCCTATC ACCCGCTCCT GCTGTTGCAG TTCGCCGAGG GCAACGGGCC GCAGCCGGTG CCGCTGTCCA TCCGGCTGGA TGGCGAGCAG CCGCTGCTGC TGATTACCGG ACCGAACACC GGCGGCAAGA CGGTCGCGCT GAAGACGCTC GGGCTGCTGG TCACCATGGC CTGGTGCGGG CTGCATATCC CGGCCGAGCA GGACTGTCGC ATCGGGCGTT TCGATCGGGT GATGGTCGAT GTCGGCGACC ACCAGAGCCT GTTCCACCAC CTCTCGACCT TCGCCGGGCA TGTGGAGGTC CTGAAACGTA TCCTCGACCA CGCCGGGCCG GAGAGCCTGA TCCTGCTGGA CGAGTTGGGT ACGGGCACCG ACCCGGACGA GGGTGCGGCG CTGGCGATGG CGATGCTGGA CGAGCTGCGC GCGCGTGGTA CGCGCGGGAT CGTGAATACC CATCTGGCGC CCCTGAAGGA CTACGCCGCC CAGCACGCGG GCATCGTGAA CGCCTCGATG CAGTTCGACG CCGAGACGCT GTCCCCGACC TACCGGCTGC TGATCGGCGA GCCGGGTGTG TCGTTCGGCC TTACGATTGC CGAGAAGAAC GGGCTGCCGC CCCAGCTGGT TGCCCGCGCG CGCGAGCATT TCGCCGAACT CCCCACCGCC CAGGCCGGGG GCGATGCCGG CAAGGCCTGA
|
Protein sequence | MQVDLKALEF PAIQRLLERL TATPYGADAA RGLEPAPNLD AARALQTAVT VARRRLDAGT LPRLGQLPDV RAALRQASNP GSALPVQALH NLQTIMRQAR ELADQLADTP EIYPADLNKL YPPEGLEERL SACLNPGGSL REDASPSLIE AFEQRGRLRE EVEAVVKKRL AASDVAQKGE DALKVQWHQE RAVMVLRGEA ANAVKGVRRG TAMGGRDQIV EPIEAVPLNN QLDTVNGQIN TEQQRLLREL TDVVRQYGEP LELMLTALTW IDLASAAAQL SAQMNAHAPR LEAEAGVELI EAYHPLLLLQ FAEGNGPQPV PLSIRLDGEQ PLLLITGPNT GGKTVALKTL GLLVTMAWCG LHIPAEQDCR IGRFDRVMVD VGDHQSLFHH LSTFAGHVEV LKRILDHAGP ESLILLDELG TGTDPDEGAA LAMAMLDELR ARGTRGIVNT HLAPLKDYAA QHAGIVNASM QFDAETLSPT YRLLIGEPGV SFGLTIAEKN GLPPQLVARA REHFAELPTA QAGGDAGKA
|
| |
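The sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be stripped before any computation. The sketch below shows one way to compute GC content from a sequence in that layout. The function name `gc_content` is hypothetical, and the example input is only the first two blocks of the gene sequence, not the full gene.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First two 10-base blocks of the gene sequence above, used as a small example.
example = "ATGCAGGTTG ATCTGAAGGC"
print(f"{gc_content(example):.0%}")  # 50%
```

Passing the complete gene sequence to the same function gives a value that can be compared with the GC content row.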