Gene TK90_0784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTK90_0784 
Symbol 
ID8806537 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThioalkalivibrio sp. K90mix 
KingdomBacteria 
Replicon accessionNC_013889 
Strand
Start bp836939 
End bp838528 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content69% 
IMG OID 
ProductDNA mismatch repair protein MutS domain protein 
Protein accessionYP_003460035 
Protein GI289207969 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGTTG ATCTGAAGGC CCTCGAATTC CCCGCGATCC AGCGTCTGCT GGAGCGCCTG 
ACCGCGACGC CCTACGGCGC GGATGCCGCG CGCGGGCTGG AACCGGCACC GAATCTGGAT
GCGGCCCGCG CACTGCAGAC CGCGGTGACG GTCGCGCGCC GACGTCTGGA TGCCGGCACC
CTGCCGCGGC TGGGGCAGCT GCCGGATGTG CGCGCGGCCC TGCGTCAGGC CTCCAATCCG
GGCTCGGCGC TGCCGGTGCA GGCCCTGCAC AACCTGCAGA CCATCATGCG CCAAGCCCGC
GAGCTGGCGG ATCAGCTGGC GGATACGCCC GAGATCTATC CGGCGGACCT GAACAAGCTG
TACCCACCCG AGGGGCTGGA AGAACGACTG TCGGCATGCC TGAACCCCGG CGGGTCGCTG
CGCGAGGACG CGAGCCCCTC GCTGATCGAG GCCTTCGAGC AGCGCGGACG CCTGCGCGAG
GAGGTCGAGG CGGTGGTGAA AAAGCGCCTG GCGGCCTCCG ATGTCGCGCA AAAGGGCGAG
GATGCCTTGA AGGTGCAGTG GCACCAGGAG CGCGCGGTGA TGGTGCTGCG TGGCGAGGCG
GCGAATGCGG TCAAGGGCGT GCGTCGTGGC ACCGCGATGG GCGGGCGTGA CCAGATCGTC
GAGCCGATCG AGGCGGTGCC GCTGAACAAC CAGCTGGATA CGGTCAACGG CCAGATCAAC
ACCGAACAGC AGCGCCTGCT GCGCGAGCTG ACCGACGTGG TGCGCCAGTA TGGCGAGCCG
CTGGAACTGA TGCTGACCGC GTTGACCTGG ATCGACCTGG CGTCCGCCGC CGCGCAGCTC
TCGGCACAGA TGAATGCCCA TGCCCCGCGG CTGGAGGCGG AGGCCGGGGT GGAACTGATC
GAGGCCTATC ACCCGCTCCT GCTGTTGCAG TTCGCCGAGG GCAACGGGCC GCAGCCGGTG
CCGCTGTCCA TCCGGCTGGA TGGCGAGCAG CCGCTGCTGC TGATTACCGG ACCGAACACC
GGCGGCAAGA CGGTCGCGCT GAAGACGCTC GGGCTGCTGG TCACCATGGC CTGGTGCGGG
CTGCATATCC CGGCCGAGCA GGACTGTCGC ATCGGGCGTT TCGATCGGGT GATGGTCGAT
GTCGGCGACC ACCAGAGCCT GTTCCACCAC CTCTCGACCT TCGCCGGGCA TGTGGAGGTC
CTGAAACGTA TCCTCGACCA CGCCGGGCCG GAGAGCCTGA TCCTGCTGGA CGAGTTGGGT
ACGGGCACCG ACCCGGACGA GGGTGCGGCG CTGGCGATGG CGATGCTGGA CGAGCTGCGC
GCGCGTGGTA CGCGCGGGAT CGTGAATACC CATCTGGCGC CCCTGAAGGA CTACGCCGCC
CAGCACGCGG GCATCGTGAA CGCCTCGATG CAGTTCGACG CCGAGACGCT GTCCCCGACC
TACCGGCTGC TGATCGGCGA GCCGGGTGTG TCGTTCGGCC TTACGATTGC CGAGAAGAAC
GGGCTGCCGC CCCAGCTGGT TGCCCGCGCG CGCGAGCATT TCGCCGAACT CCCCACCGCC
CAGGCCGGGG GCGATGCCGG CAAGGCCTGA
 
Protein sequence
MQVDLKALEF PAIQRLLERL TATPYGADAA RGLEPAPNLD AARALQTAVT VARRRLDAGT 
LPRLGQLPDV RAALRQASNP GSALPVQALH NLQTIMRQAR ELADQLADTP EIYPADLNKL
YPPEGLEERL SACLNPGGSL REDASPSLIE AFEQRGRLRE EVEAVVKKRL AASDVAQKGE
DALKVQWHQE RAVMVLRGEA ANAVKGVRRG TAMGGRDQIV EPIEAVPLNN QLDTVNGQIN
TEQQRLLREL TDVVRQYGEP LELMLTALTW IDLASAAAQL SAQMNAHAPR LEAEAGVELI
EAYHPLLLLQ FAEGNGPQPV PLSIRLDGEQ PLLLITGPNT GGKTVALKTL GLLVTMAWCG
LHIPAEQDCR IGRFDRVMVD VGDHQSLFHH LSTFAGHVEV LKRILDHAGP ESLILLDELG
TGTDPDEGAA LAMAMLDELR ARGTRGIVNT HLAPLKDYAA QHAGIVNASM QFDAETLSPT
YRLLIGEPGV SFGLTIAEKN GLPPQLVARA REHFAELPTA QAGGDAGKA