Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_3645 |
Symbol | |
ID | 4244163 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 5602599 |
End bp | 5605304 |
Gene Length | 2706 bp |
Protein Length | 901 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 638108595 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_723183 |
Protein GI | 113477122 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.473403 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATATT CCGCATCTAC ATCTACTCCA AAATCTGCAC AACCCAAAGA AGAGGAACTA GAAAATTCTC TTCCTACTAA TGCTGATTAT AGTAAAATTG ACGTTAGCAA ACTGTCAGAA ATGATGCAAC GTTATGTGGA AGTTAAACAA CAATATTCTC ATGCTCTTTT ACTATTTCGA GTAGGTGACT TTTTTGAGTG TTTTTTTCAG GATGCTGTTA CGATCGCGCA AGAATTAGAA CTAGTACAAA CCACTAAACA CGCTGGCAAA GAAATTGGTA GAGTCCCCAT GACTGGTGTA CCTCATCATG CGGTAGAAAA ATATGCTACT TTCTTAGTAG AAAAAGGTTA TGCGGTGGTC GTTTGCGATC AAGTAGAAGA CTCTGCTATT GCTAAAAAAG AAAACCGTCA AGTCAAGCGT GAAATTACTC GCATCCTTAC TCCCGGCACT CTTACCGATG ATGGAATGCT GAAAGCACGC TACAATAATT ATTTAGCTGC AGTCGTTATT GCCAAAAATT ATTGGGGACT TGCTTACACG GATATTTCTA CTGGAGAGTT TCTCACGACT CAAACTGAAG GTTTAGACCA GCTAACCCAA GAATTAATGC GTTTGCAACC TTCGGAGGTG CTATTTCCGA CTAAAGCACC AGATATAGGT TTTATGTTAC GGCCAGGAGA AAGATCGGAT CATTTACCGG AATATCTCCC TCATTCTTTT TGCTATTCTC TGCGCCCACA ACAACCTTTT AGTTTGGGGG AGGCAAAGGA GCGACTGTTG ATGAAATTTC AACTGGCATC CCTCGAAGGT CTCGGTTGCG AACGTCTTCC TTTGGGGGTG CGTGCTGCGG GAGGTTTACT GGAATATCTT GAAGAAACCC AAAAGGAAAA TCAAGTTCCT TTACAACGTT TGCGTAGCTA TACTTTGGCA GATTTTTTGA TTCTCGATCA CCAGAGTCGG CGGAATTTAG AAATTACTCA AACGGTGCGG GATGGTAGTT ATCAAGGTTC GTTGTTGTCG GTAGTTGACA AGACTAGTAC TGCAATGGGT GGGCGTGCTT TAAGACGTTG GTTGCAACAA CCACTTCTTA GTTTGAAGGG TATTCGTGCT AGACATGATA CTATTGACGA GCTGATACAA AATAATGATC TACGTCAAGA TATTCAAAGA GTATTACGTC AAATTTACGA TTTAGAGCGT TTAACTGGCC GTACTGGTGC TGGTACGGCA AATGCTAGAG ATTTAGTTTT TTTAGCTGAC TCTTTAACGA AACTTCCTGA ACTTTCTACT TTCGTTTCTC AAGGTAATTC TCCTTATTTA AAGGTGTTGC AAAAAATACC ACCAATATTA CAAGAATTGG GAAAAAAAAT TCATTCCAAT TTAGTTGAGT CTCCTTCTCA AAAGTTAAAA GAAGGAGGGT TAATTCGCCC TGGTATAAAT GAACGATTAG ATGAGATGCG GAAGTTAGCA GAAGAAGACC AAAAATGGAT TGCTTCTTTG GAGACAACGG AGAGAGAAAG GACTGGAATT CCTAATTTAA AGGTTGGTTA TAATAAGGCT TTTGGTTATT ACATTAGTAT TTCTAAATCA AAGGCAAATT TGGCTCCGGA TGATTATACT CGGAAGCAAA CTTTGACGAA TGAGGAGCGT TATATTACTG AGGAATTAAA GGAAAGAGAA GTTAGAATTT TAACGGCACA AGATGATTTG AATGAGCTGG AATATGATAT TTTTGTTGAT TTAAGAAATG AAGTAGGGGA ATATGCAGAA GAGATTAGAA ATGTTTCCCG CGCTGTGGCA GCTCTTGATA TTTTATGTGG TTTGGCAGAT GTAGCAATTT ATCAAAATTA TGTCCGTCCT ACTATGGTTG ATAGCCGAGA ATTGAAAATT ATTGAAGGTC GTCATCCGGT GGTAGAAAAA TATTTACCTG CTGGGTTTTT TGTACCAAAT ACTGCTATAT TGGGAAGTAA AAATTTAGAG AAAAATAATT CGGGAATTAC TCCCTATTCG GCGCCGGATT TAATTATTTT AACTGGTCCT AATGCTAGTG GTAAAAGTTG TTATTTACGG CAGGTAGGAT TGATTCAATT AATGGCACAA ATTGGCAGTT TTGTTCCGGC AAGCTCTGCT GTTTTAGGGG TGAGCGATCG CATCTTTACT CGTGTGGGAG CTGTGGATGA TTTAGCTACT GGTCAATCAA CTTTTATGGT GGAGATGAAT GAAACGGCAA ATATTTTGAA TCATGCTACG GAAAAGTCTT TGGTTTTGTT GGATGAAATT GGCAGGGGAA CGGCAACTTT TGATGGAATT TCGATTGCTT GGTCAGTGGC AGAATATTTG GCAACGGAAA TTTTGTCTCG GACAATTTTT GCTACTCATT ACCACGAATT AAATGAACTT TCTTCTATTT TGGATAATGT GGCAAACTAT CAGGTAACAG TGAAAGAATT GCCGGATAAA ATTGTATTTT TGCATCAAGT ACAACCTGGT GGGGCGGATA AGTCTTATGG TATTGAAGCG GGAAGATTAG CTGGTTTACC AGATTCAGTA ATTGCAAGAG CAAGACAGGT AATGCAGCAA ATTGAAAACC ATAGCAAAAT AGCTATTGGT TTACGAAAAG GAATTAATAA AAAAGAAGAG GAAGAAATTA TAACTGTGGA GCAGTTAGAT ATTTTTAGTG AAGAATTTGG AGATAGTTTA TTATGA
|
Protein sequence | MKYSASTSTP KSAQPKEEEL ENSLPTNADY SKIDVSKLSE MMQRYVEVKQ QYSHALLLFR VGDFFECFFQ DAVTIAQELE LVQTTKHAGK EIGRVPMTGV PHHAVEKYAT FLVEKGYAVV VCDQVEDSAI AKKENRQVKR EITRILTPGT LTDDGMLKAR YNNYLAAVVI AKNYWGLAYT DISTGEFLTT QTEGLDQLTQ ELMRLQPSEV LFPTKAPDIG FMLRPGERSD HLPEYLPHSF CYSLRPQQPF SLGEAKERLL MKFQLASLEG LGCERLPLGV RAAGGLLEYL EETQKENQVP LQRLRSYTLA DFLILDHQSR RNLEITQTVR DGSYQGSLLS VVDKTSTAMG GRALRRWLQQ PLLSLKGIRA RHDTIDELIQ NNDLRQDIQR VLRQIYDLER LTGRTGAGTA NARDLVFLAD SLTKLPELST FVSQGNSPYL KVLQKIPPIL QELGKKIHSN LVESPSQKLK EGGLIRPGIN ERLDEMRKLA EEDQKWIASL ETTERERTGI PNLKVGYNKA FGYYISISKS KANLAPDDYT RKQTLTNEER YITEELKERE VRILTAQDDL NELEYDIFVD LRNEVGEYAE EIRNVSRAVA ALDILCGLAD VAIYQNYVRP TMVDSRELKI IEGRHPVVEK YLPAGFFVPN TAILGSKNLE KNNSGITPYS APDLIILTGP NASGKSCYLR QVGLIQLMAQ IGSFVPASSA VLGVSDRIFT RVGAVDDLAT GQSTFMVEMN ETANILNHAT EKSLVLLDEI GRGTATFDGI SIAWSVAEYL ATEILSRTIF ATHYHELNEL SSILDNVANY QVTVKELPDK IVFLHQVQPG GADKSYGIEA GRLAGLPDSV IARARQVMQQ IENHSKIAIG LRKGINKKEE EEIITVEQLD IFSEEFGDSL L
|
| |