Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_2875 |
Symbol | |
ID | 4076409 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 3042910 |
End bp | 3045549 |
Gene Length | 2640 bp |
Protein Length | 879 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 638008204 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_614869 |
Protein GI | 99082715 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.928835 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGTCA CGCCGATGAT GGCGCAATAT CTCGAGATCA AGGCGCAATA CCCGGATGCG CTCCTGTTTT ATCGCATGGG CGATTTCTAC GAGATGTTCT TTGAGGATGC GGTCAACGCG GCCGAGGCGC TTGATATCGC GCTCACAAAA CGGGGCAAAC ACGAAGGCGA GGATATTCCC ATGTGCGGCG TTCCCGTGCA TGCCGCCGAA GGGTATCTTT TGACCCTGAT CCGCAAAGGG TTTCGCGTTG CCGTGGGCGA GCAGCTTGAA AGCCCCGCAG AAGCCAAGAA ACGTGGTTCC AAATCCGTTG TGAAACGTGA CGTGGTACGC CTGGTGACGC CCGGCACGCT TACCGAGGAT TCCCTGCTCG AAGCGCGTCG CCATAACTTC CTGGTGGCCT ATTCCGAACT GCGTGACCAG GCCGCCCTGG CTTGGGCCGA TATATCAACC GGCGCGTTTC ACGTCATGCC CGTCGCCCGC GTGCGTCTCA GTCCAGAGCT TGCTCGTCTT GCCCCATCAG AGCTGATCGT TGCAGATGGC CCCATCTTTG ACGCCACATT GCCTTTGGCA GAGGAGTACA AAATCCCGCT CACGCCTCTT GGGAAGGCAA GCTTTGACAG TACCGCTGCA GAAAAACGTC TCTGCCATCT GTTCAATGTG AGCGCGCTCG ATGGTTTTGG CACCTTCAAC AGGGCCGAAA TCTCGGCCAT GGGCGCTGTC GTGGACTATC TGGAGATCAC GCAGAAAGGC AAACTGCCCC TGTTGCAGCC TCCACTCCAG GAATCCGAGG ATCGGACAGT CCAGATCGAC GCCTCAACCC GGCGTAACCT CGAACTCACC CGCTCGCTAT CTGGCGGACG TGCTGGATCT CTTTTGTCTG TTGTGGATCG CACCGTCACT CCGGGCGGCG CCCGACTGCT CGAACAACGC CTTTCCAGCC CCTCTCGCAA TCTCGACGTG ATTTCCGCGC GCCTCGAGGC TCTGGATACG ATCGTCGAAG ACCCCATTCG CTGTGATACG TTGCGTGGCC TTCTCCGCAA AACACCTGAT ATCGACCGCG CGCTTTCGCG CCTTGCGCTT GATCGGGGCG GACCACGCGA CCTCGCTGCC ATTCGCAACG CCCTGAGCCA AGGCGAAGAC ATCGAACGGG CACTACAGGA TCCGGATCTG CCGACCCTGC TGCGCGATGC GGCACACTCC CTCGAAGGGT TCCAAGATCT GCTCTCCCTC CTCGATGCCG CCTTGATCGC CGAGCCCCCC CTGCTGGCCC GTGATGGCGG CTTTATCGCA GCAGGCTATG ATCGCGAACT CGATGAAGCG CGCACCCTCA GAGATGAGGG CCGCTCTGTC ATCGCAGGTC TGCAGAAAAA ATATGCAGAG CATACGGGAA TCAGCTCACT CAAGATCAAG CACAACAATG TGCTTGGCTA TTTCATTGAA ACCACATCGA CGCACGCCGC AAAGATGCAG TCAGCGCCGA TGTCAGACAC CTATATTCAT CGTCAAACCA CCGCAAACCA AGTCCGTTTC ACAACCGTGG AACTAAGCGA AATCGAGACC AAGATTCTGA ACGCCGGAAA TCTGGCGCTT GAGATCGAAA AACGGCTCTA TCAAAGGCTT TCTGGCGCTA TTCTAGACAG CGCTGCGCGG CTCAATCAGG CCGCGCGCGG GTTTGCCGAG ATCGATTTGG TCACCGCATT GGCAGATCTT GCACGCGCGG AAAACTGGAC CCGACCGCGC GTTGATACAT CTCGTGCGTT TCACGTGGAC GGCGGACGTC ATCCGGTTGT GGAACAAGCG TTGCGCCATC AAGGCGGTGA CAGCTTTGTG GCGAATGACT GTGATCTCAG CCCTCAAGAC GGAGCAGCGA TCTGGCTTCT CACCGGGCCC AACATGGCCG GTAAATCGAC CTTCTTGCGT CAGAACGCCC TGATTGCCGT GCTTGCTCAA ATGGGCAGCT ATGTCCCCGC AGAAGCAGCT CATATCGGCA TGATCAGCCA GTTGTTCAGC CGCGTTGGCG CATCAGACGA TCTCGCGCGT GGACGCTCGA CCTTTATGGT GGAAATGGTA GAGACCGCTG CCATTCTGAA TCAGGCCGAT GATCGCGCAC TGGTGATCCT TGATGAAATC GGGCGTGGCA CGGCAACCTA CGATGGCCTA TCGATCGCCT GGGCGACGCT CGAACATCTG CATGAGGTCA ACCGCTCCCG GGCGCTCTTT GCAACGCACT ATCACGAATT GACGCAACTC GCGACAAAAC TCACCGGTGT CGAGAATGCA ACCGTCTCGG TCAAAGAGTG GGAAGGCGAA GTCATCTTCC TGCATGAGGT CAAAAAGGGC GCAGCGGATC GTTCCTATGG TGTGCAGGTG GCACAGCTTG CCGGTCTACC TGCCTCGGTC GTGGCACGGG CGCGCAGCGT CCTCGATATG CTGGAGAAAA GCAGCCGCGA AGGTGGCGGT GCCGGAAAGG TACAAATCGA TGACCTGCCG TTGTTTGCAG CCGCGCCAGC GCCGCAGCCC AAACCCGCCC AAGGCCCCTC GCCGGTAGAA AAGCTCCTCG AAGAGATCTT TCCCGATGAC CTCACCCCAC GTGAAGCACT CGAAACACTC TATCGGCTCA AGGACGTAAG CAAGGGTTAA
|
Protein sequence | MSVTPMMAQY LEIKAQYPDA LLFYRMGDFY EMFFEDAVNA AEALDIALTK RGKHEGEDIP MCGVPVHAAE GYLLTLIRKG FRVAVGEQLE SPAEAKKRGS KSVVKRDVVR LVTPGTLTED SLLEARRHNF LVAYSELRDQ AALAWADIST GAFHVMPVAR VRLSPELARL APSELIVADG PIFDATLPLA EEYKIPLTPL GKASFDSTAA EKRLCHLFNV SALDGFGTFN RAEISAMGAV VDYLEITQKG KLPLLQPPLQ ESEDRTVQID ASTRRNLELT RSLSGGRAGS LLSVVDRTVT PGGARLLEQR LSSPSRNLDV ISARLEALDT IVEDPIRCDT LRGLLRKTPD IDRALSRLAL DRGGPRDLAA IRNALSQGED IERALQDPDL PTLLRDAAHS LEGFQDLLSL LDAALIAEPP LLARDGGFIA AGYDRELDEA RTLRDEGRSV IAGLQKKYAE HTGISSLKIK HNNVLGYFIE TTSTHAAKMQ SAPMSDTYIH RQTTANQVRF TTVELSEIET KILNAGNLAL EIEKRLYQRL SGAILDSAAR LNQAARGFAE IDLVTALADL ARAENWTRPR VDTSRAFHVD GGRHPVVEQA LRHQGGDSFV ANDCDLSPQD GAAIWLLTGP NMAGKSTFLR QNALIAVLAQ MGSYVPAEAA HIGMISQLFS RVGASDDLAR GRSTFMVEMV ETAAILNQAD DRALVILDEI GRGTATYDGL SIAWATLEHL HEVNRSRALF ATHYHELTQL ATKLTGVENA TVSVKEWEGE VIFLHEVKKG AADRSYGVQV AQLAGLPASV VARARSVLDM LEKSSREGGG AGKVQIDDLP LFAAAPAPQP KPAQGPSPVE KLLEEIFPDD LTPREALETL YRLKDVSKG
|
| |