Gene Information |
Locus tag | STER_1737 |
Symbol | |
ID | 4438029 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptococcus thermophilus LMD-9 |
Kingdom | Bacteria |
Replicon accession | NC_008532 |
Strand | + |
Start bp | 1624973 |
End bp | 1627324 |
Gene Length | 2352 bp |
Protein Length | 783 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 639677325 |
Product | MutS family DNA structure-specific ATPase |
Protein accession | YP_821074 |
Protein GI | 116628455 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1193] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01069] MutS2 family protein |
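The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1


length = gene_length_bp(1624973, 1627324)
print(length)                     # 2352
print(protein_length_aa(length))  # 783
```

Both values match the Gene Length and Protein Length fields of this record.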
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAACACAA AGATTTTAGA CCAATTAGAA TTTAACAAGG TCAAAGACCA GTTCACAGAA TACCTGCAGA CCGAACAGGC TCAGGCGGAG TTACGTGACT TGGTGCCTAT GACCAATCCA GAAAGAATTC AAAACCAATT TACAGAAATC CAAGAAATGT CTGAGATTTT CATAGAGCAT CATGGCTTTG CCATTGGTAG CTTAAGAGAT ATTTCTGAGC CCTTGCGTCG CTTGGAACTG GATGCTGACC TCAACATCCA GGAGCTCATT GCTATCAAGA AGGTCTTGCA AGCTTCAGCT GACCTTAGCC GCTTCTACGC TGACCTTGAA AATGTTGAAC TCATCGCTCT TAAACGTCTT TTTGAAAAGA TTGAGGCCTT TCCAAGTCTA CAAGGTAGTC TTCAAAGTAT TAATGATGGT GGTTTTATTG AGCATTTTGC TAGTCCTGAG TTGCAAAATA TCCGTCGTCA ACTTAAGGCT TGTGATGATG CCATTCGCCA GACCTTGCAA GACATTCTCA AGAAATCTGG GCACATGTTA GCCGAGAACT TAATTGCTAG TCGTAATGGC CGTTCAGTGC TTCCTGTCAA AAATACCTAC CGTAACCGTA TTGCGGGGGT TGTTCATGAC ATCTCAAGCT CTGGGAACAC AGTGTATATC GAGCCACGAG CCGTTATCCA GCTTAACGAA AAAATTACAC AATTACGTGC AGACGAGCGC CATGAGATGG CACGTATCCT ACATGAACTG TCTGATCAAC TCCGTCCACA TACTGCTGCC ATCGCTAACA ATGCTTGGAT TTTGGGCCAT ATGGACTTCA TTAGAGGAAA ATACCTCTAT CTCCATGATA AAAAGGCAAT TATTCCAGAG ATTAGTGACA ACCAAACACT CCAACTGCTT AACGTTCGTC ACCCTCTTTT AATCAATCCA GTAGCCAACG ATTTGCGCTT TGATGAGGAT CTCACGGTCA TTGTCATTAC AGGTCCCAAT ACTGGTGGTA AAACGGTCAT GCTTAAAACT CTAGGACTAG CTCAACTAAT GGCTCAATCT GGTCTGCCAA TTTTGGCTGA TAAGGGCAGT CGTGTAGCCA TTTTCCAAGA GATTTTTGCG GATATCGGAG ACGAACAGTC TATCGAACAG AGCTTGTCAA CTTTCTCTAG TCACATGACG CACATTGTCG AGATTTTGAA CACTGCCGAC AGCAATAGCT TGGTCTTGGT CGATGAGCTT GGGGCTGGTA CCGACCCTCA GGAAGGGGCT AGTCTAGCCA TGGCAATCCT AGAGCACCTC CGTTTGAGCC AGATTAAGAC CATGGCTACT ACTCACTATC CTGAGCTTAA GGCCTATGGG ATTGAGACAC AACATGTGGA AAATGCTAGC ATGGAGTTTG ACACAGCAAC TCTAAGACCT ACCTATCGCT TTATGCAAGG GGTTCCTGGT CGCTCTAACG CCTTTGAGAT TGCTCGCCGC CTCGGTCTGA ATGAGATTAT TGTCAAGGAA GCAGAAAACC TCACTGATAC GGATAGTGAT GTCAACCGTA TCATCGAGCA ACTTGAAGCC CAAACTGTAG AAACACAAAA ACGTCTGGAG CACATCAAGG ACGTTGAGCA AGAGAACCTT AAATTCAACC GTGCGGTCAA GAAACTCTAT AATGAATTCT CCCATGAGTA CGACAAGGAA CTCGAAAAGG CTCAAAAAGA AATTCAAGAG ATGGTGGATA CAGCTTTAGC TGAAAGCGAT AGTATTCTCA AAAATCTCCA TGATAAGAGT CAACTCAAGC CTCATGAAGT TATCGACGCC 
AAAGGGAAAC TTAAGAAACT TGCCGCACAA GTCGACCTTT CTAAGAACAA GGTTCTTCGT AAAGCCAAGA AAGAAAAGGC TGCGCGTGCC CCTCGTGTCG GTGATGATAT CATCGTTACT GCCTATGGGC AACGTGGTAC GCTCACCAGT CAAGCCAAAA ATGGTAACTG GGAAGCTCAA GTGGGTCTCA TCAAGATGAG CCTAAAGGCA GACGAATTCA CTCTGGTTCG TGCCCAAGCA GAAGCACAAC AACCCAAGAA AAAACAAATT AATGTGGTTA AAAAAGCCAA GAAGACATCT TCTGATGGGC CTCGTGCCCG TCTAGACCTT CGTGGGAAAC GTTACGAAGA AGCCATGCAA GAGCTCGATG CCTTTATTGA CCAAGCTCTA CTTAACAACA TGAGTCAGGT GGAAATCATT CACGGTATTG GTACCGGTGT CATCCGAGAT GCTGTAACCA AATATCTCCG CCGTCATCGC CATGTTAAAA ATTTTGAATA TGCCCCACAA AGTGCTGGTG GTTCCGGATG TACCATCGCA ACCTTGGGTT AA
Protein sequence | MNTKILDQLE FNKVKDQFTE YLQTEQAQAE LRDLVPMTNP ERIQNQFTEI QEMSEIFIEH HGFAIGSLRD ISEPLRRLEL DADLNIQELI AIKKVLQASA DLSRFYADLE NVELIALKRL FEKIEAFPSL QGSLQSINDG GFIEHFASPE LQNIRRQLKA CDDAIRQTLQ DILKKSGHML AENLIASRNG RSVLPVKNTY RNRIAGVVHD ISSSGNTVYI EPRAVIQLNE KITQLRADER HEMARILHEL SDQLRPHTAA IANNAWILGH MDFIRGKYLY LHDKKAIIPE ISDNQTLQLL NVRHPLLINP VANDLRFDED LTVIVITGPN TGGKTVMLKT LGLAQLMAQS GLPILADKGS RVAIFQEIFA DIGDEQSIEQ SLSTFSSHMT HIVEILNTAD SNSLVLVDEL GAGTDPQEGA SLAMAILEHL RLSQIKTMAT THYPELKAYG IETQHVENAS MEFDTATLRP TYRFMQGVPG RSNAFEIARR LGLNEIIVKE AENLTDTDSD VNRIIEQLEA QTVETQKRLE HIKDVEQENL KFNRAVKKLY NEFSHEYDKE LEKAQKEIQE MVDTALAESD SILKNLHDKS QLKPHEVIDA KGKLKKLAAQ VDLSKNKVLR KAKKEKAARA PRVGDDIIVT AYGQRGTLTS QAKNGNWEAQ VGLIKMSLKA DEFTLVRAQA EAQQPKKKQI NVVKKAKKTS SDGPRARLDL RGKRYEEAMQ ELDAFIDQAL LNNMSQVEII HGIGTGVIRD AVTKYLRRHR HVKNFEYAPQ SAGGSGCTIA TLG
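The GC content field reports the share of G and C bases in the gene sequence. The following is a minimal Python sketch of that calculation, assuming the sequence is copied as shown here, in space-separated 10-base blocks. `clean_sequence` and `gc_fraction` are hypothetical helper names. The demo uses only the first three blocks of the gene sequence, so its result is not the gene-wide value.

```python
def clean_sequence(raw: str) -> str:
    """Remove whitespace between blocks and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; raises on an empty sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence only (not the full 2352 bp).
prefix = clean_sequence("ATGAACACAA AGATTTTAGA CCAATTAGAA")
print(len(prefix))          # 30
print(gc_fraction(prefix))  # GC fraction of this 30-base prefix only
```

Passing the complete gene sequence to `gc_fraction` and multiplying by 100 gives a percentage that can be compared with the GC content field.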