Gene STER_1737 details


Gene Information

Locus tag: STER_1737
Symbol:
ID: 4438029
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptococcus thermophilus LMD-9
Kingdom: Bacteria
Replicon accession: NC_008532
Strand:
Start bp: 1624973
End bp: 1627324
Gene Length: 2352 bp
Protein Length: 783 aa
Translation table: 11
GC content: 45%
IMG OID: 639677325
Product: MutS family DNA structure-specific ATPase
Protein accession: YP_821074
Protein GI: 116628455
COG category: [L] Replication, recombination and repair
COG ID: [COG1193] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01069] MutS2 family protein
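
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon (the gene sequence listed under Sequence ends in TAA, which is consistent with that). The function names `span_length` and `coding_codons` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1624973
end_bp = 1627324
gene_length_bp = 2352
protein_length_aa = 783


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Amino-acid count of a CDS, assuming exactly one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both checks should agree with the reported Gene Length and Protein Length.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(coding_codons(gene_length_bp) == protein_length_aa)
```

Under these assumptions, 1627324 - 1624973 + 1 gives 2352 bp, and 2352 / 3 gives 784 codons, of which 783 encode amino acids.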


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACACAA AGATTTTAGA CCAATTAGAA TTTAACAAGG TCAAAGACCA GTTCACAGAA 
TACCTGCAGA CCGAACAGGC TCAGGCGGAG TTACGTGACT TGGTGCCTAT GACCAATCCA
GAAAGAATTC AAAACCAATT TACAGAAATC CAAGAAATGT CTGAGATTTT CATAGAGCAT
CATGGCTTTG CCATTGGTAG CTTAAGAGAT ATTTCTGAGC CCTTGCGTCG CTTGGAACTG
GATGCTGACC TCAACATCCA GGAGCTCATT GCTATCAAGA AGGTCTTGCA AGCTTCAGCT
GACCTTAGCC GCTTCTACGC TGACCTTGAA AATGTTGAAC TCATCGCTCT TAAACGTCTT
TTTGAAAAGA TTGAGGCCTT TCCAAGTCTA CAAGGTAGTC TTCAAAGTAT TAATGATGGT
GGTTTTATTG AGCATTTTGC TAGTCCTGAG TTGCAAAATA TCCGTCGTCA ACTTAAGGCT
TGTGATGATG CCATTCGCCA GACCTTGCAA GACATTCTCA AGAAATCTGG GCACATGTTA
GCCGAGAACT TAATTGCTAG TCGTAATGGC CGTTCAGTGC TTCCTGTCAA AAATACCTAC
CGTAACCGTA TTGCGGGGGT TGTTCATGAC ATCTCAAGCT CTGGGAACAC AGTGTATATC
GAGCCACGAG CCGTTATCCA GCTTAACGAA AAAATTACAC AATTACGTGC AGACGAGCGC
CATGAGATGG CACGTATCCT ACATGAACTG TCTGATCAAC TCCGTCCACA TACTGCTGCC
ATCGCTAACA ATGCTTGGAT TTTGGGCCAT ATGGACTTCA TTAGAGGAAA ATACCTCTAT
CTCCATGATA AAAAGGCAAT TATTCCAGAG ATTAGTGACA ACCAAACACT CCAACTGCTT
AACGTTCGTC ACCCTCTTTT AATCAATCCA GTAGCCAACG ATTTGCGCTT TGATGAGGAT
CTCACGGTCA TTGTCATTAC AGGTCCCAAT ACTGGTGGTA AAACGGTCAT GCTTAAAACT
CTAGGACTAG CTCAACTAAT GGCTCAATCT GGTCTGCCAA TTTTGGCTGA TAAGGGCAGT
CGTGTAGCCA TTTTCCAAGA GATTTTTGCG GATATCGGAG ACGAACAGTC TATCGAACAG
AGCTTGTCAA CTTTCTCTAG TCACATGACG CACATTGTCG AGATTTTGAA CACTGCCGAC
AGCAATAGCT TGGTCTTGGT CGATGAGCTT GGGGCTGGTA CCGACCCTCA GGAAGGGGCT
AGTCTAGCCA TGGCAATCCT AGAGCACCTC CGTTTGAGCC AGATTAAGAC CATGGCTACT
ACTCACTATC CTGAGCTTAA GGCCTATGGG ATTGAGACAC AACATGTGGA AAATGCTAGC
ATGGAGTTTG ACACAGCAAC TCTAAGACCT ACCTATCGCT TTATGCAAGG GGTTCCTGGT
CGCTCTAACG CCTTTGAGAT TGCTCGCCGC CTCGGTCTGA ATGAGATTAT TGTCAAGGAA
GCAGAAAACC TCACTGATAC GGATAGTGAT GTCAACCGTA TCATCGAGCA ACTTGAAGCC
CAAACTGTAG AAACACAAAA ACGTCTGGAG CACATCAAGG ACGTTGAGCA AGAGAACCTT
AAATTCAACC GTGCGGTCAA GAAACTCTAT AATGAATTCT CCCATGAGTA CGACAAGGAA
CTCGAAAAGG CTCAAAAAGA AATTCAAGAG ATGGTGGATA CAGCTTTAGC TGAAAGCGAT
AGTATTCTCA AAAATCTCCA TGATAAGAGT CAACTCAAGC CTCATGAAGT TATCGACGCC
AAAGGGAAAC TTAAGAAACT TGCCGCACAA GTCGACCTTT CTAAGAACAA GGTTCTTCGT
AAAGCCAAGA AAGAAAAGGC TGCGCGTGCC CCTCGTGTCG GTGATGATAT CATCGTTACT
GCCTATGGGC AACGTGGTAC GCTCACCAGT CAAGCCAAAA ATGGTAACTG GGAAGCTCAA
GTGGGTCTCA TCAAGATGAG CCTAAAGGCA GACGAATTCA CTCTGGTTCG TGCCCAAGCA
GAAGCACAAC AACCCAAGAA AAAACAAATT AATGTGGTTA AAAAAGCCAA GAAGACATCT
TCTGATGGGC CTCGTGCCCG TCTAGACCTT CGTGGGAAAC GTTACGAAGA AGCCATGCAA
GAGCTCGATG CCTTTATTGA CCAAGCTCTA CTTAACAACA TGAGTCAGGT GGAAATCATT
CACGGTATTG GTACCGGTGT CATCCGAGAT GCTGTAACCA AATATCTCCG CCGTCATCGC
CATGTTAAAA ATTTTGAATA TGCCCCACAA AGTGCTGGTG GTTCCGGATG TACCATCGCA
ACCTTGGGTT AA
 
Protein sequence
MNTKILDQLE FNKVKDQFTE YLQTEQAQAE LRDLVPMTNP ERIQNQFTEI QEMSEIFIEH 
HGFAIGSLRD ISEPLRRLEL DADLNIQELI AIKKVLQASA DLSRFYADLE NVELIALKRL
FEKIEAFPSL QGSLQSINDG GFIEHFASPE LQNIRRQLKA CDDAIRQTLQ DILKKSGHML
AENLIASRNG RSVLPVKNTY RNRIAGVVHD ISSSGNTVYI EPRAVIQLNE KITQLRADER
HEMARILHEL SDQLRPHTAA IANNAWILGH MDFIRGKYLY LHDKKAIIPE ISDNQTLQLL
NVRHPLLINP VANDLRFDED LTVIVITGPN TGGKTVMLKT LGLAQLMAQS GLPILADKGS
RVAIFQEIFA DIGDEQSIEQ SLSTFSSHMT HIVEILNTAD SNSLVLVDEL GAGTDPQEGA
SLAMAILEHL RLSQIKTMAT THYPELKAYG IETQHVENAS MEFDTATLRP TYRFMQGVPG
RSNAFEIARR LGLNEIIVKE AENLTDTDSD VNRIIEQLEA QTVETQKRLE HIKDVEQENL
KFNRAVKKLY NEFSHEYDKE LEKAQKEIQE MVDTALAESD SILKNLHDKS QLKPHEVIDA
KGKLKKLAAQ VDLSKNKVLR KAKKEKAARA PRVGDDIIVT AYGQRGTLTS QAKNGNWEAQ
VGLIKMSLKA DEFTLVRAQA EAQQPKKKQI NVVKKAKKTS SDGPRARLDL RGKRYEEAMQ
ELDAFIDQAL LNNMSQVEII HGIGTGVIRD AVTKYLRRHR HVKNFEYAPQ SAGGSGCTIA
TLG