Gene Psyc_0247 details


Gene Information

Locus tag: Psyc_0247
Symbol: mutS
ID: 3515090
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Psychrobacter arcticus 273-4
Kingdom: Bacteria
Replicon accession: NC_007204
Strand:
Start bp: 292249
End bp: 295425
Gene Length: 3177 bp
Protein Length: 1058 aa
Translation table: 11
GC content: 46%
IMG OID: 637668934
Product: DNA mismatch repair protein MutS
Protein accession: YP_263551
Protein GI: 71064824
COG category: [L] Replication, recombination and repair
COG ID: [COG0249] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01070] DNA mismatch repair protein MutS
            [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
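
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon; neither convention is stated in the record. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 292249, 295425

length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 3177
print(protein_length(length_bp))  # 1058
```

Under these assumptions both results agree with the listed Gene Length (3177 bp) and Protein Length (1058 aa).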


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAGTCA AACCCTCTGC TCAAACTAAT GACACCATAC AATCGACTTC AAAATCTGTG 
CCAGCGGCGT CTGACTCATT GGTCATAGGC GATGCGATCT ATCACTTGGC AGATCATACA
CCCATGATGG TGCAATATCT CACCATGAAA GCCAACTATC CTCAAGCGCT ACTGCTATAC
CGGATGGGTG ATTTTTATGA GCTGTTTTTT GATGATGCCA AGCGCGCGGC ACAGATACTA
GACATCACAC TAACTCGCCG TGGTACAGAT AAAGCTGGTA ACACCATTGC GATGGCAGGC
GTACCGTTTC ATGCTGCTGA CAGTTATATG GCAAGATTGA TTGCCGCTGG GCAGACAGTG
GTTGTCTGCG AGCAAATTGA CGAATCAGCC ACTGGTAATG CTAGGAATAC GTCTAATACT
CCTACGATGG GCGATAAACA GAAAAAGGAT AAAAGCAAAT CTGCTGCTGG CACTATTATG
CGCCGTGAAG TGGTCAAAAC GCTGACCGCA GGGACGATTA CTGACGATGC CTTAATTGCG
CCCAATCACA CCCCTACTGT CGTCGCCATT GATATACTAA TACCTAAATC TAACAGCAAA
CAATCCTTGC AAGCTGCCAT CAGTCAGATG GATTTGGCTG CGGGAACGTT AACGACACAA
ACACTAAGCG CGAACCATGA TGACATTGAG GGGCTAAAAA CCCAGATGCT CACTGTGCTA
GCACGCTTTG CGCCGAGTGA ATGCATTATT GGTGAAGCAC TCAATGACAG TATTGGTGGT
ATCGGCGAAG ACTGGCTGCT GTGGCTGCGT CAAAGCCTTG ACTGCCCTAT TATCGAAGTT
GCTGCTAATG ACTTCCATCG TCAGCATGCC AGCGCGACTT TATGCCAGCA ATTTGGAGTA
CAGCGTCTTG ATGGCTTAGG ATCAGGCATT AGCGATGCGC CATTGGCTCA ATCTAGCTGT
GCAGCGCTCA TTCACTATGC GCGGCAGACT CAGCAGCGTC AGGTGCCACA GATTAATCAG
CTGATTGTCG AGTACAGCGA TGATTACTTA ATCATTGATG CCAATAGTCA ACAGAATCTT
GAGCTATTTA CACCCGTTAG CAGCAATGGC ACCTCTTTAC TATCTGTACT CAATCACTGT
CAAACGCCGA TGGGGCGACG TTTATTAGTA CAGCAAATGA AGCGACCATT ACGTCAACAT
TCGCGTATCA ACCTACGCTT GGATGCGATA GCTTGTCTAT TAAAGACAGA CAAGACCTCA
ATCGAGTCGA ATCAAGCGTC GAATCAAGCA TTGAAACACA GCTCACTGGT GATAAGCTTA
CGCGAAATGC TTAATAGCAT CGGTGATATC GAGCGCATCA GTAGCCGTAT CGGGCTGATG
AGCGCCAAGC CTCGCGACTT ACGTAAGCTT GCTGATGGTA TTGCCAGTAG CACCCAGCTT
ACTACCTTAC TGACGGATTC AGGTGTCAGC CATGAGCAGG CAGGACTGTT GCCGATGCTG
ATGCAGCAAC TGCCAGCCCA GCTACCTGCC GTACAGTCTA TAGCCAAGCT GATTGAGCGC
GCCATTATCA CAGAGCCTCC GGCGCATATC CGTGATGGCG GTATGCTAGC CGCAGGTTAC
GATGACGAGT TTGATCGGCT GACCCATTTA CATGACAATA TCCAAGTGAC ACTGGATGAG
ATGGTAGAGC GAGCACGTTT AGAGAGTCAA TTGCCCAGTC TAAAAGTCGG TTTTAATAAA
GTCAGCGGCT TTTATTTTGA ACTACCAAAA ATGCAGGCAA AAAATGCGCC GGCACACTTT
ATCCGTCGGC AAACGCTCAA GAGCAGCGAG CGCTTTATCA CTGACGAATT AAAAGACGTC
GAGACCGAAT ATTTGAGTGC GCAGACATTA GCTCTGACTC GTGAAAAGCA GCTATATCAT
GAGCTTTTAA CGGAACTTAG TAGCCATTTA GCTGAATTAC AACAGCTGAG TGCTGCAATC
GCTCAAATAG ACGTACTGAG CAATTGGGCG CAGCTGGCTA TGACATATAA CTGGCAGTGT
CCTGTCATGA GTAATAATGA TGAAAATAAA GATAGCTCAA ATACTGACAA TCAAGCCAGT
ATTGATATCA GCCAAGGTCG TCATGTGGTG GTCGAAGCCG CGTTAAATCC CGTCAATGCT
GGTAATGCTG GTAATACCGT TAATAGCTCT AATAACAGCT CTAATAATAG CGGCACCACT
AGACATAACA GCCATTTTGT CGCCAATGAT TGTGCGCTCG GCAGCGATGC AAATCCTGAA
AGGCTGCTTC TGATTACCGG TCCTAATATG GGCGGCAAGT CGACCTATAT GCGTCAAACC
GCGCTAATCG TTCTTCTTGC CCATTGTGGT AGCTTCGTCC CAGCAGCGAG CGCTCATATT
GGTGATATCG ACCGCATCTT TACTCGTATT GGCTCAGCGG ATGATTTGGC TGGTGGCAAA
TCCACTTTTA TGGTGGAGAT GATTGAGACG GCTAATATTC TGAATCAGGC AACCAATCAA
TCACTGGTAT TAATGGATGA AGTCGGACGT GGTACTGCCA CCACTGATGG ACTGGCTATC
GCCCATGCTT GCGTCAATCG ATTGCTAGAG ATTGGCTGCC TGACATTATT TGCCACCCAC
TATTTTGAGC TGACAAAATT GGCGCAAAAC CCTAAAGAAA GTAGTGGCAG CAATGACAAG
CTTATCCGCA ATGTGCATGT CGCTGCTAGC GAAGTTGATG GTCAACTGCT ATTACTGCAT
CAAATTAAAG AAGGTGCAGC AAGCTCTAGT TTTGGATTGC ATGTCGCTAA GATGGCTGGT
ATCCCCACTC AAGTATTAAA TGATGCCAAA CGCTACTTGG TAGATAACTT AACTATAGAT
AACTTAGGCA TAGATAACTT AAGCATAGAC AACCTAAAAT CGAATAATGA AAGTGCTAAT
GATGATAAAA ATGACTTAGC TAAGTCGGTA AAGGACAAAC GCCAGCAGAC TGCTGATAGC
GACATAGAAA AACTAAATCT AAGTAATATT AAAAAAACTC AAAATATGAC TGATATTCCA
CAACAAAATC AGTTATTTAG CTTACAAGAC GAGCTACATG CTATCGACCC TGACAGCCTA
ACGCCAAAGC AGGCGCACGA TTTAATTTAT CATCTAAAGA AAATCATTAG TCGTTAA
 
Protein sequence
MSVKPSAQTN DTIQSTSKSV PAASDSLVIG DAIYHLADHT PMMVQYLTMK ANYPQALLLY 
RMGDFYELFF DDAKRAAQIL DITLTRRGTD KAGNTIAMAG VPFHAADSYM ARLIAAGQTV
VVCEQIDESA TGNARNTSNT PTMGDKQKKD KSKSAAGTIM RREVVKTLTA GTITDDALIA
PNHTPTVVAI DILIPKSNSK QSLQAAISQM DLAAGTLTTQ TLSANHDDIE GLKTQMLTVL
ARFAPSECII GEALNDSIGG IGEDWLLWLR QSLDCPIIEV AANDFHRQHA SATLCQQFGV
QRLDGLGSGI SDAPLAQSSC AALIHYARQT QQRQVPQINQ LIVEYSDDYL IIDANSQQNL
ELFTPVSSNG TSLLSVLNHC QTPMGRRLLV QQMKRPLRQH SRINLRLDAI ACLLKTDKTS
IESNQASNQA LKHSSLVISL REMLNSIGDI ERISSRIGLM SAKPRDLRKL ADGIASSTQL
TTLLTDSGVS HEQAGLLPML MQQLPAQLPA VQSIAKLIER AIITEPPAHI RDGGMLAAGY
DDEFDRLTHL HDNIQVTLDE MVERARLESQ LPSLKVGFNK VSGFYFELPK MQAKNAPAHF
IRRQTLKSSE RFITDELKDV ETEYLSAQTL ALTREKQLYH ELLTELSSHL AELQQLSAAI
AQIDVLSNWA QLAMTYNWQC PVMSNNDENK DSSNTDNQAS IDISQGRHVV VEAALNPVNA
GNAGNTVNSS NNSSNNSGTT RHNSHFVAND CALGSDANPE RLLLITGPNM GGKSTYMRQT
ALIVLLAHCG SFVPAASAHI GDIDRIFTRI GSADDLAGGK STFMVEMIET ANILNQATNQ
SLVLMDEVGR GTATTDGLAI AHACVNRLLE IGCLTLFATH YFELTKLAQN PKESSGSNDK
LIRNVHVAAS EVDGQLLLLH QIKEGAASSS FGLHVAKMAG IPTQVLNDAK RYLVDNLTID
NLGIDNLSID NLKSNNESAN DDKNDLAKSV KDKRQQTADS DIEKLNLSNI KKTQNMTDIP
QQNQLFSLQD ELHAIDPDSL TPKQAHDLIY HLKKIISR
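
The sequences above are printed in space-separated groups of 10 characters, 60 per line, so they need to be joined before any length or composition check. The following is a minimal sketch of that step; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first and last lines of the gene sequence are used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-grouped sequence listing into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence, copied from the listing above.
first_line = "ATGTCAGTCA AACCCTCTGC TCAAACTAAT GACACCATAC AATCGACTTC AAAATCTGTG "
last_line = "ACGCCAAAGC AGGCGCACGA TTTAATTTAT CATCTAAAGA AAATCATTAG TCGTTAA"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

print(len(head), head[:3])   # 60 ATG
print(len(tail), tail[-3:])  # 57 TAA
```

Passing the full gene listing to `clean_sequence` and then to `gc_fraction` would allow the Gene Length and GC content fields to be compared against the sequence itself.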