Gene Shewana3_1125 details


Gene Information

Locus tag: Shewana3_1125
Symbol:
ID: 4477830
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. ANA-3
Kingdom: Bacteria
Replicon accession: NC_008577
Strand:
Start bp: 1323429
End bp: 1326014
Gene Length: 2586 bp
Protein Length: 861 aa
Translation table: 11
GC content: 51%
IMG OID: 639725668
Product: DNA mismatch repair protein MutS
Protein accession: YP_868766
Protein GI: 117919574
COG category: [L] Replication, recombination and repair
COG ID: [COG0249] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01070] DNA mismatch repair protein MutS
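
The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(1323429, 1326014)
print(length, protein_length_aa(length))  # prints "2586 861"
```

Both values agree with the Gene Length and Protein Length fields listed in this record.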


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATCCTA TAGATACCGA TGATTTAGAA AAACATACCC CTATGATGCG TCAATATTTG 
ACCATGAAAG CAGAGCATCA CGACATGCTG CTGTTTTATC GTATGGGTGA CTTCTATGAA
CTCTTCTATG ATGACGCTAA ACGAGCCTCA GAATTATTGG GCATTTCCCT TACGGCCCGC
GGCAAAAGTG GTGGCGATCC GATCCCGATG GCGGGTATCC CTTACCATGC GGTTGAAGGC
TATCTAGCAA AACTAGTTCA AATTGGTCAA TCGGTCGCGA TCTGTGAGCA GATTGGCGAC
CCCGCCACCT CAAAGGGTCC AGTTGAGCGA AAAGTGGTGC GCATCGTCAC CCCAGGTACT
TTGACCGATG AAGCCCTGTT GCAGGAGAGA CAAGACAATC TGTTAGCCGC GGTTTACCAA
GGTAAAGTTG GGTTTGGCTA CGCGACCCTC GATGTGTCTT CCGGCCGTTT CGTGATTGCC
GAATTAGAGA CAAAAGAGTC CCTCGAAGCC GAGCTGCAAC GTACCAATCC GGTCGAAATT
CTCTATAGCG AAGATTTTGG CGCGATGGAG TTATTGCATC ATTTTAAAGG CAAGCGTCGC
CGCCCTGAGT GGGAGTTTGA TTACGATACC AGCATTAAAC TGCTACTGGC ACAATTTGGC
ACTAAGGATT TGCACGGCTT TGGGATTACC GATGCGAGGC TTTCGCTGCA AGCGGCGGGC
TGCTTAATGC AATATGTGAA AGACACTCAG CGCACTGCTC TGCCCCATAT CAATGCCATT
ACCCGTTTCA ATCAAACCGA TACGATTGTC TTGGATGCAG CAACGCGCCG TAACCTCGAG
CTGACACAAA ATTTGAGTGG TGGTCGAGAT AACACCCTAG CAGCCGTGCT CGACAATACA
GCCACAGCGA TGGGCAGCCG GATGTTGCAA CGCTGGATCC ATCAGCCGCT GAGAGACCAT
GCGCAGATTT TTGCTCGCCA AACGGCGGTC AATGAATTAC TCGAGACCAC GGCCCATGAG
TCATTGCATG ATCAGCTTAA AGCCCTAGGC GATATTGAAC GTATCATGGC AAGACTGGCA
CTGCGTACAG CTCGTCCAAG GGATTTTGCC CGCCTACGCC AAGCATTAAA CCTATTGCCT
CAATTGCAGC AGTCACTCGC GCAGTTAAGT GCGCCCCATA CGGTGAAACT AGGCCAACTC
TTAGGTGAGT TTCCCGAAGA GCAACAACTG CTTGAGCGCG CGATTGTCGA TAACCCTCCC
ATGCTTATCC GTGATGGCGG CGTGATCCGC GAAGGCTACA ATGCCGAGTT AGATGAATGG
CGAGGCTTAA GTGAAGGAGC CACAGATTAT CTGGTTCAGC TCGAAGCGCG GGAAAAAGAG
CGTACCGGCA TTGCCACGCT GAAGGTTGGC TACAACCGAG TGCATGGCTA CTACATCGAA
GTCAGTCGCC TGCAATCACA GCAAGTGCCA TTAAATTACC AGCGCCGTCA AACCCTTAAG
AATATGGAGC GTTACATCAC GCCCGAGCTT AAGGAATACG AAGAAAAAGT GCTGTCGAGC
CAAGGTAAGG CGCTTGCCCT TGAAAAACAA CTTTGGGACG AATTATTTGA TTTAATCCTC
CCCAAGTTGC ATGAGCTACA AGCCTTCGCC AGAGCCGCAG CCGAACTCGA TGTGCTGAGT
AACTTTGCCG AACGTGCCGA AACCTTAGGC TATACCTGCC CAGAACTGAG CAGCGAAATC
GGGGTAAAAA TCGAAGCAGG TCGCCACCCA GTGGTCGAGC GTGTGAGTCA AACGCCTTTT
ATCGCCAACC CTGTCACCCT GCACAATCAA CGCCGGATGT TGATTGTTAC CGGTCCAAAC
ATGGGCGGTA AATCAACCTA CATGCGCCAA GTGGCGCTTA TCACGCTAAT GGCCCATATT
GGCTGCTTTG TGCCAGCAGA TCGCGCCATC ATTGGCCCTA TCGATCGGAT TTTTACTCGG
ATTGGCGCCT CAGACGATCT CGCCTCTGGT CGCTCAACCT TTATGGTAGA AATGACTGAA
ACGGCAAATA TTCTCCATAA TGCGACCGCA CAGAGTTTAG TCTTGATGGA TGAAATTGGC
CGTGGCACAT CAACCTACGA TGGTCTGTCA TTAGCATGGT CTGCAGCGGA ATATTTAGCG
CAGCAAGTCG GCGCAATGAC GCTGTTTGCG ACCCATTATT TCGAGTTAAC ACAATTACCG
GAACTCATGG CTGGCGTCTA TAACGTGCAC CTCGATGCAA TTGAGCACGA AGACACCATC
GCCTTTATGC ACGCGGTGCA AGAAGGTGCT GCCAGTAAGA GTTATGGTTT GCAAGTTGCG
GCCCTTGCCG GGGTACCTGC ACGGGTGATT AAGGCGGCAA AACACAAGTT ACATCAGCTT
GAGAGCCGCG ATCATCAAGT GGAAGGTGCT AATGTTAACG GCACTAGGGC ACCGATCCAA
ACCCTGCTCG CCCTGCCTGA GCCAGTGGAG AATCCCGCCG TCAGCAAATT AAAAGATATC
AATCCCGATA ACCTGACCCC AAAACAGGCG CTGGATTTAC TCTACGAGCT AAAACGCTTG
AGTTAA
 
Protein sequence
MNPIDTDDLE KHTPMMRQYL TMKAEHHDML LFYRMGDFYE LFYDDAKRAS ELLGISLTAR 
GKSGGDPIPM AGIPYHAVEG YLAKLVQIGQ SVAICEQIGD PATSKGPVER KVVRIVTPGT
LTDEALLQER QDNLLAAVYQ GKVGFGYATL DVSSGRFVIA ELETKESLEA ELQRTNPVEI
LYSEDFGAME LLHHFKGKRR RPEWEFDYDT SIKLLLAQFG TKDLHGFGIT DARLSLQAAG
CLMQYVKDTQ RTALPHINAI TRFNQTDTIV LDAATRRNLE LTQNLSGGRD NTLAAVLDNT
ATAMGSRMLQ RWIHQPLRDH AQIFARQTAV NELLETTAHE SLHDQLKALG DIERIMARLA
LRTARPRDFA RLRQALNLLP QLQQSLAQLS APHTVKLGQL LGEFPEEQQL LERAIVDNPP
MLIRDGGVIR EGYNAELDEW RGLSEGATDY LVQLEAREKE RTGIATLKVG YNRVHGYYIE
VSRLQSQQVP LNYQRRQTLK NMERYITPEL KEYEEKVLSS QGKALALEKQ LWDELFDLIL
PKLHELQAFA RAAAELDVLS NFAERAETLG YTCPELSSEI GVKIEAGRHP VVERVSQTPF
IANPVTLHNQ RRMLIVTGPN MGGKSTYMRQ VALITLMAHI GCFVPADRAI IGPIDRIFTR
IGASDDLASG RSTFMVEMTE TANILHNATA QSLVLMDEIG RGTSTYDGLS LAWSAAEYLA
QQVGAMTLFA THYFELTQLP ELMAGVYNVH LDAIEHEDTI AFMHAVQEGA ASKSYGLQVA
ALAGVPARVI KAAKHKLHQL ESRDHQVEGA NVNGTRAPIQ TLLALPEPVE NPAVSKLKDI
NPDNLTPKQA LDLLYELKRL S
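
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It uses no external libraries and builds the codon table from the conventional TCAG ordering. Translation table 11 assigns the same amino acids to codons as the standard table and differs only in its permitted start codons, so the standard assignments are used here. The `translate` helper is hypothetical and handles only unambiguous A/C/G/T input.

```python
from itertools import product

# Codon table in the conventional TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGAATCCTA TAGATACCGA TGATTTAGAA AAACATACCC CTATGATGCG TCAATATTTG"
print(translate(first_line))  # prints "MNPIDTDDLEKHTPMMRQYL"
```

The output matches the first twenty residues of the protein sequence listed above (MNPIDTDDLE KHTPMMRQYL).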