Gene Sbal223_1246 details

Gene Information

Locus tag: Sbal223_1246
Symbol:
ID: 7088018
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS223
Kingdom: Bacteria
Replicon accession: NC_011663
Strand:
Start bp: 1471516
End bp: 1474086
Gene Length: 2571 bp
Protein Length: 856 aa
Translation table: 11
GC content: 49%
IMG OID: 643460152
Product: DNA mismatch repair protein MutS
Protein accession: YP_002357179
Protein GI: 217972428
COG category: [L] Replication, recombination and repair
COG ID: [COG0249] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01070] DNA mismatch repair protein MutS
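
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1471516
END_BP = 1474086
GENE_LENGTH_BP = 2571
PROTEIN_LENGTH_AA = 856


def span_length(start: int, end: int) -> int:
    """Length of an interval given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the assumptions above are right.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1474086 - 1471516 + 1 gives 2571 bp, and 2571 / 3 - 1 gives 856 aa, which matches the listed values.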


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000000000830859
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGAATGTAA TTGATACCGA TGATTTAGAA AAACATACTC CTATGATGCG TCAGTACTTG 
ACCATGAAAG CAGAACATCA CGACATGTTG CTGTTTTATC GCATGGGTGA CTTCTACGAG
TTGTTTTATG ACGATGCTAA ACGTGCTTCT GAGCTGTTAG GCATTTCACT CACGGCGCGC
GGTAAAAGTG GTGGCGATCC TATCCCCATG GCGGGACTGC CTTACCATGC TGTGGAAGGC
TATCTTGCGA AATTAGTTCA AATTGGTCAA TCGGTTGCGA TATGTGAACA AATTGGCGAC
CCAGCAACCT CTAAAGGGCC TGTTGAGCGT AAAGTGGTGC GTATCGTCAC GCCTGGCACC
TTAACCGACG AAGCCCTGCT GCAGGAACGC CAAGACAATC TATTGGCAGC GGTTTATCAA
GGTAAAATCG GTTTTGGTTA TGCCACTCTG GATGTTTCAT CCGGGCGTTT TGTGATTGCC
GAGCTCGACA CGCGAGAGTC ATTAGAAGCT GAATTGCAAC GCACTAATCC CGTTGAAATT
CTCTACAGTG AAGACTTTGG TGAACTAGGT CTACTAAACG GTTTTAAAGG TAAACGTCGT
CGCCCTGAAT GGGAATTTGA TTACGACACT AGCATCAAGT TACTGCTAGC TCAGTTTGGC
ACGAAAGACT TGCATGGTTT TGGTATCGCA GATGCACGTT TATCACTGCA GGCTGCGGGT
TGCTTGATGC AATATGTCAA AGACACACAG CGCACAGCCC TGCCCCATAT CAATGCCATT
ACCCGCTTTA ATCAAACCGA TAGCATAGTG CTCGATGCGG CAACGCGCCG AAATCTAGAA
CTCACTCAAA ACCTTGCCGG TGGTCGCGAT AATACCTTAG CCGCAGTATT GGATAACACG
GCGACGCCTA TGGGCAGCCG CATGTTGCAA CGCTGGATCC ATCAGCCGCT GCGAGATCCA
AAACACATCA AAGCACGCCA ACAGGCGGTG ACTGAACTAC TCGATACCGC CGCCCACGAA
GGTTTGCATG AACAGTTAAA AGCACTTGGC GATATCGAAC GTATCATGGC AAGACTCGCG
CTGCGTACCG CTCGCCCAAG AGACTTTGCC CGTTTACGTC AAGCGCTGGG CTTACTACCT
GAATTACAAC AGAGTTTAAG CACACTGAGC GCGCCGCACA CGACTCAATT ACGGCAACAC
TTAGGTGAGT TCCCCGCTGA ACAAGCCTTA CTTGAGCGCG CGATAGTCGA TAATCCTCCC
ATGCTTATCC GCGATGGCGG AGTGATCCGT GAAGGCTATA ACAGCGAATT AGATGAATGG
CGCGGCCTTA GCGAAGGTGC GAGCGATTAC TTAGTACAAC TCGAAGCCAG AGAAAAAGAA
CGTACTGGTA TCAACACACT TAAAGTCGGC TATAACCGTG TACACGGCTA TTACATCGAA
GTGAGTCGCT TGCAATCCTC GCAAGTGCCA CTCAATTACC AACGACGTCA AACCCTTAAG
AATATGGAGC GTTATATCAC GCCCGAACTT AAGGAGTACG AAGAAAAAGT GCTCTCGAGC
CAAGGTAAAG CACTGGCACT CGAAAAGCAG TTATGGGAAC AATTATTTGA TCTTATTCTG
CCAAAATTAC ATGAATTACA AGCTTTTGCT CGAGCGGCGG CAGAACTTGA TGTGTTAAGT
AACTTTGCTG AGCGCGCCGA AACCTTAGGC TACACTTGCC CAGAGCTGAG CCAAGATATC
GGCGTACAGA TAGAGGCTGG TCGTCACCCA GTGGTGGAAC GTGTGAGTCA AACACCGTTT
ATCGCTAACC CAGTCACCTT GCACAATCAA AGACGTATGT TGATTGTCAC CGGACCTAAC
ATGGGCGGTA AATCGACTTA CATGCGCCAA GTCGCCTTGA TCACGCTGAT GGCCCACATT
GGCTGTTTTG TGCCAGCAGA TTGCGCACTG ATAGGCCCTA TCGATCGTAT CTTTACCCGT
ATTGGCGCCT CTGACGATCT GGCCTCTGGC CGTTCAACCT TTATGGTAGA AATGACAGAA
ACCGCCAACA TTCTACACAA CGCCACCGCC AGTAGCTTAG TGTTAATGGA TGAAATCGGC
CGCGGCACAT CCACCTATGA TGGTTTATCG CTCGCTTGGT CGGCGGCAGA GTATTTAGCC
CAGCAAGTCG GCGCTATGAC CCTATTCGCC ACCCATTATT TTGAACTAAC TCAGTTGCCT
GAATTAATGG CTGGTGTTTA CAATGTGCAC CTCGATGCCA TTGAACACGA CGATACCATC
GCCTTTATGC ATGCAGTGCA AGAAGGCGCC GCTAGTAAAA GCTACGGCTT ACAAGTTGCG
GCGCTTGCCG GCGTACCAAA CAAAGTGATT AAAGCCGCTA AACATAAGTT GCAGCAATTG
GAGAGTCGCG ATCATCAAGC TGAAGGAACG AGGACGCCTA TTCAAAGCTT ACTCGCGTTA
CCTGAACCGG TTGAGAATCC AGCGCTAACG AAGTTAAGTA GCATTAATCC CGATAACTTA
ACGCCAAAAC AAGCACTTGA TTTGCTCTAT GAGCTGAAAC GTCTGAGCTA A
 
Protein sequence
MNVIDTDDLE KHTPMMRQYL TMKAEHHDML LFYRMGDFYE LFYDDAKRAS ELLGISLTAR 
GKSGGDPIPM AGLPYHAVEG YLAKLVQIGQ SVAICEQIGD PATSKGPVER KVVRIVTPGT
LTDEALLQER QDNLLAAVYQ GKIGFGYATL DVSSGRFVIA ELDTRESLEA ELQRTNPVEI
LYSEDFGELG LLNGFKGKRR RPEWEFDYDT SIKLLLAQFG TKDLHGFGIA DARLSLQAAG
CLMQYVKDTQ RTALPHINAI TRFNQTDSIV LDAATRRNLE LTQNLAGGRD NTLAAVLDNT
ATPMGSRMLQ RWIHQPLRDP KHIKARQQAV TELLDTAAHE GLHEQLKALG DIERIMARLA
LRTARPRDFA RLRQALGLLP ELQQSLSTLS APHTTQLRQH LGEFPAEQAL LERAIVDNPP
MLIRDGGVIR EGYNSELDEW RGLSEGASDY LVQLEAREKE RTGINTLKVG YNRVHGYYIE
VSRLQSSQVP LNYQRRQTLK NMERYITPEL KEYEEKVLSS QGKALALEKQ LWEQLFDLIL
PKLHELQAFA RAAAELDVLS NFAERAETLG YTCPELSQDI GVQIEAGRHP VVERVSQTPF
IANPVTLHNQ RRMLIVTGPN MGGKSTYMRQ VALITLMAHI GCFVPADCAL IGPIDRIFTR
IGASDDLASG RSTFMVEMTE TANILHNATA SSLVLMDEIG RGTSTYDGLS LAWSAAEYLA
QQVGAMTLFA THYFELTQLP ELMAGVYNVH LDAIEHDDTI AFMHAVQEGA ASKSYGLQVA
ALAGVPNKVI KAAKHKLQQL ESRDHQAEGT RTPIQSLLAL PEPVENPALT KLSSINPDNL
TPKQALDLLY ELKRLS
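
The sequences above are printed in blocks of ten characters separated by spaces, so they need to be cleaned before any downstream use. The following is a minimal sketch, not part of the database record, of how the gene sequence could be normalized and its GC fraction computed for comparison with the GC content field. The function names are hypothetical, and how the database rounds its reported percentage is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence as printed in the record.
    first_line = "ATGAATGTAA TTGATACCGA TGATTTAGAA AAACATACTC CTATGATGCG TCAGTACTTG"
    seq = clean_sequence(first_line)
    print(len(seq), round(100 * gc_fraction(seq)))
```

Passing the full gene sequence through `clean_sequence` should yield a string whose length equals the Gene Length field, which is a quick way to detect copy or extraction errors.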