Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Syncc9605_0078 |
Symbol | |
ID | 3737818 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus sp. CC9605 |
Kingdom | Bacteria |
Replicon accession | NC_007516 |
Strand | - |
Start bp | 75614 |
End bp | 78325 |
Gene Length | 2712 bp |
Protein Length | 903 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637774659 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_380412 |
Protein GI | 78211633 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.167909 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCGGT CCGCGTCCCA GCCCCCTGAC GACGCCCTGC AGGGCAATCT GTTCGGGGCG CCTGAGCCGG CCGCCCCATC GGCGACAGCA ACGGCCAGCG AGCCTGAGAC CGCCAGCCAC GATCTCAGCG ACGACGAACT TGGGGCCGAT GCAGCCGCCC GGCCCCGAAC GCGCCAGGCC ACGGCAAGCG AGGGCAGCAG CGAAGCTCCG GCTGCAAACG ATTCAGAACC CAGCAGAGAC GAACCAGCCT GGGGCCATCA CAGCCAGCTG GACCCGCTGC AGCTCACACC GATGCTGCGC CACTACGTGG AGCTAAAAGC AGCCCATCCC GAAAGGGTGC TGCTCTACCG CCTGGGCGAC TTCTTCGAAT GCTTCTTCGA AGACGCGATC GAGCTCTCCC GGGTGCTGGA ACTCACCCTC ACCGGCAAGG AGGGCGGCAA GGCCATCGGC CGGGTGCCGA TGGCGGGCAT CCCCCACCAC GCCGCCGAGC GCTATTGCGC CGAACTGATC AAGCAGGGCT ACAGCGTGGC CCTCTGCGAT CAACTCGAAA CCACTCCCAC CAAGGGTGCC CTGCTCAAGC GGGACATCAC CCGGGTGCTG ACCCCCGGCA CCGTGTTGGA GGAGGGCATG CTCAGCGCCC GCCGCAACAA CTGGCTGGCG GCGGTGGTGG TGGAGCCGGC CCAGGGAAAA CAACCGCTGC GCTGGGGCCT GGCCAGCGCC GATGTGAGCA CCGGCGAGGT GCAAGTGATG CAACGAGAAG ACAGCAGCGC CCTGCACCAG CAGCTGGCCC AGCAGGAGGC CTCCGAATTG CTCTGGGCCG CCGCCCTCGA TACTGAGCGG CCGGCCTGGT GCCCGGAACG GTTGCGGCTG ACGCCAATGG CCAGCACCCC CTTCAGCCCA GTGGAGGCGG AGCGCACCCT GCAGCAGCAC TACGGCCTCA GCAGCCTCGA TGGCCTCGGC CTACCGGAGC ACCCCCTAGC CCTGCAGGCC CTCGGTGGCC TGCTGGGCTA CCTGCAGGAC ACCCAGCCCC TGGAGGAGGA CAGCCGCATT CCCCTCGAAG TGCCGGCAAT CGTGCACCGC GGCGATGCCC TGGTGCTGGA TGCCCAGACC CGCCGCAACC TCGAGCTCAC CGCCACCCAG CGCGACAACC AGTTGCAGGG GTCATTGCTC TGGGCCATCG ATCGCACCCT CACCGCCATG GGCGGCCGTT GCCTACGCCG CTGGCTGGAA GCACCGTTGA TGGACCGCCA GGCCATTCAG CAGCGCCAGG ATCTGGTGAG CAGCCTGGTG GGTGAACGCA GCCTGCGACT GGCGATCCGC CAGCTACTTC GGCCGATGGG GGACCTGGAA CGGCTGGCCG GCCGGGCCGG GGCGGGCCAT GCCGGTGCCC GCGATCTTGT GGCCATCGCC GATGGCCTTG AACGGTTGCC CCAGCTCACC GCTCGGCTGA AATCGGCAAT CAGCACAGGG CCGGAGTGGT TGCAGCAGCT GCTCAGCCCC GATCCAGCCC TGGCGGAGCT GGCCCGAACA ATTCGCCACA AGCTGGTGGA GGCCCCACCG CTCTCCCTCT CCGAGGGCGA TCTGATCCAT GACGGCGTCG ACCCGCTGCT GGATGGGCTG CGCAACCAGC TGGACGATCA GGACGCCTGG CTGAGCCACC AGGAGCAGCA AGAGCGCCAA CGCTGTGGCA TCAGCACCCT GAAGCTGCAG CACCACCGCA CCTTCGGCTA TTTCCTGGCG GTGAGCAAAG CCAAGGCCAC CGCCGTGCCG GAACACTGGA TCCGGCGCCA GACCCTGGCC AACGAAGAAC GCTTCATTAC CCCGGATCTC AAGGAGCGGG AGGGCCGCAT CTTCCAGCTG CGGGCCCGGG CTTGCCAGCG GGAATACGAG CTGTTCTGCC AGCTGCGAGA GCAGGTGGGG GCCATGGCCG CCCCGATCCG CCAGGCAGCC CGCGCCGTTG CTGCACTCGA TGCCCTCACC GGCCTGGGCG ATGTGGCCGC CAGCGGTGGC TACTGCGCAC CAACCATCAC CGACGGCCGA GGGCTGCAGC TGGAGGACAG CCGCCATCCG GTGGTGGAGC AACGGCTGGT GGAAACCGCT TTCACCCCCA ATGATGTGCA GCTCGGTGAG GGCACCGATC TGGTGGTGCT CACGGGCCCC AATGCCAGTG GCAAGAGCTG CTACCTACGC CAGATCGGCC TGATCCAGTT GATGGCTCAG ATCGGCAGCT GGGTGCCGGC CCGCTCAGCC ACGGTCGGCA TTGCCGATCG GATCTTCACC CGGGTGGGTG CTGTGGATGA TCTGGCCGCC GGCCAGTCCA CCTTCATGGT GGAGATGGCT GAAACCGCCA ACATCCTTCA CCACGCCAGC GATCGTTCCC TGGTGCTGCT CGATGAAATC GGGCGGGGCA CCGCCACCTT CGATGGCCTC TCCATCGCCT GGGCCGTGAG CGAGCACCTG GCGGGCGACC TGGGCAGCCG CACGGTGTTC GCCACCCACT ATCACGAACT CAATAACCTG GCCGCCGAAC GCGACAACGT GGCCAACTTC CAGGTGCTGG TGGAGGAAAC CGGTGAGGAT CTGGTGTTCC TTCATCAGGT GCAGGCTGGT GGCGCCAGCC GTAGCTACGG CATCGAAGCG GCACGCCTGG CCGGCGTTCC CAAGCCCGTG GTGCAACGGG CCCGTCAGGT GCTCGATCAG TTGACGGCCT GA
|
Protein sequence | MPRSASQPPD DALQGNLFGA PEPAAPSATA TASEPETASH DLSDDELGAD AAARPRTRQA TASEGSSEAP AANDSEPSRD EPAWGHHSQL DPLQLTPMLR HYVELKAAHP ERVLLYRLGD FFECFFEDAI ELSRVLELTL TGKEGGKAIG RVPMAGIPHH AAERYCAELI KQGYSVALCD QLETTPTKGA LLKRDITRVL TPGTVLEEGM LSARRNNWLA AVVVEPAQGK QPLRWGLASA DVSTGEVQVM QREDSSALHQ QLAQQEASEL LWAAALDTER PAWCPERLRL TPMASTPFSP VEAERTLQQH YGLSSLDGLG LPEHPLALQA LGGLLGYLQD TQPLEEDSRI PLEVPAIVHR GDALVLDAQT RRNLELTATQ RDNQLQGSLL WAIDRTLTAM GGRCLRRWLE APLMDRQAIQ QRQDLVSSLV GERSLRLAIR QLLRPMGDLE RLAGRAGAGH AGARDLVAIA DGLERLPQLT ARLKSAISTG PEWLQQLLSP DPALAELART IRHKLVEAPP LSLSEGDLIH DGVDPLLDGL RNQLDDQDAW LSHQEQQERQ RCGISTLKLQ HHRTFGYFLA VSKAKATAVP EHWIRRQTLA NEERFITPDL KEREGRIFQL RARACQREYE LFCQLREQVG AMAAPIRQAA RAVAALDALT GLGDVAASGG YCAPTITDGR GLQLEDSRHP VVEQRLVETA FTPNDVQLGE GTDLVVLTGP NASGKSCYLR QIGLIQLMAQ IGSWVPARSA TVGIADRIFT RVGAVDDLAA GQSTFMVEMA ETANILHHAS DRSLVLLDEI GRGTATFDGL SIAWAVSEHL AGDLGSRTVF ATHYHELNNL AAERDNVANF QVLVEETGED LVFLHQVQAG GASRSYGIEA ARLAGVPKPV VQRARQVLDQ LTA
|
| |