Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Syncc9902_0081 |
Symbol | |
ID | 3744073 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus sp. CC9902 |
Kingdom | Bacteria |
Replicon accession | NC_007513 |
Strand | - |
Start bp | 79456 |
End bp | 82173 |
Gene Length | 2718 bp |
Protein Length | 905 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637770247 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_376099 |
Protein GI | 78183665 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.294772 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACTAT CCCTGCAAGG CAGCCTCTTT GGCGCGCCTG AGCCAACGGT CAATGCCCCA AACACAAGAC CATCAACTGG CGATCTGCCG AACCCGTTCA ACTCCGATCA CAACCTCAGC GATGCAGACC TAAGCAAAGA CGCCTTGGCT CGACCGCGCC GACGCAACGA AACCAGCGGA TCAACACCAA CGGCCGGAGT TGATCCAGAC GACCGAGCAG ATGAAACCCG TGACGACACC ACGTCCACCG ACGAGCCGGC ATGGGGCCAC CACAGCCAAC TCAAACCTGA GCAGCTCACA CCGGTGCTGC GCCATTACGT GGAACTCAAA ATGGCGCACC CGGAGCGGGT GCTGCTCTAC CGGTTAGGCG ACTTTTTTGA ATGCTTTTTT GAAGATGCCA TCACCTTGTC GCGTGAGCTA GAGCTCACCC TTACGGGAAA AGATGCCGGA AAAGCGATCG GGCGGGTGCC GATGGCGGGC ATTCCCCATC ACGCCGCCGA ACGGCATTGC AGCGATTTGA TTCGCCTCGG GTATTCCGTA GCGCTCTGTG ATCAACTCGA AACCACCCCC ACCAAAGGCG CCCTGCTGAA GCGGGACATC ACCCGCGTGC TCACCCCCGG CACGGTGCTT GAGGAAGGCA TGCTCACGGC CCGACGCAAC AACTGGCTTG CCGCGGTGGT GGTGGAGCCA GCCACCCAAC ACCAGCCATT TCGCTGGGGA TTGGCCCAAG CGGATGTGAG CACCGGTGAT GTGCAGGTGT TGCAGCGGGA AGGCAGTGAT GGCTTACATC AACACCTGGC GCGGCTTCAG GCCTCCGAAC TGCTTTGGAG TGGCGACGAT CCTGCTCCGG CCTGGTGCCC AGATCGCGTC GGACTCACGC CAATGAGCAG CACCCCCTTC AGCCGCCCTG AAGCGGAGGC TGTGCTGCTG GAGCACTACA ACCTGGCCAG TCTCGATGGC ATCGGTCTAC CGGAAGTACC GCTCGCCCTG CAGGCCATCG GCGGTTTACT GCAGTACGTC GGCGACACCC AACCGCTCGA AGACAACGCT CGAGTTCCCC TGGACGTGCC CGCAATCGTT CACAACGGCG ACAGCCTCGT TCTAGATGCC CAAACCCGCC GCAACTTGGA ACTCACCGCC ACCCAGCGGG ATGGGCTACT GCAGGGATCG TTGCTATGGG CCATCGATCG CACCCTCACC GCCATGGGAG GGCGTTGCCT ACGCCGCTGG ATTGAAGCCC CGCTTATGGA TCGTTCTGCG ATCCAGCAAC GCCAAACCGT GGTGAGCCGC CTTGTGGAGA AGCGGCCCTT GCGCCAAACG CTGCGCCGAC TGCTGCGACC CATGGGCGAT CTCGAACGGT TGGCAGGACG CGCCGGAGCC GGCCATGCCG GTGCTAGAGA TTTAGTGGCC ATCGCCGACG GGCTGGAACG ACTGCCGCAA CTCGCCGCTC AACTCAACGG CCAGCTCACA GATGGGCCTG CATGGCTCCA GGCACTGTTT GATCCACAAC CACAGCTACA GGAGCTGGCA ACAACCGTCA GCAACACACT GAAGGAAGCG CCGCCACTGT CCCTCAGCGA AGGGGGGTTC ATCCACGATG GAGTCGACCC TCTTCTCGAC GGTCTGCGCA ACCAACTCGA TGATCAAGAC GCCTGGCTCG CGCAGCAAGA ACGACAGGAG CGGCAGGGGA GCGGGATCAG CACCCTGAGG CTTCAACACC ACCGCACCTT CGGCTACTTC CTCGCGGTGA GCAAAGCCAA AACATCAAGC GTGCCGGACC ACTGGATCCG ACGCCAAACG CTCGCCAATG AAGAACGCTT CATCACCCCC GAGCTCAAGG AGCGGGAAGG ACACATTTTC CAGTTGCGTG CCCGTGCCTG CCAGCGCGAG TACGAGCTGT TCGTGCAGTT GCGGGAGCAG GTGGGACTGA TGGCCACATC AATCCGCGAA GCCGCTCGGG CGGTGGCCGG GCTCGATGCT CTGACTGGCC TGGCGGACGT GGCCGCCACC AGCAACTTTT GTGCGCCTGA ACTGACCAAC AACCGTGAGC TGACTCTCTC GGCAGCGCGG CACCCCGTGG TGGAACAGCT GCTGGTGGAA ACTCCCTTCA CACCGAACGA TGTAGCCCTC GGCAACGGCA GAGATTTAGT CGTGCTGACT GGACCCAACG CCAGTGGCAA AAGCTGTTAT TTGCGCCAAA TCGGTCTGAT CCAACTGCTC GCTCAAGTCG GCAGCTGGGT GCCCGCCAAA GAGGCGAAAA TCGGCATTGC CGATCGGATT TTCACCCGCG TTGGTGCCGT CGATGATCTC GCCGCTGGCC AATCCACCTT CATGGTGGAA ATGGCCGAAA CAGCCAACAT CCTGCATCAC GCCACATCAC GCTCTCTGGT GTTGCTCGAC GAGATTGGGC GAGGCACCGC CACCTTCGAT GGCCTCTCGA TCGCCTGGGC TGTGAGTGAA CATCTGGCCG GCGACCTCAA GGCCCGCACC GTGTTCGCTA CGCATTATCA CGAGCTCAAC GCCTTGGCTG GCGAACGCGA CAACGTGGCC AACTTTCAGG TGATGGTGGA AGAAACGGGT GACAATCTGC TCTTCTTGCA CCAGGTGAGG CCAGGCGGAG CCAGCCGAAG CTACGGCATC GAAGCCGCAC GTCTCGCTGG TGTTCCAATG GCGGTGGTGC AGCGCGCTCA GCAAGTGCTG GATCAACTGG GAGATTGA
|
Protein sequence | MELSLQGSLF GAPEPTVNAP NTRPSTGDLP NPFNSDHNLS DADLSKDALA RPRRRNETSG STPTAGVDPD DRADETRDDT TSTDEPAWGH HSQLKPEQLT PVLRHYVELK MAHPERVLLY RLGDFFECFF EDAITLSREL ELTLTGKDAG KAIGRVPMAG IPHHAAERHC SDLIRLGYSV ALCDQLETTP TKGALLKRDI TRVLTPGTVL EEGMLTARRN NWLAAVVVEP ATQHQPFRWG LAQADVSTGD VQVLQREGSD GLHQHLARLQ ASELLWSGDD PAPAWCPDRV GLTPMSSTPF SRPEAEAVLL EHYNLASLDG IGLPEVPLAL QAIGGLLQYV GDTQPLEDNA RVPLDVPAIV HNGDSLVLDA QTRRNLELTA TQRDGLLQGS LLWAIDRTLT AMGGRCLRRW IEAPLMDRSA IQQRQTVVSR LVEKRPLRQT LRRLLRPMGD LERLAGRAGA GHAGARDLVA IADGLERLPQ LAAQLNGQLT DGPAWLQALF DPQPQLQELA TTVSNTLKEA PPLSLSEGGF IHDGVDPLLD GLRNQLDDQD AWLAQQERQE RQGSGISTLR LQHHRTFGYF LAVSKAKTSS VPDHWIRRQT LANEERFITP ELKEREGHIF QLRARACQRE YELFVQLREQ VGLMATSIRE AARAVAGLDA LTGLADVAAT SNFCAPELTN NRELTLSAAR HPVVEQLLVE TPFTPNDVAL GNGRDLVVLT GPNASGKSCY LRQIGLIQLL AQVGSWVPAK EAKIGIADRI FTRVGAVDDL AAGQSTFMVE MAETANILHH ATSRSLVLLD EIGRGTATFD GLSIAWAVSE HLAGDLKART VFATHYHELN ALAGERDNVA NFQVMVEETG DNLLFLHQVR PGGASRSYGI EAARLAGVPM AVVQRAQQVL DQLGD
|
| |