Gene Syncc9902_0081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSyncc9902_0081 
Symbol 
ID3744073 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus sp. CC9902 
KingdomBacteria 
Replicon accessionNC_007513 
Strand
Start bp79456 
End bp82173 
Gene Length2718 bp 
Protein Length905 aa 
Translation table11 
GC content61% 
IMG OID637770247 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_376099 
Protein GI78183665 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.294772 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACTAT CCCTGCAAGG CAGCCTCTTT GGCGCGCCTG AGCCAACGGT CAATGCCCCA 
AACACAAGAC CATCAACTGG CGATCTGCCG AACCCGTTCA ACTCCGATCA CAACCTCAGC
GATGCAGACC TAAGCAAAGA CGCCTTGGCT CGACCGCGCC GACGCAACGA AACCAGCGGA
TCAACACCAA CGGCCGGAGT TGATCCAGAC GACCGAGCAG ATGAAACCCG TGACGACACC
ACGTCCACCG ACGAGCCGGC ATGGGGCCAC CACAGCCAAC TCAAACCTGA GCAGCTCACA
CCGGTGCTGC GCCATTACGT GGAACTCAAA ATGGCGCACC CGGAGCGGGT GCTGCTCTAC
CGGTTAGGCG ACTTTTTTGA ATGCTTTTTT GAAGATGCCA TCACCTTGTC GCGTGAGCTA
GAGCTCACCC TTACGGGAAA AGATGCCGGA AAAGCGATCG GGCGGGTGCC GATGGCGGGC
ATTCCCCATC ACGCCGCCGA ACGGCATTGC AGCGATTTGA TTCGCCTCGG GTATTCCGTA
GCGCTCTGTG ATCAACTCGA AACCACCCCC ACCAAAGGCG CCCTGCTGAA GCGGGACATC
ACCCGCGTGC TCACCCCCGG CACGGTGCTT GAGGAAGGCA TGCTCACGGC CCGACGCAAC
AACTGGCTTG CCGCGGTGGT GGTGGAGCCA GCCACCCAAC ACCAGCCATT TCGCTGGGGA
TTGGCCCAAG CGGATGTGAG CACCGGTGAT GTGCAGGTGT TGCAGCGGGA AGGCAGTGAT
GGCTTACATC AACACCTGGC GCGGCTTCAG GCCTCCGAAC TGCTTTGGAG TGGCGACGAT
CCTGCTCCGG CCTGGTGCCC AGATCGCGTC GGACTCACGC CAATGAGCAG CACCCCCTTC
AGCCGCCCTG AAGCGGAGGC TGTGCTGCTG GAGCACTACA ACCTGGCCAG TCTCGATGGC
ATCGGTCTAC CGGAAGTACC GCTCGCCCTG CAGGCCATCG GCGGTTTACT GCAGTACGTC
GGCGACACCC AACCGCTCGA AGACAACGCT CGAGTTCCCC TGGACGTGCC CGCAATCGTT
CACAACGGCG ACAGCCTCGT TCTAGATGCC CAAACCCGCC GCAACTTGGA ACTCACCGCC
ACCCAGCGGG ATGGGCTACT GCAGGGATCG TTGCTATGGG CCATCGATCG CACCCTCACC
GCCATGGGAG GGCGTTGCCT ACGCCGCTGG ATTGAAGCCC CGCTTATGGA TCGTTCTGCG
ATCCAGCAAC GCCAAACCGT GGTGAGCCGC CTTGTGGAGA AGCGGCCCTT GCGCCAAACG
CTGCGCCGAC TGCTGCGACC CATGGGCGAT CTCGAACGGT TGGCAGGACG CGCCGGAGCC
GGCCATGCCG GTGCTAGAGA TTTAGTGGCC ATCGCCGACG GGCTGGAACG ACTGCCGCAA
CTCGCCGCTC AACTCAACGG CCAGCTCACA GATGGGCCTG CATGGCTCCA GGCACTGTTT
GATCCACAAC CACAGCTACA GGAGCTGGCA ACAACCGTCA GCAACACACT GAAGGAAGCG
CCGCCACTGT CCCTCAGCGA AGGGGGGTTC ATCCACGATG GAGTCGACCC TCTTCTCGAC
GGTCTGCGCA ACCAACTCGA TGATCAAGAC GCCTGGCTCG CGCAGCAAGA ACGACAGGAG
CGGCAGGGGA GCGGGATCAG CACCCTGAGG CTTCAACACC ACCGCACCTT CGGCTACTTC
CTCGCGGTGA GCAAAGCCAA AACATCAAGC GTGCCGGACC ACTGGATCCG ACGCCAAACG
CTCGCCAATG AAGAACGCTT CATCACCCCC GAGCTCAAGG AGCGGGAAGG ACACATTTTC
CAGTTGCGTG CCCGTGCCTG CCAGCGCGAG TACGAGCTGT TCGTGCAGTT GCGGGAGCAG
GTGGGACTGA TGGCCACATC AATCCGCGAA GCCGCTCGGG CGGTGGCCGG GCTCGATGCT
CTGACTGGCC TGGCGGACGT GGCCGCCACC AGCAACTTTT GTGCGCCTGA ACTGACCAAC
AACCGTGAGC TGACTCTCTC GGCAGCGCGG CACCCCGTGG TGGAACAGCT GCTGGTGGAA
ACTCCCTTCA CACCGAACGA TGTAGCCCTC GGCAACGGCA GAGATTTAGT CGTGCTGACT
GGACCCAACG CCAGTGGCAA AAGCTGTTAT TTGCGCCAAA TCGGTCTGAT CCAACTGCTC
GCTCAAGTCG GCAGCTGGGT GCCCGCCAAA GAGGCGAAAA TCGGCATTGC CGATCGGATT
TTCACCCGCG TTGGTGCCGT CGATGATCTC GCCGCTGGCC AATCCACCTT CATGGTGGAA
ATGGCCGAAA CAGCCAACAT CCTGCATCAC GCCACATCAC GCTCTCTGGT GTTGCTCGAC
GAGATTGGGC GAGGCACCGC CACCTTCGAT GGCCTCTCGA TCGCCTGGGC TGTGAGTGAA
CATCTGGCCG GCGACCTCAA GGCCCGCACC GTGTTCGCTA CGCATTATCA CGAGCTCAAC
GCCTTGGCTG GCGAACGCGA CAACGTGGCC AACTTTCAGG TGATGGTGGA AGAAACGGGT
GACAATCTGC TCTTCTTGCA CCAGGTGAGG CCAGGCGGAG CCAGCCGAAG CTACGGCATC
GAAGCCGCAC GTCTCGCTGG TGTTCCAATG GCGGTGGTGC AGCGCGCTCA GCAAGTGCTG
GATCAACTGG GAGATTGA
 
Protein sequence
MELSLQGSLF GAPEPTVNAP NTRPSTGDLP NPFNSDHNLS DADLSKDALA RPRRRNETSG 
STPTAGVDPD DRADETRDDT TSTDEPAWGH HSQLKPEQLT PVLRHYVELK MAHPERVLLY
RLGDFFECFF EDAITLSREL ELTLTGKDAG KAIGRVPMAG IPHHAAERHC SDLIRLGYSV
ALCDQLETTP TKGALLKRDI TRVLTPGTVL EEGMLTARRN NWLAAVVVEP ATQHQPFRWG
LAQADVSTGD VQVLQREGSD GLHQHLARLQ ASELLWSGDD PAPAWCPDRV GLTPMSSTPF
SRPEAEAVLL EHYNLASLDG IGLPEVPLAL QAIGGLLQYV GDTQPLEDNA RVPLDVPAIV
HNGDSLVLDA QTRRNLELTA TQRDGLLQGS LLWAIDRTLT AMGGRCLRRW IEAPLMDRSA
IQQRQTVVSR LVEKRPLRQT LRRLLRPMGD LERLAGRAGA GHAGARDLVA IADGLERLPQ
LAAQLNGQLT DGPAWLQALF DPQPQLQELA TTVSNTLKEA PPLSLSEGGF IHDGVDPLLD
GLRNQLDDQD AWLAQQERQE RQGSGISTLR LQHHRTFGYF LAVSKAKTSS VPDHWIRRQT
LANEERFITP ELKEREGHIF QLRARACQRE YELFVQLREQ VGLMATSIRE AARAVAGLDA
LTGLADVAAT SNFCAPELTN NRELTLSAAR HPVVEQLLVE TPFTPNDVAL GNGRDLVVLT
GPNASGKSCY LRQIGLIQLL AQVGSWVPAK EAKIGIADRI FTRVGAVDDL AAGQSTFMVE
MAETANILHH ATSRSLVLLD EIGRGTATFD GLSIAWAVSE HLAGDLKART VFATHYHELN
ALAGERDNVA NFQVMVEETG DNLLFLHQVR PGGASRSYGI EAARLAGVPM AVVQRAQQVL
DQLGD