Gene Syncc9902_1826 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSyncc9902_1826 
Symbol 
ID3742923 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus sp. CC9902 
KingdomBacteria 
Replicon accessionNC_007513 
Strand
Start bp1749869 
End bp1752313 
Gene Length2445 bp 
Protein Length814 aa 
Translation table11 
GC content58% 
IMG OID637772018 
ProductMutS 2 protein 
Protein accessionYP_377827 
Protein GI78185392 
COG category[L] Replication, recombination and repair 
COG ID[COG1193] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01069] MutS2 family protein 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACCTCC CGGTGGTATC CCTTCCAACT GCGACTGAGA CCGTGACCCA CCCCAGTCAC 
CATCAGGCCT TGAGTGAAAC GCTTGAGCTT TTGGAGTGGC CTGTGGTGTG CGAGCACCTG
GCTACGTTTG CCAGCACGCG CATGGGGCTC GAATCGGCTC GGGCAACGCA GTTGCCCCAG
TCGCTGGCGG AGACGCTTCA GCGACAGGCC GAAACCGTTG AAATGGCGGT TTTGGACGAT
CTCACTGAGG GAGGGTTGAG TTTCCGAGGT GTGAATGACC TTCGGCCAGT TCTGCTGCGA
TGCCTCAAGG GTGGTGTGGC CTCCGGAGAG GAGTTGCTTG CCGTGGCTGG AACCCTCGCT
GCGGCACGAA AATTGCGTCG TCAAATCGAC GATCAGGAGC TGCGTCCCGT CTGTACAGCT
CTGATCGAAA CGATGGTCAC CCTGCCTGAT CTAGAGCAAC GCTTGAAGTT TTCCCTGGAA
GAAGGTGGTC GTGTTGCGGA TCGGGCGAGC CCACCCTTGG CGGGCCTTCG ACAGCAATGG
AATGGCGTGC GCCAGGAGCG ACGCGACAAA CTTCAGGAGC TCACGCGTCG TTACGCGTCC
TTTCTCCAAG ACTCAGTCAT CGCACAACGG CATGGACGCC CTGTCCTCGC CGTCAAAGCA
GGCGCTGTGG GTCAGGTGTC GGGTCAAGTC CACGACAGTT CAGCGTCTGG GAACACCGTT
TTTATTGAGC CCCGCTCGGT GCTCACGATG GGCAACAAGC TTGTCGATAT CGAGGCTCGA
ATTCGAAAAG AGGAGCAGCG CGTTCTTGCC GAGTTGAGCG ATCTGGTTGC CCAGGACGAG
CGAGTTCTCA ATTCACTTGT TGAGATTTTG TTAGCCCTGG ATTTGGCTTT GGCCCGCGGC
CGTTACGGCC GATGGCTCGG GGCTGTTCCA CCCCATTTGT TGGAAGATCC TGAGGCTCCC
TTCTTGCTCA GGGACCTGCG ACATCCATTG CTTATTTGGC AACACAAACG TTCCAGCGGC
TCTCCGGTGG TGCCGATCAG CGTGGATGTG TCTGCACAAC TCCGTGTAGT CGCGATCACC
GGACCCAACA CCGGCGGCAA AACGGTGAGC TTGAAGAGCC TTGGTTTAGT GGCCCTCATG
GCTCGCGCCG GAATGCTGAT TCCTTGCTCT GGCCGGCCGT CCTTGCCCTG GTGTGCCCAA
GTGTTGGCCG ATATCGGCGA TGAGCAATCT CTTCAACAAA GCTTGTCCAC ATTTAGTGGA
CATATCAAAC GTATCGGCAG GATTCTGCAG GCGCTTGAAT CCGGTCCAGT TCCCGCCTTA
GTTCTGTTGG ATGAAGTGGG GGCAGGGACA GATCCAAGTG AAGGCACGGC TCTCGCAACT
GCTCTACTGA AGGCGCTTGC GGATCGGGCA CGACTCACGA TCGCGACAAC GCACTTTGGC
GAGCTCAAGG CGCTCAAATA CACCGACGAC CGATTTGAAA ACGCTTCGGT GGCTTTCAAT
GCCGAAACTT TGTCTCCGAC TTATGAACTT CTCTGGGGGA TCCCTGGGCG TAGCAATGCC
CTAGCCATTG CGACGCGTTT AGGCCTTGAT GCGGGGGTGT TGGATCAAGC GCAAGCCCTT
TTGGCTCTGG CGGCGGAGGG GGAGGTGAAC ACGGTGATTC AAGGTCTTGA GGAGCAACGG
CAGCGACAGC AAGCTGCGGC TGAAGATGCC GCAGCGCTCT TGGCGCGAAC AGAGCTGTTG
CATGAAGAGT TGCTGTTGCG CTGGCAAAAG CAAAAGCAGC AGACGGCATT GCATCAAGAA
CAAGGACGTC AGCGATTGGA ACAGTCCATC CGTGAAGGAC AGAAAGAGGT GCGTTCCTTG
ATCCGCCGGT TGCGGGACGG CCGCGCCGAC GGTGAAACGG CGCGTAAGGC TGGACAACGC
CTTCGCAAGT TGGAAGACCA TCACCGTCCA ACAAAGGAGA AGCGCGCACC GAAACCCGGC
TGGCGTCCTG AGGTGGGAGA ACGCGTTCGG TTGCTGGCCT TGGGAAAGGC CGCAGAAGTG
TTGGCAATCT CAGATGACGG TCTGCAGCTC ACGGTGCGTT GCGGTGTGAT GCGCACCACC
GTGGATCTCA ATGCTGTTGA AAGCTTGGAT GGACGGAAGG CTGAACCACC TCCGGTTCCT
GTTGTGAAAG TGCAAGCGCG CTCTGGCTTG GGAGCAGGCG CTCAAGTACG TACGAGCCGC
AACACTTTGG ATATTCGAGG GATGCGTGTG CACGAAGCTG AATCCACCGT TGAAGAGCAG
CTCCGCAATG CCAATGGACC GCTGTGGGTG ATCCATGGAA TCGGCACAGG CAAGCTGAAG
CGCGGCCTGA GAGCGTGGCT GGATACGGTT CCCTACGTGG AGAGGGTCGT TGATGCGGAG
CAGGGCGATG GGGGGCCAGG TTGCAGCGTT GTTTGGGTGC GTTGA
 
Protein sequence
MNLPVVSLPT ATETVTHPSH HQALSETLEL LEWPVVCEHL ATFASTRMGL ESARATQLPQ 
SLAETLQRQA ETVEMAVLDD LTEGGLSFRG VNDLRPVLLR CLKGGVASGE ELLAVAGTLA
AARKLRRQID DQELRPVCTA LIETMVTLPD LEQRLKFSLE EGGRVADRAS PPLAGLRQQW
NGVRQERRDK LQELTRRYAS FLQDSVIAQR HGRPVLAVKA GAVGQVSGQV HDSSASGNTV
FIEPRSVLTM GNKLVDIEAR IRKEEQRVLA ELSDLVAQDE RVLNSLVEIL LALDLALARG
RYGRWLGAVP PHLLEDPEAP FLLRDLRHPL LIWQHKRSSG SPVVPISVDV SAQLRVVAIT
GPNTGGKTVS LKSLGLVALM ARAGMLIPCS GRPSLPWCAQ VLADIGDEQS LQQSLSTFSG
HIKRIGRILQ ALESGPVPAL VLLDEVGAGT DPSEGTALAT ALLKALADRA RLTIATTHFG
ELKALKYTDD RFENASVAFN AETLSPTYEL LWGIPGRSNA LAIATRLGLD AGVLDQAQAL
LALAAEGEVN TVIQGLEEQR QRQQAAAEDA AALLARTELL HEELLLRWQK QKQQTALHQE
QGRQRLEQSI REGQKEVRSL IRRLRDGRAD GETARKAGQR LRKLEDHHRP TKEKRAPKPG
WRPEVGERVR LLALGKAAEV LAISDDGLQL TVRCGVMRTT VDLNAVESLD GRKAEPPPVP
VVKVQARSGL GAGAQVRTSR NTLDIRGMRV HEAESTVEEQ LRNANGPLWV IHGIGTGKLK
RGLRAWLDTV PYVERVVDAE QGDGGPGCSV VWVR