Gene Syncc9605_0509 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSyncc9605_0509 
Symbol 
ID3736754 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus sp. CC9605 
KingdomBacteria 
Replicon accessionNC_007516 
Strand
Start bp500151 
End bp502550 
Gene Length2400 bp 
Protein Length799 aa 
Translation table11 
GC content65% 
IMG OID637775107 
ProductMutS2 family protein 
Protein accessionYP_380838 
Protein GI78212059 
COG category[L] Replication, recombination and repair 
COG ID[COG1193] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01069] MutS2 family protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0204433 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.523172 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGTCAAG AGGCGGATCG GGCTCAGAAG GAAACCCTCG AACTGCTGGA GTGGCATCGG 
GTCTGTGACC ATCTCAGCGG CTTTGCCAGC ACCGGCATGG GCCGTGATGC GGCCCGGGTG
CAACCGCTTC CGGCCAGCCT GGATGAGTCG AAACAACGCC TGGCCGAGAC CGTTGAAATG
GCGGTGCTTA ATGACCTCAC CGAGGGCGGG CTCAGCTTCC GCGGCGTGCA GAACCTCGAG
CCCGTGGTGC TGCGTTGCAG CAAGGGAGGA GTGGCCTCCG GCGAAGAGCT GCTGGCCGTG
GCGGAAACGT TGGCAGCGGC CCGTCGCCTG CGCCGTCAGA TCGACGACCC CGAGCTGCGC
CCGGTTTGCA CCGCGTTGAT TGAAACGATG GTGACGCTGC CGGAGTTGGA GCAGCGGCTC
AAATTTGCCC TCGAAGAGGG CGGCCGCGTC GCTGATCGCG CCAGCTCGGC CTTATCGGCG
CTTCGGCATC AATGGAACGG ACTGCGCCAG GAGCGTCGCG ACAAACTTCA GGAGTTGCTG
CGTCGCCTGG CGCCATCTCT GCAGGACAGT GTGATCGCCG AGCGTCATGG CCGTCCTGTT
CTGGCGGTGA AGGCCGGTGC CGTAAGCCAG GTGCCAGGGC AGGTGCACGA CAGTTCGGCT
TCAGGCAGCA CCATCTTCGT GGAACCCCGC TCGGTGCTCA CCATGGGCAA TAAGCTGGCG
GAGCTGGAGT CCCGCATCCG GGATGAGGAA CGCAAGGTGT TGGCGGAGCT GAGTGCTCTG
GTGGCTGAAG AGGCGTCTGC CCTCAACCAG GTGGTGGCCG TGCTGCGCAC GCTTGATCTT
GCTCTTGCCC GTGGGCGTTA CGGCCGCTGG CTTGGCGGCG TCGAGCCGCA GCTTGAGCCG
GCAGCGGAGG CTCCGTTTCG CTTTTCAGGT CTGCGGCACC CGCTGTTGGT GTGGCAGCAC
AAGCGCGCGG ATGGGCCACC CGTGGTTCCC ATTTCGGTGG AGGTTTCTCC GGAGCTGCGG
GTTGTGGCGA TTACCGGGCC CAACACCGGT GGCAAAACGG TCACCCTGAA AAGCATCGGC
CTCGCTGCAC TGATGGCGCG CGCCGGGATG CTCTTGCCCT GCTCGGGCCA ACCCTCCCTC
CCCTGGTGTG CCCAGGTGCT GGCGGACATC GGCGATGAGC AATCCCTTCA GCAGAGCCTG
TCCACTTTCA GCGGCCACGT GAAGCGCATC GGGCGCATCC TTGAGGCGCT CCAGCGCGGC
AGTGCCCCTG CCCTGGTGCT GCTGGATGAG GTGGGTGCTG GAACAGATCC CAGTGAGGGA
ACGGCCCTGG CCACGGCTCT GCTCAAGGCT CTTGCCGATC GAGCCCGGCT CACGATTGCC
ACCACCCATT TCGGCGAACT CAAGGCCCTC AAATACGACG ACGCTCGTTT TGAGAATGCC
TCTGTGGCCT TTAACCCTGA GACGTTATCC CCCACCTACG AGTTGCTTTG GGGAATTCCG
GGACGCAGCA ATGCGCTGGC GATCGCGACG CGCCTGGGGC TCGATTCGGA TGTGCTTCAC
CAGGCCCAGC AGCTTCTGGC CCCAGGAGGT GATGGTGAGG TGAACAGTGT GATCCGTGGC
TTGGAGGAGC AACGGCAGCG CCAGCAGGCC GCGGCAGAAG ATGCTGCAGC ACTCCTGGCA
CGCACTGAGC TGCTGCACGA GGAGTTGCTG CAGCGCTGGC AGAAGCAAAA GCAGCAGACC
GCGCAACGTC AGGAGCAGGG TCGTCAACGC CTGGAGCAGT CGATCCGTCA GGGCCAGAAG
GAAGTTCGCA CCCTGATTCG CCGTCTGCGG GATGAACGCG CGGATGGGGA AACCGCGCGG
CGGGCTGGGC AACGGTTACG CAGCCTGGAG GACCATCACC GCCCCACCCC AGAACGGCGT
GCACCCAAGC CAGGCTGGCG TCCGGCCGTG GGCGATCACG TGCGCTTGCT GGCCCTCGGT
AAGGCTGCAG ATGTGTTGGC CATCACCGAT GACGGCCTTC AGCTGACAGT CCGTTGCGGG
GTGATGCGCA CCACGGTGGA TCTGACGGCG GTGGAAAGTC TGGATGGGCG TAAGCCCGAG
CCGCCTCCAA AGCCGGTGGT GAAGGTGCAT GCCCGTTCAG CCGGTGGCGG TGGTACACAG
GTGCGCACCA GTCGCAATAC CCTTGATGTG CGGGGCATGC GGGTGCATGA GGCCGAAGCA
GCGGTTGAGG AATGCTTGCG CAGTGCCAAT GGCCCGGTTT GGGTGATCCA TGGCATTGGC
ACGGGCAAGC TCAAGCGCGG CCTGCGCGCC TGGCTGGACA CGGTGCCCTA CGTGGAACGG
GTGACTGATG CGGAGCAGGG GGACGGCGGA CCGGGCTGCA GCGTTGTCTG GGTGCGCTGA
 
Protein sequence
MSQEADRAQK ETLELLEWHR VCDHLSGFAS TGMGRDAARV QPLPASLDES KQRLAETVEM 
AVLNDLTEGG LSFRGVQNLE PVVLRCSKGG VASGEELLAV AETLAAARRL RRQIDDPELR
PVCTALIETM VTLPELEQRL KFALEEGGRV ADRASSALSA LRHQWNGLRQ ERRDKLQELL
RRLAPSLQDS VIAERHGRPV LAVKAGAVSQ VPGQVHDSSA SGSTIFVEPR SVLTMGNKLA
ELESRIRDEE RKVLAELSAL VAEEASALNQ VVAVLRTLDL ALARGRYGRW LGGVEPQLEP
AAEAPFRFSG LRHPLLVWQH KRADGPPVVP ISVEVSPELR VVAITGPNTG GKTVTLKSIG
LAALMARAGM LLPCSGQPSL PWCAQVLADI GDEQSLQQSL STFSGHVKRI GRILEALQRG
SAPALVLLDE VGAGTDPSEG TALATALLKA LADRARLTIA TTHFGELKAL KYDDARFENA
SVAFNPETLS PTYELLWGIP GRSNALAIAT RLGLDSDVLH QAQQLLAPGG DGEVNSVIRG
LEEQRQRQQA AAEDAAALLA RTELLHEELL QRWQKQKQQT AQRQEQGRQR LEQSIRQGQK
EVRTLIRRLR DERADGETAR RAGQRLRSLE DHHRPTPERR APKPGWRPAV GDHVRLLALG
KAADVLAITD DGLQLTVRCG VMRTTVDLTA VESLDGRKPE PPPKPVVKVH ARSAGGGGTQ
VRTSRNTLDV RGMRVHEAEA AVEECLRSAN GPVWVIHGIG TGKLKRGLRA WLDTVPYVER
VTDAEQGDGG PGCSVVWVR