Gene Syncc9605_0078 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSyncc9605_0078 
Symbol 
ID3737818 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus sp. CC9605 
KingdomBacteria 
Replicon accessionNC_007516 
Strand
Start bp75614 
End bp78325 
Gene Length2712 bp 
Protein Length903 aa 
Translation table11 
GC content68% 
IMG OID637774659 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_380412 
Protein GI78211633 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.167909 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCGGT CCGCGTCCCA GCCCCCTGAC GACGCCCTGC AGGGCAATCT GTTCGGGGCG 
CCTGAGCCGG CCGCCCCATC GGCGACAGCA ACGGCCAGCG AGCCTGAGAC CGCCAGCCAC
GATCTCAGCG ACGACGAACT TGGGGCCGAT GCAGCCGCCC GGCCCCGAAC GCGCCAGGCC
ACGGCAAGCG AGGGCAGCAG CGAAGCTCCG GCTGCAAACG ATTCAGAACC CAGCAGAGAC
GAACCAGCCT GGGGCCATCA CAGCCAGCTG GACCCGCTGC AGCTCACACC GATGCTGCGC
CACTACGTGG AGCTAAAAGC AGCCCATCCC GAAAGGGTGC TGCTCTACCG CCTGGGCGAC
TTCTTCGAAT GCTTCTTCGA AGACGCGATC GAGCTCTCCC GGGTGCTGGA ACTCACCCTC
ACCGGCAAGG AGGGCGGCAA GGCCATCGGC CGGGTGCCGA TGGCGGGCAT CCCCCACCAC
GCCGCCGAGC GCTATTGCGC CGAACTGATC AAGCAGGGCT ACAGCGTGGC CCTCTGCGAT
CAACTCGAAA CCACTCCCAC CAAGGGTGCC CTGCTCAAGC GGGACATCAC CCGGGTGCTG
ACCCCCGGCA CCGTGTTGGA GGAGGGCATG CTCAGCGCCC GCCGCAACAA CTGGCTGGCG
GCGGTGGTGG TGGAGCCGGC CCAGGGAAAA CAACCGCTGC GCTGGGGCCT GGCCAGCGCC
GATGTGAGCA CCGGCGAGGT GCAAGTGATG CAACGAGAAG ACAGCAGCGC CCTGCACCAG
CAGCTGGCCC AGCAGGAGGC CTCCGAATTG CTCTGGGCCG CCGCCCTCGA TACTGAGCGG
CCGGCCTGGT GCCCGGAACG GTTGCGGCTG ACGCCAATGG CCAGCACCCC CTTCAGCCCA
GTGGAGGCGG AGCGCACCCT GCAGCAGCAC TACGGCCTCA GCAGCCTCGA TGGCCTCGGC
CTACCGGAGC ACCCCCTAGC CCTGCAGGCC CTCGGTGGCC TGCTGGGCTA CCTGCAGGAC
ACCCAGCCCC TGGAGGAGGA CAGCCGCATT CCCCTCGAAG TGCCGGCAAT CGTGCACCGC
GGCGATGCCC TGGTGCTGGA TGCCCAGACC CGCCGCAACC TCGAGCTCAC CGCCACCCAG
CGCGACAACC AGTTGCAGGG GTCATTGCTC TGGGCCATCG ATCGCACCCT CACCGCCATG
GGCGGCCGTT GCCTACGCCG CTGGCTGGAA GCACCGTTGA TGGACCGCCA GGCCATTCAG
CAGCGCCAGG ATCTGGTGAG CAGCCTGGTG GGTGAACGCA GCCTGCGACT GGCGATCCGC
CAGCTACTTC GGCCGATGGG GGACCTGGAA CGGCTGGCCG GCCGGGCCGG GGCGGGCCAT
GCCGGTGCCC GCGATCTTGT GGCCATCGCC GATGGCCTTG AACGGTTGCC CCAGCTCACC
GCTCGGCTGA AATCGGCAAT CAGCACAGGG CCGGAGTGGT TGCAGCAGCT GCTCAGCCCC
GATCCAGCCC TGGCGGAGCT GGCCCGAACA ATTCGCCACA AGCTGGTGGA GGCCCCACCG
CTCTCCCTCT CCGAGGGCGA TCTGATCCAT GACGGCGTCG ACCCGCTGCT GGATGGGCTG
CGCAACCAGC TGGACGATCA GGACGCCTGG CTGAGCCACC AGGAGCAGCA AGAGCGCCAA
CGCTGTGGCA TCAGCACCCT GAAGCTGCAG CACCACCGCA CCTTCGGCTA TTTCCTGGCG
GTGAGCAAAG CCAAGGCCAC CGCCGTGCCG GAACACTGGA TCCGGCGCCA GACCCTGGCC
AACGAAGAAC GCTTCATTAC CCCGGATCTC AAGGAGCGGG AGGGCCGCAT CTTCCAGCTG
CGGGCCCGGG CTTGCCAGCG GGAATACGAG CTGTTCTGCC AGCTGCGAGA GCAGGTGGGG
GCCATGGCCG CCCCGATCCG CCAGGCAGCC CGCGCCGTTG CTGCACTCGA TGCCCTCACC
GGCCTGGGCG ATGTGGCCGC CAGCGGTGGC TACTGCGCAC CAACCATCAC CGACGGCCGA
GGGCTGCAGC TGGAGGACAG CCGCCATCCG GTGGTGGAGC AACGGCTGGT GGAAACCGCT
TTCACCCCCA ATGATGTGCA GCTCGGTGAG GGCACCGATC TGGTGGTGCT CACGGGCCCC
AATGCCAGTG GCAAGAGCTG CTACCTACGC CAGATCGGCC TGATCCAGTT GATGGCTCAG
ATCGGCAGCT GGGTGCCGGC CCGCTCAGCC ACGGTCGGCA TTGCCGATCG GATCTTCACC
CGGGTGGGTG CTGTGGATGA TCTGGCCGCC GGCCAGTCCA CCTTCATGGT GGAGATGGCT
GAAACCGCCA ACATCCTTCA CCACGCCAGC GATCGTTCCC TGGTGCTGCT CGATGAAATC
GGGCGGGGCA CCGCCACCTT CGATGGCCTC TCCATCGCCT GGGCCGTGAG CGAGCACCTG
GCGGGCGACC TGGGCAGCCG CACGGTGTTC GCCACCCACT ATCACGAACT CAATAACCTG
GCCGCCGAAC GCGACAACGT GGCCAACTTC CAGGTGCTGG TGGAGGAAAC CGGTGAGGAT
CTGGTGTTCC TTCATCAGGT GCAGGCTGGT GGCGCCAGCC GTAGCTACGG CATCGAAGCG
GCACGCCTGG CCGGCGTTCC CAAGCCCGTG GTGCAACGGG CCCGTCAGGT GCTCGATCAG
TTGACGGCCT GA
 
Protein sequence
MPRSASQPPD DALQGNLFGA PEPAAPSATA TASEPETASH DLSDDELGAD AAARPRTRQA 
TASEGSSEAP AANDSEPSRD EPAWGHHSQL DPLQLTPMLR HYVELKAAHP ERVLLYRLGD
FFECFFEDAI ELSRVLELTL TGKEGGKAIG RVPMAGIPHH AAERYCAELI KQGYSVALCD
QLETTPTKGA LLKRDITRVL TPGTVLEEGM LSARRNNWLA AVVVEPAQGK QPLRWGLASA
DVSTGEVQVM QREDSSALHQ QLAQQEASEL LWAAALDTER PAWCPERLRL TPMASTPFSP
VEAERTLQQH YGLSSLDGLG LPEHPLALQA LGGLLGYLQD TQPLEEDSRI PLEVPAIVHR
GDALVLDAQT RRNLELTATQ RDNQLQGSLL WAIDRTLTAM GGRCLRRWLE APLMDRQAIQ
QRQDLVSSLV GERSLRLAIR QLLRPMGDLE RLAGRAGAGH AGARDLVAIA DGLERLPQLT
ARLKSAISTG PEWLQQLLSP DPALAELART IRHKLVEAPP LSLSEGDLIH DGVDPLLDGL
RNQLDDQDAW LSHQEQQERQ RCGISTLKLQ HHRTFGYFLA VSKAKATAVP EHWIRRQTLA
NEERFITPDL KEREGRIFQL RARACQREYE LFCQLREQVG AMAAPIRQAA RAVAALDALT
GLGDVAASGG YCAPTITDGR GLQLEDSRHP VVEQRLVETA FTPNDVQLGE GTDLVVLTGP
NASGKSCYLR QIGLIQLMAQ IGSWVPARSA TVGIADRIFT RVGAVDDLAA GQSTFMVEMA
ETANILHHAS DRSLVLLDEI GRGTATFDGL SIAWAVSEHL AGDLGSRTVF ATHYHELNNL
AAERDNVANF QVLVEETGED LVFLHQVQAG GASRSYGIEA ARLAGVPKPV VQRARQVLDQ
LTA