Gene Synpcc7942_2247 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_2247 
Symbol 
ID3773903 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp2312310 
End bp2314958 
Gene Length2649 bp 
Protein Length882 aa 
Translation table11 
GC content57% 
IMG OID637800694 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_401264 
Protein GI81301056 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0873729 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.14084 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAACTT CTGAGCTTCC GGAGCATTTG GCTAAGCACG AGGTGCGCTA TGCCGATCAC 
CGTCAGGTGG AGCGCGATCG CCTGACGCCG ATGATGCAGC ACTACGCCGA GGTCAAGGAT
CAGCATCTGC AGCAGATCCT GCTCTATCGG ATGGGCGACT TTTTCGAGTG CTTCTTTCAG
GATGCGATCG TGGTTGCCCG CGAGCTAGAG CTGGTGTTGA CCAGCAAGGA AGCAGGGAAA
GAAGTCGGGC GTGTGCCGAT GGCGGGCATT CCCTATCACG CCCTCGATCG CTATGCCAGC
CAGTTAGTTG AGAAGGGTTA CGCGATCGCG ATCTGCGATC AGGTAGAAAC GGCTGCTCAG
GCGCAAGGGC CGCTCGTCCG CCGCGAAATC ACGCGGATCA TCACACCGGG CACCATCCTC
GAAGAGGGAA TGCTGCAGGC ACGCCGCAAT AACTTTTTGG CAGCGGTCGT GATCGCCGGC
GAGCATTGGG GGCTGGCCTA CGCAGATAGC TCCACTGGTG ACTATTGGAC GAGCCAAAGT
ACTGGTCTGG AGGGACTGAC CCAGGAGCTC TATCGCCTGC AACCCTCAGA AGTGCTCTTT
CCCAGTGCCG CACCGGATTT AGCCGGATTG CTGCGGCCGG GGCAGTCCAA GCCGCAACAG
ATTCCCGACT GCTTGCCGGA TTCGTTTTGC TACGCCCTGC GATCGCCCAT GCCCTTTGAA
TTGCATGAGG CACGGCAGCG GTTGCTGGAA CATTTCCAAC TGCGATCGCT GGAAGGCTGT
GGCTGTGAGC AATTACCCTT GGCGATCCGT GCTGCTGGTG GTCTCCTGGA CTACTTGGGA
GAAACCCAGC GGGAAAGCCT TGCGCCATTA CAAAAGCCGC GTACCTATTC GCTGTCGGAG
TTTTTAATCC TCGATCAACA AACCCGCCGC AATTTAGAGA TCACTCAAAC TCAGCGGGAT
GGCAGCTTCC ACGGCTCGCT GCTCTGGGCG CTCGATCGCA CGATGACTAG CATGGGTGGC
CGCCTGTTGC GCCGCTGGTT GCTGCAACCC TTGCTCAACC CTGAGGCGAT TCGCAACCGC
CAAGCCGCGA TTCAAGAACT CTGTCAGGAT GGGCGACTTC GGCAGGATCT GCGATCGCTA
CTCCAGAAGA TCTACGACCT TGAGCGGCTC TCAGGCCGCG CTGGTGCTGG GACGGCCAAT
GCCCGCGATC TCTTGGCGCT GGCGGAGTCG TTGCTACGCC TACCGGAGCT GGCGCAGTTA
CTCAGTCGTG CCCAATCACC CTTGCTGGCG CAATTACAGC AGGTGCCACC GGAACTGGAG
CAGTTGGGCG ATCGCCTCCA GCAGCATTTA GTCGAATCGC CCCCCTTACA ACTGACTGAA
GGCGGGTTGA TTCGATCCGG GGTTGCGATC GCCTTGGATG AGTTGCGGCA ACAAGTCGAG
AGCGATCGCC AGTGGATTGC CAGCCTCGAA GCCAGCGAGC GTACTGCGAC TGGCATCAAC
AGCCTCAAGG TGGGTTACAG CAAAACCTTC GGCTATTTCA TTAGCCTGAG TCGTAGCAAA
GCCGATCAGG TGCCCGATCA CTACATCCGC CGCCAAACCC TGACCAACGA AGAGCGCTTT
ATTACACCGG ATCTAAAGGA GCGGGAAAGC CGAATCCTCA ATGCTCAGAC GGATCTCAAT
CAACTGGAAT ACGACCTTTT TGTTGGGCTG CGCAGCGAAG TCAGTCACCA TGTCGAAACA
ATCCGCGCGA TCGCCACAGC AGTCGCTGCC GCTGATGTCT TAGCAGCTTT AGCGGAGGTA
GCAGTCTACC AAAACTATTG CTGCCCTGAG ATTCGTGACG ATCGCCAGTT GGCGATTCAG
GATGGTCGCC ACCCAGTCGT TGAGCAGGCT TTGCCGAGCG GCTTTTATGT ACCCAATAGT
TGCGGCCTCG GTAGCGATCG CGGCCCAGAT CTAATTGTTT TGACCGGGCC GAATGCCAGC
GGCAAAAGTT GTTATTTACG ACAGGTTGGC CTCATTCAAC TGCTGGCTCA AATTGGTAGT
TTTGTTCCCG CGAAGAATGC CCAAGTTGGA ATTTGCGATC GCATCTTCAC CCGCGTTGGA
GCCGTCGATG ATCTGGCAAC TGGCCAATCA ACTTTCATGG TGGAGATGAA CGAAACGGCT
AACATCCTCA ACCATGCCAC CGCGCGATCG CTGGTCTTGC TGGATGAGAT TGGTCGCGGT
ACCGCCACCT TTGATGGCCT CTCGATCGCC TGGGCTGTGG CGGAATATCT AGCAAGGGAA
ATCCAAGCCC GCACGATTTT TGCAACGCAC TACCACGAGC TGAATGAGCT GTCAGGCCTG
CTCAAGAACG TCGCCAACTT CCAAGTCACC GTTAAAGAAC TGCCCGATCG CATTGTCTTC
CTGCATCAAG TTCAGCCGGG GGGCGCCGAT CGTTCCTATG GCATTGAAGC AGCGCGGCTC
GCAGGTTTAC CCAGCGAGGT GATCGATCGC GCCCGTGAAG TGATGAGCCG CATTGAAAAA
CACAGCCGCA TTGCTGTGGG GTTACGGCGT GGCAATGGTA GCCAGCGTCG GACTCAGGCG
TCGCCCCAGC AGATAGATGC CACGATCGCT ACTGAACAAT TGGGCCTCTT CAGTGGTCCT
TCGCACTAA
 
Protein sequence
MTTSELPEHL AKHEVRYADH RQVERDRLTP MMQHYAEVKD QHLQQILLYR MGDFFECFFQ 
DAIVVARELE LVLTSKEAGK EVGRVPMAGI PYHALDRYAS QLVEKGYAIA ICDQVETAAQ
AQGPLVRREI TRIITPGTIL EEGMLQARRN NFLAAVVIAG EHWGLAYADS STGDYWTSQS
TGLEGLTQEL YRLQPSEVLF PSAAPDLAGL LRPGQSKPQQ IPDCLPDSFC YALRSPMPFE
LHEARQRLLE HFQLRSLEGC GCEQLPLAIR AAGGLLDYLG ETQRESLAPL QKPRTYSLSE
FLILDQQTRR NLEITQTQRD GSFHGSLLWA LDRTMTSMGG RLLRRWLLQP LLNPEAIRNR
QAAIQELCQD GRLRQDLRSL LQKIYDLERL SGRAGAGTAN ARDLLALAES LLRLPELAQL
LSRAQSPLLA QLQQVPPELE QLGDRLQQHL VESPPLQLTE GGLIRSGVAI ALDELRQQVE
SDRQWIASLE ASERTATGIN SLKVGYSKTF GYFISLSRSK ADQVPDHYIR RQTLTNEERF
ITPDLKERES RILNAQTDLN QLEYDLFVGL RSEVSHHVET IRAIATAVAA ADVLAALAEV
AVYQNYCCPE IRDDRQLAIQ DGRHPVVEQA LPSGFYVPNS CGLGSDRGPD LIVLTGPNAS
GKSCYLRQVG LIQLLAQIGS FVPAKNAQVG ICDRIFTRVG AVDDLATGQS TFMVEMNETA
NILNHATARS LVLLDEIGRG TATFDGLSIA WAVAEYLARE IQARTIFATH YHELNELSGL
LKNVANFQVT VKELPDRIVF LHQVQPGGAD RSYGIEAARL AGLPSEVIDR AREVMSRIEK
HSRIAVGLRR GNGSQRRTQA SPQQIDATIA TEQLGLFSGP SH