Gene Synpcc7942_1780 details


Gene Information

Locus tag: Synpcc7942_1780
Symbol: mutL
ID: 3774355
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Synechococcus elongatus PCC 7942
Kingdom: Bacteria
Replicon accession: NC_007604
Strand:
Start bp: 1845760
End bp: 1847385
Gene Length: 1626 bp
Protein Length: 541 aa
Translation table: 11
GC content: 58%
IMG OID: 637800221
Product: DNA mismatch repair protein
Protein accession: YP_400797
Protein GI: 81300589
COG category: [L] Replication, recombination and repair
COG ID: [COG0323] DNA mismatch repair enzyme (predicted ATPase)
TIGRFAM ID: [TIGR00585] DNA mismatch repair protein MutL
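
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information record.
START_BP = 1845760
END_BP = 1847385


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1626, matches "Gene Length"
print(protein_length(length_bp))  # 541, matches "Protein Length"
```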


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.644143
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGATCG CTCCCACAAT TCAACCGCTG CCCGCGCCTT GGGTCGCTCG CATGGCTGCA 
GGAGAAGTTA TCGATTCGCT GGCTGCGGTT GTCCGAGAAC TTTGTGAAAA CAGTTTAGAT
GCGGGCACGC AGCGGCTTGT GATTGACCTC TGGCCCGAGC AATGGCGGGT CCGCGTAGCT
GACGATGGCA TGGGTCTTTC CCTCGAGAAT CTGCAACTCG CAGCCCTGGC TCATACCAGC
AGCAAGCTCA GCCCCGATCG CTTTACGGCG GAACAGTTGG GCTTCCGCGG CGAAGCGCTG
CACAGCCTAG CGCGGCTGGG TCGCCTGACG ATTGCCAGTC GCACGGCAGA TGCTGCAGCA
GGTTGGCAAA TCAGTTACGA CATCGGCGGT CAGCCTCAAG TTGCAACCCC GATCGCGATC
GCACCGGGTT GCATTGTCGA AGTCCAGCAG CTTTTTCAGG ATTGGCTAGA GCGTCGTCAG
AGTCAGCCGA GTCCGGCCCA ACAACTGCGA GCGGTGCAGC AGCAGATTCA AAATCTGGTC
TTGGCGCACC CGAGCATCAC GGTTCAAGTC ACACAGAACG ATCGACCCTG GCTCGCGTTT
GCGCCTGTTC ATCATCCTCA AGAGCGACTA TTGCAATTAC TGCCCAATAC TAGTCCTGCT
GATTGGCGAT CGCAGTCTCT CGATCTTGGT TCTGAAGGGC AATTGCAAGT CGTCGTTGGC
TTGCCCGATC GCTGTCACCG TCGCCGTCCC GATTGGGTCA AGGTTGCGGT CAATGGTCGT
GTGGTGCGGG TGTCGGAACT GGAGCAAGCG ATGATCGGTG CGTTACATCG CAGCCTGCCC
CGCGATCGCT TCCCGGTTGT CTTTGCGCAT TTGCAAGTGC CGCCCAGCCA AGTCGATTGG
CACCGCCATC CGGCTAAGGC TGAGTTATTC CTACAGGATT TGCCAGTCTG GTGTGAGCGT
ATTCAAGAGG CGATCGCCCA AACCCTACCC CTCGATCCCG CAGAAGAAAT TGAGTTGCCC
GCCAGTACAG CCTTACTTCG AGTTGCAGAA ACAACAGGTA GCTATAGCGA GTCTCCGAGT
CACCTGCGGG CGATCGCTCA AGTTCTCAAT ACCTATGTAT TGGCGGAACA GGGCGATAGT
CTTTGGCTAA TTGAGCAGCA TATTGCCCAC GAGCGTGTTC TGTTTGAACA GTTGCAAGAC
GACTGGCAAT TGGTGGCTTG TCAGCAGCCA ATCCTGTTAT CCAAACTGTC GCTAGAGCAA
AGGCTCCAAC TTGATCGTTT GGGGATTACT GCCGAAGATT TTGGCGAGGA TCTTTGGGCG
ATCCGGCACA TTCCCGCAGC GCTAGCCCAG CGTGAGGATC TCGTCGAGGC ACTGCTAGAA
CTCAGTCGTG GTTGGGATCT CAGCGCAGCA CAAGCCGCGA TCGCCTGCCG AACGGCGATC
CGCAATGGCA CGCTGCTCTA TCCTGAGGAG CAGCAAACCT TGATCGATCG CTGGCAGCAC
TGTCGCCAAC CTCGCACCTG TCCCCATGGA CGTCCGATCG CGCTGGTGCT ACCGGAAACC
AGTCTGGCGC GTTACTTCCG TCGCCAGTGG ATGATTGGCC GAAGCCACGG TTTGGGCGAT
CCCTAA
 
Protein sequence
MAIAPTIQPL PAPWVARMAA GEVIDSLAAV VRELCENSLD AGTQRLVIDL WPEQWRVRVA 
DDGMGLSLEN LQLAALAHTS SKLSPDRFTA EQLGFRGEAL HSLARLGRLT IASRTADAAA
GWQISYDIGG QPQVATPIAI APGCIVEVQQ LFQDWLERRQ SQPSPAQQLR AVQQQIQNLV
LAHPSITVQV TQNDRPWLAF APVHHPQERL LQLLPNTSPA DWRSQSLDLG SEGQLQVVVG
LPDRCHRRRP DWVKVAVNGR VVRVSELEQA MIGALHRSLP RDRFPVVFAH LQVPPSQVDW
HRHPAKAELF LQDLPVWCER IQEAIAQTLP LDPAEEIELP ASTALLRVAE TTGSYSESPS
HLRAIAQVLN TYVLAEQGDS LWLIEQHIAH ERVLFEQLQD DWQLVACQQP ILLSKLSLEQ
RLQLDRLGIT AEDFGEDLWA IRHIPAALAQ REDLVEALLE LSRGWDLSAA QAAIACRTAI
RNGTLLYPEE QQTLIDRWQH CRQPRTCPHG RPIALVLPET SLARYFRRQW MIGRSHGLGD
P
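
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate the nucleotide sequence so it can be compared with the listed protein. It assumes the amino acid assignments of translation table 11 match the standard genetic code (the two tables differ only in permitted start codons). All helper names are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; table 11 uses the same
# amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, as printed in the record.
print(translate("ATGGCGATCG CTCCCACAAT TCAACCGCTG"))  # MAIAPTIQPL
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence gives a quick check that the two listings agree.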