Gene Cyan8802_1964 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_1964 
SymbolmutL 
ID8391279 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp1984838 
End bp1986523 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content45% 
IMG OID644979944 
ProductDNA mismatch repair protein 
Protein accessionYP_003137690 
Protein GI257059802 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.176062 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTCCC CTATCCAACC TTTACCCTTA AATGTTATTA ACCTGATCGC TGCTGGAGAG 
GTAATAGACT CCATCGCAGC AGTGGTAAGG GAATTGGTAG AAAATGCCTT AGATGCCGGG
GCAACTCGTT TAGTAATTTC GCTATTTCCT GAAAGTTGGC GAGTTCAGGT AGCCGATAAT
GGAACCGGAA TGACGTTAGC CGATCTCCGT CACTGTGCCT TACCCCACAG TACCAGCAAA
ATTCATCAAC TTGATGATCT GTGGAAAATT ACGACTTTAG GGTTTCGCGG AGAAGCATTA
CACAGTTTAG CCCAAGTAGC CGATTTAGAA ATTGCCAGTC GCTGTACCTC TGATGGAGTA
GGATGGTGTT TGCGCTATCA GTCTTCAGGA GAACCCCTCA GGGAAGAACC CACCGCGATC
GCCCCTGGTA CGATTGTCAC GGTAGGGAAT CTTTTTGGCA AGATGCCTGT TCGTCGTCAA
GGTTTACCAG CAATCTCAAC CCAACTCAAA GCAGTACAAA GTTTCATTGA AAATATGGCC
TTGTGCCATC CCCAAGTCAC TTGGCAAGTC TGGCACAATC AGCGATTATG GTTAAATATT
AGTCCAGGGA AAACCCCTCA ACAGATTTTA CCCCAACTCC TCAAGGGGGT TCATTATCAC
GATTTACAGT TTGTTTCCCA AGGTGTTAAG AGTCCTCAAG AATCAACCCA GAAGGATTTG
GATTTAATTG AAGTTACCCT AGGATTACCC GATCGCTGCC ATCGTCACCG ACCCGATTGG
GTTAAAGTGG GGATTAATGG TCGGATCGTG CGATCGCCCC CGGTAGAACA GGCAATTTTA
ACGGCTTTTA GTCGAACCTT GCCTAAAGAT CGCTTTCCTG TCTGTTTTAT CCATTTAACC
CTCTGTCCGA GTCAAATTGA TTGGAACCGT CATCCGGCTA AGGTGGAAAT TTATCTCCAT
TCCCTCGATT TTTGGCGAGA ACAGGTGTCT AAACTGATTG AACAGGGGTT AAGGTTATCA
CCCCAAACCC TGGCCTCTGC TGCCCAAAAT CAACGGGTAG GGAAGTTACT CAAAGCATCC
GAAGAAAAAG CATCCTATCG CGTTGATGCT AAGGATCACC AGACTGATGC TAACGCGGTT
GGGTTAATGC CCTTAAAGGC TGTGGCACAG GTACGCAATA CTTATATTGT GGCTGAACAT
TCGACGGGGT TATGGTTAAT TGAACAACAT ATCGCCCATG AACGAGTGTT GTATGAAACG
TTGCAGGATA ATTGGCAATT AATCCCGCTA GAGACTCCGA TTATTTTAAC AAAATTATCA
GACAATCAAG TGGAACAATT AGCCAGAATT GGTTTAGAAA TTGAAGTTTT TGGAGAGCAA
CTTTGGGCAG TTCGGACAGT TCCTAAACTG TTATCAACGA GGGAAGATTG TCCAGAGGCT
TTAGTCGAAT TAAGCATAGG AGGAGATTTA CAAACGGCTC AAGTGGCTGT TGCTTGTCGT
AGTGCGATTC GTAACGGAAC CCCCATGACG CTATCCCAAA TGCAGGAACT GTTAGACCAA
TGGAAAACTA CCCGTAATCC TGCCACTTGT CCCCACGGAA GACCTATTTA TTTATCCTTA
GAGGAATCTT CTTTATCTCG GTTTTTTCGT CGTCATTGGG TCATTGGCAA AAGCCATGGA
ATCTAA
 
Protein sequence
MSSPIQPLPL NVINLIAAGE VIDSIAAVVR ELVENALDAG ATRLVISLFP ESWRVQVADN 
GTGMTLADLR HCALPHSTSK IHQLDDLWKI TTLGFRGEAL HSLAQVADLE IASRCTSDGV
GWCLRYQSSG EPLREEPTAI APGTIVTVGN LFGKMPVRRQ GLPAISTQLK AVQSFIENMA
LCHPQVTWQV WHNQRLWLNI SPGKTPQQIL PQLLKGVHYH DLQFVSQGVK SPQESTQKDL
DLIEVTLGLP DRCHRHRPDW VKVGINGRIV RSPPVEQAIL TAFSRTLPKD RFPVCFIHLT
LCPSQIDWNR HPAKVEIYLH SLDFWREQVS KLIEQGLRLS PQTLASAAQN QRVGKLLKAS
EEKASYRVDA KDHQTDANAV GLMPLKAVAQ VRNTYIVAEH STGLWLIEQH IAHERVLYET
LQDNWQLIPL ETPIILTKLS DNQVEQLARI GLEIEVFGEQ LWAVRTVPKL LSTREDCPEA
LVELSIGGDL QTAQVAVACR SAIRNGTPMT LSQMQELLDQ WKTTRNPATC PHGRPIYLSL
EESSLSRFFR RHWVIGKSHG I