Gene PCC8801_1937 details

Gene Information

Locus tag: PCC8801_1937
Symbol: mutL
ID: 7102885
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 2014559
End bp: 2016244
Gene Length: 1686 bp
Protein Length: 561 aa
Translation table: 11
GC content: 45%
IMG OID: 643474998
Product: DNA mismatch repair protein
Protein accession: YP_002372131
Protein GI: 218246760
COG category: [L] Replication, recombination and repair
COG ID: [COG0323] DNA mismatch repair enzyme (predicted ATPase)
TIGRFAM ID: [TIGR00585] DNA mismatch repair protein MutL
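
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the gene record above.
START_BP = 2014559
END_BP = 2016244
GENE_LENGTH_BP = 1686
PROTEIN_LENGTH_AA = 561


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes an unspliced CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1686
print(expected_protein_length(GENE_LENGTH_BP))  # 561
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields of the record.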


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGTCCC CTATCCAACC TTTACCCTTA AATGTTATTA ACCTGATCGC TGCTGGAGAG 
GTAATAGACT CCATCGCAGC AGTGGTAAGG GAATTGGTAG AAAATGCCTT AGATGCCGGG
GCAACTCGTT TAGTAATTTC GCTATTTCCT GAAAGTTGGC GAGTTCAGGT AGCCGATAAT
GGAACCGGAA TGACGTTAGC CGATCTCCGT CACTGTGCCT TACCCCACAG TACCAGCAAA
ATTCATCAAC TTGATGATCT GTGGAAAATT ACGACTTTAG GGTTTCGCGG AGAAGCATTA
CACAGTTTAG CCCAAGTAGC CGATTTAGAA ATTGCCAGTC GCTGTACCTC TGATGGAGTA
GGATGGTGTT TGCGCTATCA GTCTTCAGGA GAACCCCTCA GGGAAGAACC CACCGCGATC
GCCCCTGGTA CGATTGTCAC GGTAGGGAAT CTTTTTGGCA AGATGCCTGT TCGTCGTCAA
GGTTTACCAG CAATCTCAAC CCAACTCAAA GCAGTACAAA GTTTCATTGA AAATATGGCC
TTGTGCCATC CCCAAGTCAC TTGGCAAGTC TGGCACAATC AGCGATTATG GTTAAATATT
AGTCCAGGGA AAACCCCTCA ACAGATTTTA CCCCAACTCC TCAAGGGGGT TCATTATCAC
GATTTACAGT TTGTTTCCCA AGGTGTTAAG AGTCCTCAAG AATCAACCCA GAAGGATTTG
GATTTAATTG AAGTTACCCT AGGATTACCC GATCGCTGCC ATCGTCACCG ACCCGATTGG
GTTAAAGTGG GGATTAATGG TCGGATCGTG CGATCGCCCC CGGTAGAACA GGCAATTTTA
GTAGCATTTA GTCGAACCTT GCCTAAAGAT CGCTTTCCTG TCTGTTTTAT CCATTTAACC
CTCTGTCCGA GTCAAATTGA TTGGAACCGT CATCCAGCCA AGGTGGAAAT TTATCTCCAT
TCCCTCGATT TTTGGCAAGA ACAGGTGTCT AAACTGATTG AACAGGGGTT AAGGTTATCA
CCCCAAACCC TGGCCTCTGC TGCCCAAAAT CAACGGGTAG GGAAGTTACT CAAAGCATCC
GAAGAAAAAG CATCCTATCG CGTTGATGCT AAGGATCACC AGACTGATGC TAACGCGGTT
GGGTTAATGC CCTTAAAGGC TGTGGCACAG GTACGCAATA CTTATATTAT GGCTGAACAT
TCGACGGGGT TATGGTTAAT TGAACAACAT ATCGCCCATG AACGAGTGTT GTATGAAACG
TTGCAGGATA ATTGGCAATT AATCCCGCTA GAGACTCCGA TTATTTTAAC AAAATTATCA
GACAATCAAG TGGAACAATT AGCCAGAATT GGTTTAGAAA TTGAAGTTTT TGGAGAGCAA
CTTTGGGCAG TTCGGACAGT TCCTAAACTG TTATCAACGA GGGAAGATTG TCCAGAGGCT
TTAGTCGAAT TAAGCATAGG AGGAGATTTA CAAACGGCTC AAGTGGCTGT TGCTTGTCGT
AGTGCAATTC GTAACGGAAC CCCCATGACG CTATCCCAAA TGCAGGAACT GTTAGACCAA
TGGAAAACTA CCCGTAATCC TGCCACTTGT CCCCACGGAA GACCTATTTA TTTATCCTTA
GAGGAGTCTT CTTTATCTCG GTTTTTCCGT CGTCATTGGG TCATTGGCAA AAGCCATGGA
ATCTGA
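
The gene sequence above is printed in space-separated blocks of ten bases, so it needs to be cleaned before any computation. The sketch below shows one way to strip that layout and compute a GC fraction. The helper names are hypothetical, and how the database derives its reported GC content is not stated here, so this is only an illustration applied to the first line of the sequence.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied verbatim including block spacing.
first_line = "ATGTCGTCCC CTATCCAACC TTTACCCTTA AATGTTATTA ACCTGATCGC TGCTGGAGAG "
fragment = clean_sequence(first_line)

print(len(fragment))                     # number of bases after cleaning
print(round(gc_fraction(fragment), 2))   # GC fraction of this fragment only
```

To check the full gene, the same two functions can be applied to all sequence lines joined together; the cleaned length should then match the Gene Length field.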
 
Protein sequence
MSSPIQPLPL NVINLIAAGE VIDSIAAVVR ELVENALDAG ATRLVISLFP ESWRVQVADN 
GTGMTLADLR HCALPHSTSK IHQLDDLWKI TTLGFRGEAL HSLAQVADLE IASRCTSDGV
GWCLRYQSSG EPLREEPTAI APGTIVTVGN LFGKMPVRRQ GLPAISTQLK AVQSFIENMA
LCHPQVTWQV WHNQRLWLNI SPGKTPQQIL PQLLKGVHYH DLQFVSQGVK SPQESTQKDL
DLIEVTLGLP DRCHRHRPDW VKVGINGRIV RSPPVEQAIL VAFSRTLPKD RFPVCFIHLT
LCPSQIDWNR HPAKVEIYLH SLDFWQEQVS KLIEQGLRLS PQTLASAAQN QRVGKLLKAS
EEKASYRVDA KDHQTDANAV GLMPLKAVAQ VRNTYIMAEH STGLWLIEQH IAHERVLYET
LQDNWQLIPL ETPIILTKLS DNQVEQLARI GLEIEVFGEQ LWAVRTVPKL LSTREDCPEA
LVELSIGGDL QTAQVAVACR SAIRNGTPMT LSQMQELLDQ WKTTRNPATC PHGRPIYLSL
EESSLSRFFR RHWVIGKSHG I