Gene CPF_1359 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1359 
SymbolmutL 
ID4201601 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1535490 
End bp1537514 
Gene Length2025 bp 
Protein Length674 aa 
Translation table11 
GC content29% 
IMG OID638082240 
ProductDNA mismatch repair protein 
Protein accessionYP_695805 
Protein GI110800749 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.52416 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAATAGAA TAAATATTTT AAATGCAGAT ACAGCAAATA AAATAGCAGC AGGAGAGGTT 
GTTGAAAGAC CCTCTTCTGT GGTTAAAGAA CTTGTAGAAA ATTCATTAGA TGCAGGGGCT
AAAAATATAA CTATAGAGAT TCAAAATGGT GGAGAATCTC TTATAAAAAT AATAGATGAT
GGCTCAGGGG TTCATCCAGA GGATGTTGAA AAAGCTTTTA ATCCTCATGC TACAAGTAAA
ATAAAAGATA CTTATGATAT TTTTAGTATA AATACCCTTG GATTTAGAGG AGAAGCTTTG
CCTAGTATAG CTTCTATAGC AAGGGTTGAT TTTAAGAGTA AAGTATCAGA CTTTGACATG
GGTAAGGAAT TAGTAATTAG CGGGGGAGAA AAAGAATCTT TAACGGATTG TTCTATGAAT
AGAGGAACTC AAATAGAAGT TAGGGATTTA TTCTTTAATG TACCTGCTAG AAAGAAGTTT
TTAAAGACAA CGGCTAGAGA AAGTGCCTTA ATAAATGACT TAGTAAACAG AATTTCACTA
GCTAACCCAG ATGTATCATT TAAATTATTT AATAACAATA AAAAGATTTT AAATACTTAT
GGCAATGGAA AATTAATAGA TGTTATAAGA ACTATTTATG GTAAGTCCAC TGCTGAAAAT
TTAATATATT TTGAAGAGCA TAAGGACACA GCTTCTGTTT ATGGATTTAT AGGAAATGAT
ACCTTAGCAA GAGCCTCTAG AAATAATCAA AGTCTTTTTG TAAATAAGAG ATATGTAAAA
AATAGAAGCT TAACTGTAGC TGTGGAAAAT GCCTTTAGAT CCTTTAATGT TACAGGTAAG
TTTCCATTCT TCGTATTATT TATAGATACT TATCCAGAGC TTATAGATGT TAACATACAT
CCAACAAAAT CTGAAATTAA ATTTAAAGAT GAACGTTTTA TATTCAAGGT AGTCTTTGAT
GCTGTTCATT CAGCTATGAG GGAATATGTA AAAGATACCT TTACTCTTCC AGAAGAAGAG
GAGAAAAAAT TTGAAGCTTT AAAAGAAGAA GTTATTCAGG AAAGCTTAGA TGAGGAAATA
AGTACCTTAG AAAAGTTAAA AGAAAATATA AATTATAAGG TAAGTGAGGA TAGAAAGAAG
GAAGAGATTT ATTCTTATAA TCCCTCTAAG GATTATGAAG CTAAAACAGA GGTTAATATT
CCAGTAGATT TCTTATCAAA AGAAAATCAG GAGGAATCTT TTAGTATTAA TAACTCTTTA
GAAAATAATA ATTTTAAAGA AGGTTCAGCT AAAAGAGAGA TTTCATATGA TCCTATACTT
ATAAAAAATG AACTTAAAGA TAAAGTAAGT GAAAGTACTT CTGAATCACT TGAAAGAAGT
GACTATAAAT GTAATAAGAA TGAATATGGA AATTCCATAG AGGAAATAAT TTATAGAGAA
GCAAAATTCC CTAAGCTAAG AGTTATTGGT CAATTTAATA AAACCTATAT ATTAGCTGAG
TATGATTCTA CTTTATATTT AATAGACCAA CATGCAGCTC ATGAGAAGAT TTTATTTGAG
AAGTATTCTT CAGATATAGC TAAAAAGAGG GTTGAAATTC AGCCTCTAAT GATTCCACTA
GTAGTAACCT TACCTACAGA GGATTATCTT TATTATGATG AAAATAAAGA GATTTTTGAA
AAGGCAGGAT TTAAAATAAG CGATTTTGGT GATAATTCTA TAAGAATTGA AGAGGTACCA
TACTTTTTAG ATAAATTAAA TCCAACAGAG CTAATAACAT CTATGATAAA TAACTTGAAG
AAAATGGGTA CTGGAGAAAC TGTAGAGGTT AAATATAATA AAATAGCATC TATGTCCTGT
AGGGCGGCAG TTAAGGCTAA TGATGTTTTA AGCATATTAG AAATGGAAAA CTTAATAGAG
GATTTAAGAT ACATAAATGA TCCTTTCCAC TGTCCACATG GACGTCCAAC TATAATTAAA
TTTACCAGTT ATGAATTAGA TAAGAAGTTT AAAAGAATAA CTTAA
 
Protein sequence
MNRINILNAD TANKIAAGEV VERPSSVVKE LVENSLDAGA KNITIEIQNG GESLIKIIDD 
GSGVHPEDVE KAFNPHATSK IKDTYDIFSI NTLGFRGEAL PSIASIARVD FKSKVSDFDM
GKELVISGGE KESLTDCSMN RGTQIEVRDL FFNVPARKKF LKTTARESAL INDLVNRISL
ANPDVSFKLF NNNKKILNTY GNGKLIDVIR TIYGKSTAEN LIYFEEHKDT ASVYGFIGND
TLARASRNNQ SLFVNKRYVK NRSLTVAVEN AFRSFNVTGK FPFFVLFIDT YPELIDVNIH
PTKSEIKFKD ERFIFKVVFD AVHSAMREYV KDTFTLPEEE EKKFEALKEE VIQESLDEEI
STLEKLKENI NYKVSEDRKK EEIYSYNPSK DYEAKTEVNI PVDFLSKENQ EESFSINNSL
ENNNFKEGSA KREISYDPIL IKNELKDKVS ESTSESLERS DYKCNKNEYG NSIEEIIYRE
AKFPKLRVIG QFNKTYILAE YDSTLYLIDQ HAAHEKILFE KYSSDIAKKR VEIQPLMIPL
VVTLPTEDYL YYDENKEIFE KAGFKISDFG DNSIRIEEVP YFLDKLNPTE LITSMINNLK
KMGTGETVEV KYNKIASMSC RAAVKANDVL SILEMENLIE DLRYINDPFH CPHGRPTIIK
FTSYELDKKF KRIT