Gene Apre_0949 details


Gene Information

Locus tag: Apre_0949
Symbol:
ID: 8397735
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerococcus prevotii DSM 20548
Kingdom: Bacteria
Replicon accession: NC_013171
Strand:
Start bp: 1013136
End bp: 1014971
Gene Length: 1836 bp
Protein Length: 611 aa
Translation table: 11
GC content: 28%
IMG OID: 644995296
Product: DNA mismatch repair protein MutL
Protein accession: YP_003152698
Protein GI: 257066442
COG category: [L] Replication, recombination and repair
COG ID: [COG0323] DNA mismatch repair enzyme (predicted ATPase)
TIGRFAM ID: [TIGR00585] DNA mismatch repair protein MutL
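
The coordinate and length fields above are related by simple arithmetic, which can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive and that the CDS is complete and ends in a single stop codon; the function names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 1013136
END_BP = 1014971
GENE_LENGTH_BP = 1836
PROTEIN_LENGTH_AA = 611


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes a complete CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1836
    print(expected_protein_length(GENE_LENGTH_BP))  # 611
```

Under these assumptions both computed values agree with the listed Gene Length and Protein Length.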


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGACTATAC TAAAACTAGA TGATAAAACA ATTGAAAAAA TTGCTGCTGG AGAAGTAATT 
GAATCACCAG TATCTATTGT AAAAGAACTA GTAGAAAATT CTATAGATGC AGGAGCTGAT
AGGATAACAG TTGAGATTAA AAATGGTGGC AAGACTTATA TAAGAGTCAC AGATAATGGA
TCAGGTATAA GTAACGAAGA AATCGAACTT GCTTTTAGTA AGCACGCCAC ATCTAAAATT
CGTGACTTCA ATGATTTATA TGATATATAC TCGCTAGGAT TTAGGGGTGA GGCCTTAGCT
AGTATAGTAA GTGTCAGTGA AGTTACTGCA ATAAGCAAAA CAAAAGATGA ACTTGTAGGA
AGTAAAATTA ATTTTAACAA TGGTGATAAA ACTCTTACCA GTATTGCCAC AAATGTTGGA
ACTAGCATCA TAGTTGAGGA TTTATTTAGA GATATTCCTG TAAGAAGGAG ATTTCTTAAG
TCAGATATTG TTGAATCTAA TCATATAAGC AAACTTATGT ATGCCATAGC AATTGGATAT
CCTGAAATCT CATTTAAGTT TATCAAAAAC GAAAATATTG AATTTTCTAC AATAAAAAAT
GAAGCTTTGA AAATGAGAAT AGCTAAACTT TTGGATGATA AGTTAGAGGA CCATCTAATA
AAAATTTCTG ATAACAACGA CATCTATCAA ATCGAAGGAT TTATATCTAA TAATAATTAT
TATAGGGGAA ATAGGTCTTT GCAATATATT TATATTAACA ATAGACTTGT TGAAAGTGAA
TTGATAAGAG ATAGGATAGA GATGGCCTAC AGGGGAAATA TACCAAACGG AAGATTTCCG
GTTTTCTTTC TATATATAAA GACCAATCCT AAAAATATAG ATGTCAACGT CCATCCGAAC
AAGAGAGTGA TTAAGTTTTC TTATGAAGAC CAATTGATTA ATCTTATAGA TTCAAGCATT
AGTAGGTATA TAAATGCTTC TAATGGAGTA AAAGAAGTTA GTATTGAAGA AAATAATAAG
AATGATTTAA TGGATTTTTC AGATTATTCT CAAATACTAA ATAACTATAA TAATAGTAAA
TCTCTTGTAA GAGAAAGTTC ATTTGACAAT TTGTATGAAT CTGAAAATAA TGATTCTAAC
AAAGAAGAAG CTAAAGACTT TTTTGATAGT AATATAGACA TAAGTTTTAA GGAAAAATCA
AACTATGAAG AATGTCTAGA AAATAACAAT ACTGGGCTAG AAGAAGAAAT CGAAGAAAAT
AGTTACATTA AAGATTTTGA ATATCTGAAC TATAAGTGTT CAATATTTGC TAGGTATTCT
ATATTTGAAA GAAAAGATAA ACTTTTCATA CTAGATCATA GGAGAGCTAG CGAAAAGATT
AATTTTAGTA GATTCTTAAA TCAGTTTGAA AATAAAACTA TGGATGAACA AATTCTTTTA
AATCCATTAA TAATTAACTT GAATAAATTT GATATTGACA GGTTCATTGA AAAGAAAGAT
GTAATAAATA GACTAGGATT CGATGCAGAA ATAATTGGAG ATAGGTCAAT AATTATTCGC
TCAGTTCCTT TCATCTTTTC TGTACCTGAA GATGATAAGT TCTTCTACGA CTTATTAGAC
CTTGATTATA GCAAGGATAC AGATTATTTG TATAAAAAAC TTAAGAAATT AAATCTCTCT
TTATCTTTTA GAAAAGGAGA TAAGATAAAT GAAGCTGAGG CTTATGAACT TATTGGAAAC
ATAAAAGAAC TAGATAATCC ATATACAACT TATGACGGGA AAGCAGTTCT TATAGAGATA
AATGAAAAAG ATGTGGAGAA ATATTTTGAA AGATAG
 
Protein sequence
MTILKLDDKT IEKIAAGEVI ESPVSIVKEL VENSIDAGAD RITVEIKNGG KTYIRVTDNG 
SGISNEEIEL AFSKHATSKI RDFNDLYDIY SLGFRGEALA SIVSVSEVTA ISKTKDELVG
SKINFNNGDK TLTSIATNVG TSIIVEDLFR DIPVRRRFLK SDIVESNHIS KLMYAIAIGY
PEISFKFIKN ENIEFSTIKN EALKMRIAKL LDDKLEDHLI KISDNNDIYQ IEGFISNNNY
YRGNRSLQYI YINNRLVESE LIRDRIEMAY RGNIPNGRFP VFFLYIKTNP KNIDVNVHPN
KRVIKFSYED QLINLIDSSI SRYINASNGV KEVSIEENNK NDLMDFSDYS QILNNYNNSK
SLVRESSFDN LYESENNDSN KEEAKDFFDS NIDISFKEKS NYEECLENNN TGLEEEIEEN
SYIKDFEYLN YKCSIFARYS IFERKDKLFI LDHRRASEKI NFSRFLNQFE NKTMDEQILL
NPLIINLNKF DIDRFIEKKD VINRLGFDAE IIGDRSIIIR SVPFIFSVPE DDKFFYDLLD
LDYSKDTDYL YKKLKKLNLS LSFRKGDKIN EAEAYELIGN IKELDNPYTT YDGKAVLIEI
NEKDVEKYFE R
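
The gene and protein sequences above can be cross-checked by translating the DNA. The following is a minimal sketch, not the database's own procedure. It relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code (the two differ only in which codons may act as start codons), and the helper names `clean` and `translate` are illustrative.

```python
from itertools import product

# Codon assignments shared by the standard code and translation table 11,
# listed in T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First line of the gene sequence above.
    first_line = ("ATGACTATAC TAAAACTAGA TGATAAAACA "
                  "ATTGAAAAAA TTGCTGCTGG AGAAGTAATT")
    print(translate(first_line))  # MTILKLDDKTIEKIAAGEVI
```

The printed residues match the first 20 residues of the protein sequence; passing the full gene sequence to `translate` allows the same comparison over the whole protein.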