Gene Emin_0215 details


Gene Information

Locus tag: Emin_0215
Symbol:
ID: 6263562
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 231486
End bp: 233312
Gene Length: 1827 bp
Protein Length: 608 aa
Translation table: 11
GC content: 43%
IMG OID: 642610678
Product: DNA mismatch repair protein MutL
Protein accession: YP_001875114
Protein GI: 187250632
COG category: [L] Replication, recombination and repair
COG ID: [COG0323] DNA mismatch repair enzyme (predicted ATPase)
TIGRFAM ID: [TIGR00585] DNA mismatch repair protein MutL
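
The coordinate and length fields above are consistent with each other, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon (the gene sequence on this page ends in TAA). The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no amino acid.
    return gene_length // 3 - 1


length = gene_length_bp(231486, 233312)
print(length)                     # 1827
print(protein_length_aa(length))  # 608
```

Both results match the Gene Length and Protein Length values listed in the record.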


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value: 0.137951
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTAAAA TTAAAATTTT GGAAGAAAGC GTTGCCGCAA GAATAGCCGC CGGGGAAGTG 
ATTGAGCGGC CCGCAGGCGT TTTAAAAGAA CTTTTGGAAA ACGCCGTTGA CGCGGGGGCC
GACACAATTA ATATTGATAT TGACGGGGCG GGAAAAAAAC TTATAAGAGT TAATGACAAC
GGCTCGGGCA TGAGTAAAGA GGATTTAACT TTATCTGTTG TACGCCATTC AACAAGTAAA
ATAAAAAATT TTGAGGATTT GGACTCTTTG GATACTTTTG GTTTCAGGGG CGAGGCACTG
TATTCCGTGG CGGCTGTTTC AAAACTTTCC ATTTCCAGCG CCGAGGAAGG CGGCAGCGGC
AACAAAATTA TTGTTGAAGG CGGTAAATTA ATTTCCGTTT CGCCCTCGCC GAATATAAAA
GGTACAACGG TTGAAGTAAA GGACTTGTTT TATAACACGC CCGCCAGGCT AAAATTTTTA
AAGTCGGATA ATTATGAACG TTCCCTTCTT TTAAAAGTAG TTGAGGAAAG CGCCCTTGCA
AATTTACATG TTTCTTATAA TGTACGTACG GACGGCAGGC TTGTATACTC TTTTTTAGCC
TCAAACGGTG ATTTTAAAAA GACTGTAATC CAAAGAGCGG GGCAAATTTT AGGCGCGGAG
ATAGCTGTCT CTTTAATATC CGTTGAAGAT GAACGCTTTG GTTTTAAAGC TTTTTTAACG
CCGTTAAGCA AACTTACAGC CGTGCGTGAT TTACAATTTT TCTTTATTAA TAAAAGGCCT
TTAACAAGCA AAACTTTACA ACAGGCAGTA TACAAAGCTT ACCATGGCAG GCCTAAAGAC
AGGCACCCCG CTTTTATTGT TTTTATGAAT ATGCCAGCTG CTGACTTTGA CGTTAATATA
CACCCTCAAA AGCGTGATGT GAAATTTGCA CAGGAAAACG CCGTATTCGG TTTTTTAATG
AATGTTACGC AGCGCGCTTT AACCGGCGCG GCCCAGCCTG TTGATATAAA TATTACTCCG
GCGTCTCCGC CCGCGGCTAC TATGGAATTT AGCTTCGCCA AACCCCGCGC GGAAGAACCT
TCATACAAAC CTTTCGGACA AAATATTGTT GAGGAAAATG TTTTTGCCCC TATAAGTAAA
CAGGCGGTTG TAGTAAAAGA TTTTGAGGAC CCTGTTTCTT ATAATCCCGA ACCTGCAGAG
CCTAAAAAAG AAATGGGCGC GCATGTTCAA ACGGACAATC CTTCCTGGTG GCAGGGGCCT
TACAGATTTT TAGGTTCACT ACATAAAAGT TATTTGATTT ACGAAACCGA ACTCGGCCTT
ATGCTTGTTG ACCAGCACGC CGCGCGCGAA AGAGTTTTGT ATGAAGAGTA CCTTAAAAAG
ATGGAAGAAA ATGAGCTCGG CATACAACCC TTAATGTTCC CGGTTACGGT TGATTTGCCC
GCCAGTAATG TTGAAAATTT AATGCTTTGG AAAGACTGGC TTAAAACAGC GGGTTTTGAA
ATTGAGCAGT TTTCACCCAG GACTGTACTG GTAAACGCCG TGCCCAACAT TTTCAGGTTT
AAAGAAGACA GCCTTAAAGA ATTTATAGTA AGCCTGGCCG GTATTGTGGG CGATCCTCTG
AAAAGCTCTG ACGAGCTTAA AAAGAAAACG GTTGCCATGT TAGCCTGCAA GAAATCAATT
AAAGCAAAAG AAGACGTGAG CATGGCGGAG GCCGACGCCT TACTTTTAGA TTTAAAAAGA
TGCCAAGACG GCATGCATTG CCCGCACGGG CGGCCTGTTA TGGTGTCTTT AAGCGCGGCC
GAGCTTACCA AAAAATTTGG CAGATAA
 
Protein sequence
MGKIKILEES VAARIAAGEV IERPAGVLKE LLENAVDAGA DTINIDIDGA GKKLIRVNDN 
GSGMSKEDLT LSVVRHSTSK IKNFEDLDSL DTFGFRGEAL YSVAAVSKLS ISSAEEGGSG
NKIIVEGGKL ISVSPSPNIK GTTVEVKDLF YNTPARLKFL KSDNYERSLL LKVVEESALA
NLHVSYNVRT DGRLVYSFLA SNGDFKKTVI QRAGQILGAE IAVSLISVED ERFGFKAFLT
PLSKLTAVRD LQFFFINKRP LTSKTLQQAV YKAYHGRPKD RHPAFIVFMN MPAADFDVNI
HPQKRDVKFA QENAVFGFLM NVTQRALTGA AQPVDINITP ASPPAATMEF SFAKPRAEEP
SYKPFGQNIV EENVFAPISK QAVVVKDFED PVSYNPEPAE PKKEMGAHVQ TDNPSWWQGP
YRFLGSLHKS YLIYETELGL MLVDQHAARE RVLYEEYLKK MEENELGIQP LMFPVTVDLP
ASNVENLMLW KDWLKTAGFE IEQFSPRTVL VNAVPNIFRF KEDSLKEFIV SLAGIVGDPL
KSSDELKKKT VAMLACKKSI KAKEDVSMAE ADALLLDLKR CQDGMHCPHG RPVMVSLSAA
ELTKKFGR
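
The sequences above are printed in 10-character blocks separated by spaces and line breaks, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field. The helper names are hypothetical, and only the first line of the gene sequence is used as sample input, so the printed percentage describes that line and not the whole gene.

```python
def clean_sequence(block_text: str) -> str:
    """Join a block-formatted sequence into one uppercase string without whitespace."""
    return "".join(block_text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence from this record, as sample input only.
first_line = "ATGGGTAAAA TTAAAATTTT GGAAGAAAGC GTTGCCGCAA GAATAGCCGC CGGGGAAGTG"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(round(gc_percent(seq), 1))  # GC percentage of the first 60 bases only
```

Passing the full gene sequence through `clean_sequence` and `gc_percent` would give a value to compare against the listed GC content.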