Gene Mlg_0571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0571 
Symbol 
ID4270901 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp619294 
End bp621204 
Gene Length1911 bp 
Protein Length636 aa 
Translation table11 
GC content72% 
IMG OID638125313 
ProductDNA mismatch repair protein MutL 
Protein accessionYP_741415 
Protein GI114319732 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.707368 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.00000000963067 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGTCAGGC CCATACAGCA ACTGCCCCTG CAGCTCGTCA ACCAGATCGC CGCCGGCGAG 
GTGGTGGAGC GCCCGGCGTC CGTGCTCAAG GAGCTGGTGG AGAACAGCCT GGATGCCGGT
GCCCGGAGCC TTGCGGTGGA GTTGGAGCAG GGCGGCAAGC GCCTGATACG GGTGCGCGAT
GATGGCAACG GCATCCCCCG CGAGCAGCTC GGCCTGGCGC TGCGGCGCCA CGCCACCAGT
AAGATCACGT CGCTGAGTGA GCTGGAGCAG GTGGTCAGTC TGGGGTTTCG CGGTGAGGCG
CTGCCGAGCA TCGGTGCGGT CTCCCGGCTC CGGCTGATCT CCCGGCCGCC CGGGGCTGAG
CACGCCTGGG CGGTGCGCAC CGATGGCGAC AGCGAGCCCG CCGGGCCCGA GCCGGCGGCG
CATCCGCCGG GGACCACCGT CGAGGTGCGT GACCTTTTCT TCAACACCCC CGGTCGACGC
AAATTCCTGC GCACCGACCG CACCGAGTTC AGCCATGCCC AGGAGGCCCT GCGGCGCCTG
GCGCTGGGGC GCTTCGACGT GGCCTTCCGG CTGCAGCACA ACGGGCGCAC CGTGCTCGAC
CTGCCGCCGG CCGGTGACCG GGCCGGCGCC GAGCGGCGGC TGGGTGAGCT CCTCGGTGAG
GGCTTCCTGG GTGAGTGTAT CCACCTGGAG TGTGCCGCCG CCGGCCTGAA GCTGAGCGGC
TGGCTGGCGC TGCCCACCTT TTCCCGCAGC CAGGGGGATC TGCAGTATTT CTATGTCAAC
GGCCGCATGA TCCGCGATCG CATGGCCGGC CACGCCCTGC GTCGGGCCTA CGCCGACGTG
CTCTACCGCG ACCGCTTCCC CGCCTACCTG CTCTACCTGG ACCTGGACCC CGACCGGGTG
GACGTGAACG TGCACCCCAC CAAGCACGAG GTGCGCTTCC GCGACAGTCG GTTGGTCTAT
GATTTTCTGT TCCGACAGGT GCGGGAGGCG CTGGCTCGCG TCAGCCCTGC TACGGCGTCC
GGGGTGCAGC CACCGCAGGG GTCGGTGGCG TCGGCCGAGG GGCCGCGCAG CTTGGCCGCT
GCCGCGGGGG AACGGTGGGG TGGGGCCGCA CCAGCCGCTG GCGCCGCGCC AACGGCCCGG
CGATCGCCAC ACCAGCACGG CCTGGGGTTG CCGCTGGAGG AGGCCCGGTT GCTCTATGGC
GAGCGCAGCA AGGCGCACGC GGCCGGCCCG GCCGTCGCGT CTCCCTCCGG CGTCGTCCGG
GACGCACCTG CCGGGGAGGC CGTCGCGTGT GAGACCGGAG GTGCGGGCGA TCCGACGGAA
CGGGGTGGCC CGCCGCTGGG CCACGCCCTG GCCCAGGTCC ATGGGGTCTA CATCCTGGCC
CAGAACGACC AGGGCCTGGT CTTGGTGGAC ATGCATGCCG CCCATGAGCG GGTGGTCTAC
GAACGCATGA AGGCGCAACT CTCGGGCAGT GGCATCGCCA GCCAGGCCCT GCTGATGCCG
GAGGGGCTGA GCGTGACGCC GGCGGAGGGC GAAGAGGTGG AGCGCGCCGG CGAGCGTTTC
CGGCAACTGG GTTTTCAGGT GGACCGGGTG GCCCCGGACC GGGTGCTGGT CCGTGCGGTG
CCGGCCCTGC TGGCCAATGC CGAGCCGGTC GCGCTGGTGC GTGACGTGCT GGCCGACCTC
CGGACCCAGT CGCGTAGCCG CCAGGTGGAG GAGGCGCTGA ACCACGTCCT GGCCACCATG
GCCTGCCACG GTTCCGTGCG CGCCAACCGG CGGCTGACCC TGCCGGAGAT GGACGCGCTG
CTGCGCGAGA TGGAGGCCAC CCCGAACAGC GGCCAGTGCA ACCACGGCCG GCCGACGTGG
ACCGTGCTGG ATATGGACGC CCTGGACCGG CTGTTCATGC GGGGGCAGTG A
 
Protein sequence
MVRPIQQLPL QLVNQIAAGE VVERPASVLK ELVENSLDAG ARSLAVELEQ GGKRLIRVRD 
DGNGIPREQL GLALRRHATS KITSLSELEQ VVSLGFRGEA LPSIGAVSRL RLISRPPGAE
HAWAVRTDGD SEPAGPEPAA HPPGTTVEVR DLFFNTPGRR KFLRTDRTEF SHAQEALRRL
ALGRFDVAFR LQHNGRTVLD LPPAGDRAGA ERRLGELLGE GFLGECIHLE CAAAGLKLSG
WLALPTFSRS QGDLQYFYVN GRMIRDRMAG HALRRAYADV LYRDRFPAYL LYLDLDPDRV
DVNVHPTKHE VRFRDSRLVY DFLFRQVREA LARVSPATAS GVQPPQGSVA SAEGPRSLAA
AAGERWGGAA PAAGAAPTAR RSPHQHGLGL PLEEARLLYG ERSKAHAAGP AVASPSGVVR
DAPAGEAVAC ETGGAGDPTE RGGPPLGHAL AQVHGVYILA QNDQGLVLVD MHAAHERVVY
ERMKAQLSGS GIASQALLMP EGLSVTPAEG EEVERAGERF RQLGFQVDRV APDRVLVRAV
PALLANAEPV ALVRDVLADL RTQSRSRQVE EALNHVLATM ACHGSVRANR RLTLPEMDAL
LREMEATPNS GQCNHGRPTW TVLDMDALDR LFMRGQ