Gene TM1040_0102 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0102 
SymbolmutL 
ID4078687 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp107657 
End bp109591 
Gene Length1935 bp 
Protein Length644 aa 
Translation table11 
GC content64% 
IMG OID638005389 
ProductDNA mismatch repair protein 
Protein accessionYP_612097 
Protein GI99079943 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTACAC TCGACCCCCA AATCAGCGAA AAACCACCGA TTGAGGCGCC GGCCAAACCC 
GCGCGCCCCG TGATCCGGCA ACTGGATGAC GGCGCCATCA ACCGCATTGC GGCTGGTGAG
GTGGTCGAGC GTCCGGCCTC GGCCGTCAAG GAACTTGTGG AGAACGCCAT CGACGCCGGT
GCGACCCGCA TCACGGTAGA GATCGCCGAT GGTGGTAAGA CGCTGATCCG GGTGATCGAC
AATGGCTGCG GGATGACACC AGAGGACCTG CCGTTGGCGC TGTCGCGTCA TGCGACTTCC
AAGATTGATG GCTCTGATCT GTTGAACATT CACACCTTTG GCTTTCGGGG CGAGGCGCTG
CCGAGCCTTG GCGCGGTGGG GCGGCTGGCG ATCACCAGCC GGGCCGAAGG ACATGACGCA
GCCCAGATCC GCGTGTCAGG CGGCCATATG GAGCCTGTGA GGCCCGCGGC GCTCCGGCAG
GGTAGCATCG TGGAGCTGCG CGATTTGTTT TTTGCAACAC CTGCGCGGCT CAAGTTCATG
CGCACCGACC GGGCGGAAAT GCAGGCGATC TCGGACACGG TCAAGCGGCT GGCGATGGCG
GAGCCTTCAG TGGGCTTCAC CTTGCGCGAT GTCTCAGGTG GCGGAGAGGG CCGCGTGACC
TTCCGCGCAG ATCCCATGAA CGGCGATCTC TTTGATGCGT TGCACGGTCG GCTGGCGCAT
GTCATCGGGC GTGAGTTCGC CGAGAACGCC CTGAAAATCG ATGCCACGCG CGAGGGTATC
CGGCTTTATG GCTATGCGGC CCTGCCGACC TATTCGCGCG GTGCGGCAGT GACGCAGTTC
CTGTTTGTGA ATGCGCGTCC GGTAAAGGAC AAGATGCTCA CAGGGGCGCT GCGGGCGGCC
TATATGGATT TCCTCAGCCG GGATCGCCAC CCGGCGGCGG CCCTCTTTAT CGACTGTGAC
CCGACGCTGG TGGACGTGAA CGTACATCCG GCAAAATCCG AGGTGCGCTT CCGTGATCCG
GGGCTGGCGC GCGGGCTGAT CGTCTCGGCG CTGCGCCACG CGCTCGCCGA GGCCGGCCAT
CGTGCCTCCA GCACCGTGGC GGGCGCGACC CTGGGGGCGA TGCGACCGGA ACAGCCGACT
GCGACAGGGG CGCCGAGGGT GTATCAGATG GACCGGCCGT CCTTGGGCGC ACGGCACAGC
GCCTATGCGG CGCAGACCCC CGCTCAGCCG CAGGCACCGT CCTATGCTGC GCCGCCTCCC
TCTGATGCGG TAGGGTTTGC AGAGTTCTCA GGCACTTACA GTGGCCGTCT GGTCGAAGAG
ACACCTCTTG AGGAGGCTCA GCCCCCCGCC GAGGATCAGC CCCTTGGCGC GGCACGCGGG
CAGGTGCATG AGAATTATAT CATCGCCCAG ACCCGCGACG GGATGGTGAT CGTCGATCAG
CACGCCGCCC ATGAACGGCT CGTTTATGAG CGGCTCAAAC GTCAGTTGGC CGAAAATGGC
GTCGCGACCC AAGGTCTGCT GATCCCAGAA ATCATCGAAC TCTCGGATGG GGATTGTGCG
CGCCTGCTGG AAGTTGCCGA GGATCTTGCC AGGTTGGGAC TGGGCATTGA GGCCTTTGGC
GGCAGCGCCG TGGCTGTGCG AGAAACGCCC GCCATTCTGG GCGAGGTCAA TGCCGAGGCT
ATGATCCGCG ACATTCTGGA TGAGCTGGCG GATCAGGGTG AGAGCCAGCT GGTGCAGGCG
CGCCTTGAGG CAATCCTGTC GCGGGTGGCC TGTCATGGCT CGATCCGTTC CGGACGGCGC
ATGCGCGGCG AGGAAATGAA CGCGCTCCTG CGGGAAATGG AACAGACGCC CCATTCCGGC
CAGTGCAATC ACGGCAGACC CACCTATGTG GAGCTCAAAC TCGCGGATAT CGAGCGCCTC
TTTGGGCGCA GCTAA
 
Protein sequence
MATLDPQISE KPPIEAPAKP ARPVIRQLDD GAINRIAAGE VVERPASAVK ELVENAIDAG 
ATRITVEIAD GGKTLIRVID NGCGMTPEDL PLALSRHATS KIDGSDLLNI HTFGFRGEAL
PSLGAVGRLA ITSRAEGHDA AQIRVSGGHM EPVRPAALRQ GSIVELRDLF FATPARLKFM
RTDRAEMQAI SDTVKRLAMA EPSVGFTLRD VSGGGEGRVT FRADPMNGDL FDALHGRLAH
VIGREFAENA LKIDATREGI RLYGYAALPT YSRGAAVTQF LFVNARPVKD KMLTGALRAA
YMDFLSRDRH PAAALFIDCD PTLVDVNVHP AKSEVRFRDP GLARGLIVSA LRHALAEAGH
RASSTVAGAT LGAMRPEQPT ATGAPRVYQM DRPSLGARHS AYAAQTPAQP QAPSYAAPPP
SDAVGFAEFS GTYSGRLVEE TPLEEAQPPA EDQPLGAARG QVHENYIIAQ TRDGMVIVDQ
HAAHERLVYE RLKRQLAENG VATQGLLIPE IIELSDGDCA RLLEVAEDLA RLGLGIEAFG
GSAVAVRETP AILGEVNAEA MIRDILDELA DQGESQLVQA RLEAILSRVA CHGSIRSGRR
MRGEEMNALL REMEQTPHSG QCNHGRPTYV ELKLADIERL FGRS