Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1114 |
Symbol | |
ID | 3833246 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1142325 |
End bp | 1144187 |
Gene Length | 1863 bp |
Protein Length | 620 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637829042 |
Product | DNA mismatch repair protein MutL |
Protein accession | YP_429971 |
Protein GI | 83589962 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0323] DNA mismatch repair enzyme (predicted ATPase) |
TIGRFAM ID | [TIGR00585] DNA mismatch repair protein MutL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.000422632 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGACAGGGA GCCAGGAGAC CAGAATTACG ATCCTGGATG CTATGACTGC CAACCAGATA GCCGCCGGGG AAGTGGTAGA GAGGCCGGCT TCAGTGGTGA AGGAGCTGGT AGAAAACTCC CTGGACGCAG CGGCCCGGCA CATTACCGTG GAGATTGAGG GCGGCGGCCT ACAGCTCATC CGCGTCCGGG ATGACGGCAG GGGGATAGAG CCTGAAGATG CCCCCCTGGC CTTTGCCCGC CACGCCACTA GTAAAATTCG CCGGGCCGCG GACCTGGCGC GGATTACCAC CCTCGGTTTC CGGGGGGAGG CCCTGGCTAG CATCGCCGCC GTAGCCAGGG TGGAGATGGC CACCCGCCCC CCGGGAAGAC CGGGCGGCAC CCTGGTACGG GTGGCCGGAG GCAAGCCTCC GGAGGTCACG GAAACCGGCT GCCCCCCCGG GACCTCAGTT ACAGTAAAGG ACCTGTTTTA TAATACCCCG GCCCGGCGCC AGTATTTAAA GAAACCTTCT ACAGAAGCCA GGGCGATTGT AGCCACGGTA GAAAGGCTGG CCCTGGGGCA CCCCGGCGTG GCCTTCTCTT TGAGCCTCGA CGGTAGGCGT TCCCTGGCTA CCCCGGGTAA CGGCGACCTG CAGGCCGTCC TGGCAGCCCT TTATGGCCTG GAGATTGGTC GTGAGTTGCT GCCCTTTAAC GGCTCCGGCG CCGGCTGGAG TTTGCACGGT TTTACCTCGC CGCCATGGCT CCACCGTTCC AACCGGGATC AGCAGGTGCT GCTAATTAAT GGCCGCTATA TTACCAACCG GCTCCTCACC TGGGCGATCG AGAGCTGTTA TCGGAATGTA ATTCCAGCCG GCCGCCACCC CCTTTTTGTC CTTCATCTGG CAGTAGACCC GGGTGAGGTG GATGTCAATG TTCACCCGGC TAAACTGGAG GTCCGCCTGC AGAGGGAGCA GGACCTGGCC CGGCAGGTGA CAAACCTTGT TAAAGGGGCT CTTTTTACTC CCAGGGCTGT CGCCCCGGCA ACCATTTCCC GTTCCGGGGA TAGGAAAGGC GCTGGGTCCG CACCACCGGT GCAGCAGGGT TTTACCTTCC GGGAACCGGA CAAGCAGGCC CGCTATTGGG GTGAGTATGT ACTAAGGGAA AGAGCCCGGG AAAACCGGGA ACCGGAATGG CCGGAAAAAA CAGGGGAGAA TACCGGGGCA ATCAAGACGC GGGAGGTACC AGAAGGAAAC GGCCCGGTAG AGAGGGATAA ACCCGACCCT ATCGGGCCAG AAACGCCGGC AGAAGAAACG GGCAAGCAGG TCCTACCACC TTTGCGGGCC CTGGGCCAGG TTTTTAATAC CTATATCCTG GCGGGGGGCG AAGACGGCCT GTATATAATT GACCAACATG CGGCCCATGA GCGCTGCCGT TATGAAGCCC TGGTAAAAGA GGGGACGCCT GGAAGTCACC CGGCCCAGAT GCTGGAACCG CCCTTACCCC TGCATCTGGC CCCGGATATG CAAGTCAAGC TTATTGATCA GATAATAACC CTGCGGGAAC TGGGTTTTAT CATCGAGGAA TTCGGAACCG GCGTCTTTTT ATTACGCTCG GTTCCCCTGG GAATTCCTCC AGGTAAAGAA AGGGAGGTCT TAGAGGATTT CCTGGCGGAA AGCACCCTCC CGGCGCCGGA AAGGCTCTTG AAGTTAATTG CCTGTCACGG GGCAATCAAA GCCGGGCAAT CCCTGGCAGG GGCCGAGATG CAAAAACTCC TCGATGACCT GCGGGGGGTT GACCATCCCT ATACCTGCCC CCACGGCAGG CCGGCGGTAG TACGCCTGGA TGAGGCCCAG CTAGCGCGGT ATTTTCACCG ACACTTAAAG TGA
|
Protein sequence | MTGSQETRIT ILDAMTANQI AAGEVVERPA SVVKELVENS LDAAARHITV EIEGGGLQLI RVRDDGRGIE PEDAPLAFAR HATSKIRRAA DLARITTLGF RGEALASIAA VARVEMATRP PGRPGGTLVR VAGGKPPEVT ETGCPPGTSV TVKDLFYNTP ARRQYLKKPS TEARAIVATV ERLALGHPGV AFSLSLDGRR SLATPGNGDL QAVLAALYGL EIGRELLPFN GSGAGWSLHG FTSPPWLHRS NRDQQVLLIN GRYITNRLLT WAIESCYRNV IPAGRHPLFV LHLAVDPGEV DVNVHPAKLE VRLQREQDLA RQVTNLVKGA LFTPRAVAPA TISRSGDRKG AGSAPPVQQG FTFREPDKQA RYWGEYVLRE RARENREPEW PEKTGENTGA IKTREVPEGN GPVERDKPDP IGPETPAEET GKQVLPPLRA LGQVFNTYIL AGGEDGLYII DQHAAHERCR YEALVKEGTP GSHPAQMLEP PLPLHLAPDM QVKLIDQIIT LRELGFIIEE FGTGVFLLRS VPLGIPPGKE REVLEDFLAE STLPAPERLL KLIACHGAIK AGQSLAGAEM QKLLDDLRGV DHPYTCPHGR PAVVRLDEAQ LARYFHRHLK
|
| |