Gene Moth_1114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1114 
Symbol 
ID3833246 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1142325 
End bp1144187 
Gene Length1863 bp 
Protein Length620 aa 
Translation table11 
GC content60% 
IMG OID637829042 
ProductDNA mismatch repair protein MutL 
Protein accessionYP_429971 
Protein GI83589962 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000422632 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACAGGGA GCCAGGAGAC CAGAATTACG ATCCTGGATG CTATGACTGC CAACCAGATA 
GCCGCCGGGG AAGTGGTAGA GAGGCCGGCT TCAGTGGTGA AGGAGCTGGT AGAAAACTCC
CTGGACGCAG CGGCCCGGCA CATTACCGTG GAGATTGAGG GCGGCGGCCT ACAGCTCATC
CGCGTCCGGG ATGACGGCAG GGGGATAGAG CCTGAAGATG CCCCCCTGGC CTTTGCCCGC
CACGCCACTA GTAAAATTCG CCGGGCCGCG GACCTGGCGC GGATTACCAC CCTCGGTTTC
CGGGGGGAGG CCCTGGCTAG CATCGCCGCC GTAGCCAGGG TGGAGATGGC CACCCGCCCC
CCGGGAAGAC CGGGCGGCAC CCTGGTACGG GTGGCCGGAG GCAAGCCTCC GGAGGTCACG
GAAACCGGCT GCCCCCCCGG GACCTCAGTT ACAGTAAAGG ACCTGTTTTA TAATACCCCG
GCCCGGCGCC AGTATTTAAA GAAACCTTCT ACAGAAGCCA GGGCGATTGT AGCCACGGTA
GAAAGGCTGG CCCTGGGGCA CCCCGGCGTG GCCTTCTCTT TGAGCCTCGA CGGTAGGCGT
TCCCTGGCTA CCCCGGGTAA CGGCGACCTG CAGGCCGTCC TGGCAGCCCT TTATGGCCTG
GAGATTGGTC GTGAGTTGCT GCCCTTTAAC GGCTCCGGCG CCGGCTGGAG TTTGCACGGT
TTTACCTCGC CGCCATGGCT CCACCGTTCC AACCGGGATC AGCAGGTGCT GCTAATTAAT
GGCCGCTATA TTACCAACCG GCTCCTCACC TGGGCGATCG AGAGCTGTTA TCGGAATGTA
ATTCCAGCCG GCCGCCACCC CCTTTTTGTC CTTCATCTGG CAGTAGACCC GGGTGAGGTG
GATGTCAATG TTCACCCGGC TAAACTGGAG GTCCGCCTGC AGAGGGAGCA GGACCTGGCC
CGGCAGGTGA CAAACCTTGT TAAAGGGGCT CTTTTTACTC CCAGGGCTGT CGCCCCGGCA
ACCATTTCCC GTTCCGGGGA TAGGAAAGGC GCTGGGTCCG CACCACCGGT GCAGCAGGGT
TTTACCTTCC GGGAACCGGA CAAGCAGGCC CGCTATTGGG GTGAGTATGT ACTAAGGGAA
AGAGCCCGGG AAAACCGGGA ACCGGAATGG CCGGAAAAAA CAGGGGAGAA TACCGGGGCA
ATCAAGACGC GGGAGGTACC AGAAGGAAAC GGCCCGGTAG AGAGGGATAA ACCCGACCCT
ATCGGGCCAG AAACGCCGGC AGAAGAAACG GGCAAGCAGG TCCTACCACC TTTGCGGGCC
CTGGGCCAGG TTTTTAATAC CTATATCCTG GCGGGGGGCG AAGACGGCCT GTATATAATT
GACCAACATG CGGCCCATGA GCGCTGCCGT TATGAAGCCC TGGTAAAAGA GGGGACGCCT
GGAAGTCACC CGGCCCAGAT GCTGGAACCG CCCTTACCCC TGCATCTGGC CCCGGATATG
CAAGTCAAGC TTATTGATCA GATAATAACC CTGCGGGAAC TGGGTTTTAT CATCGAGGAA
TTCGGAACCG GCGTCTTTTT ATTACGCTCG GTTCCCCTGG GAATTCCTCC AGGTAAAGAA
AGGGAGGTCT TAGAGGATTT CCTGGCGGAA AGCACCCTCC CGGCGCCGGA AAGGCTCTTG
AAGTTAATTG CCTGTCACGG GGCAATCAAA GCCGGGCAAT CCCTGGCAGG GGCCGAGATG
CAAAAACTCC TCGATGACCT GCGGGGGGTT GACCATCCCT ATACCTGCCC CCACGGCAGG
CCGGCGGTAG TACGCCTGGA TGAGGCCCAG CTAGCGCGGT ATTTTCACCG ACACTTAAAG
TGA
 
Protein sequence
MTGSQETRIT ILDAMTANQI AAGEVVERPA SVVKELVENS LDAAARHITV EIEGGGLQLI 
RVRDDGRGIE PEDAPLAFAR HATSKIRRAA DLARITTLGF RGEALASIAA VARVEMATRP
PGRPGGTLVR VAGGKPPEVT ETGCPPGTSV TVKDLFYNTP ARRQYLKKPS TEARAIVATV
ERLALGHPGV AFSLSLDGRR SLATPGNGDL QAVLAALYGL EIGRELLPFN GSGAGWSLHG
FTSPPWLHRS NRDQQVLLIN GRYITNRLLT WAIESCYRNV IPAGRHPLFV LHLAVDPGEV
DVNVHPAKLE VRLQREQDLA RQVTNLVKGA LFTPRAVAPA TISRSGDRKG AGSAPPVQQG
FTFREPDKQA RYWGEYVLRE RARENREPEW PEKTGENTGA IKTREVPEGN GPVERDKPDP
IGPETPAEET GKQVLPPLRA LGQVFNTYIL AGGEDGLYII DQHAAHERCR YEALVKEGTP
GSHPAQMLEP PLPLHLAPDM QVKLIDQIIT LRELGFIIEE FGTGVFLLRS VPLGIPPGKE
REVLEDFLAE STLPAPERLL KLIACHGAIK AGQSLAGAEM QKLLDDLRGV DHPYTCPHGR
PAVVRLDEAQ LARYFHRHLK