Gene Moth_0081 details


Gene Information

Locus tag: Moth_0081
Symbol:
ID: 3832690
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 80294
End bp: 83845
Gene Length: 3552 bp
Protein Length: 1183 aa
Translation table: 11
GC content: 60%
IMG OID: 637828013
Product: transcription-repair coupling factor
Protein accession: YP_428963
Protein GI: 83588954
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG1197] Transcription-repair coupling factor (superfamily II helicase)
TIGRFAM ID: [TIGR00580] transcription-repair coupling factor (mfd)
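
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length(80294, 83845)
print(length, protein_length(length))  # 3552 1183
```

Under these assumptions the computed values agree with the listed gene length (3552 bp) and protein length (1183 aa).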


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000883466
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTACAATA ACAGCATATT ACAAATTATC AGAGATAGCG AACAATTTTA CGCCATCGCC 
GGGGGCCTGC GTCGGGGCGC AGGCGAGGTG CAGCTTTATG GCCTGCCGGA AGGGATGAAG
GGCTTGTGGC TGGCGGCTAT GCTGGATGAA TTTAACCCCA TTCTGGTAGT CACCCCCGGG
AGCGAAGAAG CCCAGCGCCT GGCAGCGGAT ATAGAATCCT TCTGGCCGGG GGAGGGCATT
GACTACCTGC CGCCGTCTGA GCTCCTGCCG TTGGAGGTTT ATACCCCCAG CCCGGAACTG
GCCGCCCAGC GCCTGAAGGT CCTGACGAAC CTGGTCAGCG GCCGGACCCG CATCCTTGTG
GTGCCGGTGG AGGACTTGCT CCGCAAGCTG CCGCCGCCGG ATACCCTGCG CCATTCCCTC
CAGTCCCTGG AGGTAGGCCA GATTATTGAC CGCGAGGCCC TTTTGCAGAA ATTAACTGGC
CTGGGGTACA GGCGGGAAGA GGTGGTTGAG GCCCCGGGCC AGCTGGCGGT AAGGGGAGGT
ATTATTGATA TCTTCCCCCT AGGAGCCGAG GAACCGGTGC GTTTAGAATT ATTTGGCGAT
GAAATCGATT CCTTGCGCCG CTTTGACCCC GTCAGCCAGC GCTCGGTGGC CGACCTGCGG
GCTATAGTCG TGGGGCCGGC CCAGGAAGTC CTCCCGCCCC CGGACCTGGG GCCGGGGCTG
GAAACCCTGA AGGCAGAATT TTCCCAGACC TATGCCACCC TGCGCCAGCG CCAGCCCCAG
GCCGCCAGGG AACTGAAGGA CCGGGTCCAG GAGTTGATTG CCATGCTGGA GGCTGGTTCC
TGGCCCGGTG GTAGCAGCCA GCTCCAACCC TTCTTTTACC CCCGGCAGGC CACCCTTTTT
GAATACTTCC AGCGCCAACC CCTGCTGGTC CTTGACGACC CGGCTCGTCT GCTGGAAGAG
ATGCGCCGTC GGGAGCAACA GCGCCTGGGA ATTTTTACCG ATATGCTGGC CGCTGGCCTG
GCTCTGCCCT CCCAGGGCCA GGCCTATCTG GATAGCGCCG ACCTGGAACG GTTGTGGCAG
CGCTACCAGC GCCTGTATTT CTCCCTGCTA CCCCGTCGGG TACCGGGGAG CAATCCCCGG
CCGGCAGTAG GTATCAGTGC CCAGACCATA CCGGCTTTCC AGGGTAAACT CGGCCTGGTG
GTAGAAGAAT TGACCCGCTG GCGCCGCGAG GGGTACCGGA TCATCCTGAT GGTAGCTGAC
CCGAACCGGG TGGTAGCCCT GCGCCAGGCC CTGGCCGAGC AGGGTATCGA GGCCCTGACC
CATCCTGAGG CCAGGGACAC CCTGGGACGG GGAGAGGTAA TAATGGTCTC CGGCCGCCTG
CGCCAGGGCT TTACCTGGCC GGAGATGCGC CTGGCCATTA TCGGGGATAC GGAGATCTAC
GGCCCGATCA GGAGGCCGCG GCGGGTCAAG ACGCCCCGCG AAGGAAGCAA GATTAGCTCC
TTTACTGACC TCAAAGAGGG AGACTATGTT GTCCACGTTC ACCACGGTAT CGGTCGTTAC
CTGGGCCTGC AACAACTGGA CGTAGGCGGG GTTAAAAAGG ACTACCTCCT CATTCAATAT
GCGGGCAAGG ACCGCCTCTA TGTACCTGTG GACCAGGTTT CCCTGGTCCA GAAGTATGTC
GGCGGTGAGG GGCATGTTCC CCGCCTGTAT CGCCTGGGCG GCAATGAATG GAACAAGGTC
AAGAGCCGGG TTCAGGAGGC CGTCCAGGAG ATGGCTCAGG AACTGCTGGA CCTTTATGCC
CGTCGGGAGG CCATCCCCGG GCATGCCTTT GGGCCCGATA CCCCCTGGCA GCGGGAATTC
GAGGAGGCAT TCCCTTATAC GGAAACGCCG GATCAGCTCC GGGCCATCGC CGAGGTTAAG
GCCGATATGG AAAAGCCCCG GCCCATGGAC CGCCTCCTTT GCGGCGACGT GGGCTACGGC
AAGACGGAAG TGGCTATGCG GGCGGCCTTC AAGGCGGTCA TGGATGGTAT GCAGGTGGCT
GTCCTGGTAC CCACCACTAT CCTGGCCCAG CAGCATTACG AGACCTTTAA AGCCCGTTTT
GCCCCTTTTC CGGTAAAAAT CGCCGTATTG AGCCGCTTTT GTTCGCCCAG GGAACAAAAG
GTAGTCGTGG AGGCTTTAAA GCGGGGTGAG GTGGATATAG TTATCGGTAC CCACCGGCTT
CTCTCCAGCG ATGTAAATTT TAAAAACCTC GGCCTGGTGA TTATCGATGA GGAACAGCGC
TTCGGCGTTG CCCATAAAGA AAAGCTGAAG CAATTGCGTT ACAGCGTTGA CGTCCTGACC
ATGACGGCTA CCCCTATACC CCGGACCCTG CATATGTCCC TGGCCGGCGT ACGCGATATG
AGTATGATTG AGACACCGCC CGAAGACCGC TTCCCCGTCC AGACCTATGT GGTGGAGTAT
AACCCGGAGC TGGTGCGGGA GGCCATCCGC CGGGAACTGG ATCGAGGCGG GCAGGTCTTT
ATAGTTCATA ACCGGGTGCA GGATATTGAT CGCTTCGCCT ACCATATTCA GCAACTGGTG
CCGGAAGCCC GGGTCGGCAT CGGCCACGGC CAGATGGGCG AAGAAGAGCT GGAGAATGTG
ATGCTGGACT TTATCTCCGG CCGCTACGAC GTCCTGGTCA GTACCACGAT CGTGGAAAAC
GGTCTGGACA TTCAGAATGC CAATACCCTG ATTGTCGATG AGAGTGATAA CTTCGGCCTG
GCTCAGCTCT ACCAGCTCCG GGGCCGGGTC GGCCGCACCA ACCGCCTGGC CTATGCCTAC
TTCACCTACC GGCCGGACAA GGTACTCGGT GAAATAGCTG AAAAACGCCT GGCCGCTATC
CGGGAGTTTA CGGCTTTCGG TTCGGGGTAT AAGATTGCCC TGCGGGACCT GCAGATCCGG
GGTGCAGGTA ATTTCCTGGG GCCCGAGCAA CACGGCCATA TGGTGGCCGT CGGCTTTGAC
CTCTACTGCC AGCTCCTGGA GGAAGCAGTC CGCAAACTGA AGGAGCAACG TGGCGAGGGA
GTGCCCCGGC CAGCCTTGGC GGAACCCCAG GCCACCCCCA TTGAACTGTC GGTAGATACC
TTCCTGGGTG ATAACTATAT CCCGGAGGCT ACCTTAAAAA TGGAACTCTA CCACCGCCTG
ATGAATGCCG GGGATCTAGC TGCCGTAGAG GATATCGCCG CGGAAATGGA GGACCGTTAC
GGACCGCCGC CGCCGGAAGC CAGGAACCTT CTGGCCCTGA CCAGGGTCCG CATCCTGGCC
CGTGAGGTAG GGGTAATAAG CGTCAACCAG AAAAACCGGG AAGTAGAATT GAGCTTTGGC
CAGCATACCG GCCTGCGGGG AGAAAAACTC CTGCAGCTCA ACCAGTATTT CCCCCGGAAG
CTGGCTTTCT CTTCAGCCGG CGGCCTTACC ATCAGGGTAC GGGTCATGGG TCTGAGCCAG
GAAGAATTAC TGGACCTGCT GGAGAAAGTC CTGACCAGGA TCAAGTACCT GGTAACCGAA
GCAGCAAGTT AG
 
Protein sequence
MYNNSILQII RDSEQFYAIA GGLRRGAGEV QLYGLPEGMK GLWLAAMLDE FNPILVVTPG 
SEEAQRLAAD IESFWPGEGI DYLPPSELLP LEVYTPSPEL AAQRLKVLTN LVSGRTRILV
VPVEDLLRKL PPPDTLRHSL QSLEVGQIID REALLQKLTG LGYRREEVVE APGQLAVRGG
IIDIFPLGAE EPVRLELFGD EIDSLRRFDP VSQRSVADLR AIVVGPAQEV LPPPDLGPGL
ETLKAEFSQT YATLRQRQPQ AARELKDRVQ ELIAMLEAGS WPGGSSQLQP FFYPRQATLF
EYFQRQPLLV LDDPARLLEE MRRREQQRLG IFTDMLAAGL ALPSQGQAYL DSADLERLWQ
RYQRLYFSLL PRRVPGSNPR PAVGISAQTI PAFQGKLGLV VEELTRWRRE GYRIILMVAD
PNRVVALRQA LAEQGIEALT HPEARDTLGR GEVIMVSGRL RQGFTWPEMR LAIIGDTEIY
GPIRRPRRVK TPREGSKISS FTDLKEGDYV VHVHHGIGRY LGLQQLDVGG VKKDYLLIQY
AGKDRLYVPV DQVSLVQKYV GGEGHVPRLY RLGGNEWNKV KSRVQEAVQE MAQELLDLYA
RREAIPGHAF GPDTPWQREF EEAFPYTETP DQLRAIAEVK ADMEKPRPMD RLLCGDVGYG
KTEVAMRAAF KAVMDGMQVA VLVPTTILAQ QHYETFKARF APFPVKIAVL SRFCSPREQK
VVVEALKRGE VDIVIGTHRL LSSDVNFKNL GLVIIDEEQR FGVAHKEKLK QLRYSVDVLT
MTATPIPRTL HMSLAGVRDM SMIETPPEDR FPVQTYVVEY NPELVREAIR RELDRGGQVF
IVHNRVQDID RFAYHIQQLV PEARVGIGHG QMGEEELENV MLDFISGRYD VLVSTTIVEN
GLDIQNANTL IVDESDNFGL AQLYQLRGRV GRTNRLAYAY FTYRPDKVLG EIAEKRLAAI
REFTAFGSGY KIALRDLQIR GAGNFLGPEQ HGHMVAVGFD LYCQLLEEAV RKLKEQRGEG
VPRPALAEPQ ATPIELSVDT FLGDNYIPEA TLKMELYHRL MNAGDLAAVE DIAAEMEDRY
GPPPPEARNL LALTRVRILA REVGVISVNQ KNREVELSFG QHTGLRGEKL LQLNQYFPRK
LAFSSAGGLT IRVRVMGLSQ EELLDLLEKV LTRIKYLVTE AAS
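
The gene and protein sequences above are printed in blocks of ten characters. The sketch below shows one way to strip that blocking, compute a GC fraction, and translate the nucleotide sequence so it can be compared with the listed protein. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code; table 11 differs only in the start codons it permits, and this gene starts with ATG. All function and variable names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (first, second, third base); '*' is stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last rows of the gene sequence above.
first_row = "ATGTACAATA ACAGCATATT ACAAATTATC AGAGATAGCG AACAATTTTA CGCCATCGCC"
last_row = "GCAGCAAGTT AG"
print(translate(first_row))  # MYNNSILQIIRDSEQFYAIA
print(translate(last_row))   # AAS
```

The two translated fragments match the first twenty and last three residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field in the Gene Information section.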