Gene Moth_0640 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0640 
Symbol 
ID3832036 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp667852 
End bp670770 
Gene Length2919 bp 
Protein Length972 aa 
Translation table11 
GC content57% 
IMG OID637828581 
Producthelicase-like 
Protein accessionYP_429511 
Protein GI83589502 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0553] Superfamily II DNA/RNA helicases, SNF2 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.060382 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAGCCCG GAATTCTGGT CCGCAGCACC ATTTATCCGG AAAAAGGGAT AGGCTTGGTC 
CTGGGGAATG AGGAGTTTTT CGACCAGGTC TACGTGCACG TTTTCTTTGA AAAGACCAGG
GAGAGGCTCA CCCTGCCCCT GGCCGACTTA AGTCCGCTCC ATGATCCGCT GGCCAAAATG
GAAGCAGGCA GCTTTTCCAC CGCTTCCCGC TTCCAGCTGC GCTGGCTGGT GGAGCAAATC
CTGGCAGAAA ATTCCGGGGA GGGCCTTTTG GCCGCCGGGG GCTTCAAGAT TATCCCCCTG
CCCCACCAGC TCCTGGCAGT AAGCTTCGTC CTCGACCAGT TCAAGCCGCG CGTCCTGATC
GCCGACGAGG TCGGCCTGGG CAAGACCATT GAGGCTGCCC TGATCTACGA GGAACTGAAG
GCCAGGGGTA TGGTAAAAAG GGTGCTGGTG GTAGCGCCGT CGGGGCTTTG CCTCCAGTGG
CGGGAGGAAA TGAAAACCAA GTTCGGCGAA GACTTTATCA TCTACGATCG CAGCACCGTC
CATTCTCTGA AACAGCTTCA CGGGGAGATG ACCAACGTCT GGACCTTGGC CGACCGGGTG
ATTACTTCCC TGGACTTCAT CAAGCCCAAA AAGATTACCG CCGACCTGGA CGAGCGTGCA
GCACGTGCCC GCCGCTGGCA CAACGAGCAG GTTTTTGCGG CCGCAGCCGC CGCCTGGTTC
GACATGGTAA TCTTCGACGA AGCCCATAAA CTTACCAAAG ACATGACCGG CGAGGAAACG
GCCCGCTATA AGGCAGGTCA TGCCCTGGTC CAAGCGGCGC CCATAGTGCT TCTCCTTACT
GCCACCCCCC ACCAGGGGGA CCAGCACAAG TTCCGCAACC TGCTCCGGCT CATAGACCCT
TATTTATTCT CCGGGGAAGG CCGGATTACG GCTGAAGACG TTAAAAAGGT AACGGTGCGC
AACAATAAGC GGGCGGTGGT TGATTTCCAC GGGAACCGCC TTTTCAAGCA GCGGGTGGCC
ACAGTGTGCC TGATCCACCG GGACGAGGTG GCGGACCAGG TGGAGCTGGA CCTCTACCGG
GCGGTGACCG ATTATGTAAC TACTTTCTAC GAACTGGCCC GGCAGCAGAA CAATTTCACC
ATGATGTTCC TGCTGCTCAT TTACCAGCGC ATGGTAAGCA GCAGCTCCCC CGCCATTCTG
AAGTCCTTAT CCGCCCGTCT CGCGGCCCTG GAGGAGCTGC GCCGCCGTGC AGCCGACCAG
GAGCCAGAGA GCGAGAGGGA AGAACCCGAC TGGGACGACC TGCAGGAACT GACGGCGGAG
GAGCAGTTGG CCGAACTAAC GCGGGCCAGC GCTGCCCCGC GGGCCGGTAT CGTTATCGTA
CCGGCCGCCT TGGCGGCCGA GATCGCGGCC CTGAAAAAAT GCTTGGCCCT AGCGGAGCGA
GCCACAGCCG GCCGCAACGA TATCAAGTTC ACCAGGCTTC TGGAAATCAT TAATGAACTC
AGAATCCAGG AAAACAATCC CCGCCTAAAA TTTATCATCT TTACCGAGTT TAGGGAGACG
CAAGCTTACT TAGAGGAGCG CCTGACCAGC TTGGGCTACC GGACGGCGCT CATTAATGGT
GCCATGTCCA CCACCGAGCG CATTGCCCAG GTGGAGCGCT TCCGCCGCGA GGCGGATTTC
CTCATTTCTA CCGATGCTGG CGGCGAGGGC ATAAACCTGC AGTTCTGCCA TATCTTGATC
AACTACGATC TGCCCTGGAA CCCCATGCGC TTGGAGCAGC GCATCGGCCG CATTGACCGC
ATCGGCCAGG AACATGACGT TAAAGTGATC AATCTACAAC TGGCGGACAC GGTGGAGAAC
CGGGTGCGGG AGGTAATCGA AAACAAGCTG GACACCATCC GCAGGGAGTT TTGCGCTGGC
GAAGATAAGC TGGCTGACAT TCTAGGGGTC TTGCAAGATG AATTCGATTT TGAGAAGGTG
TATATCGAGG CCTTGCTCAA GCAGGGCCGC AAGGCAGCCA ACCTGGATGC ATTATCCTGG
CAGATTTTTG AACGAGCCCG GGAAATCGTT GAGGAAGAAA GATTAGCTCT GCCTATCTCC
AATTTGGCCC CTGAATATGT TTTAGCGTCG CAGCGAGATT TGGAGAAGAG AGCCAAGAGG
GTGCAAAGGC TGGTAGAGCA ATACCTGCAG GTTTACGGCG CCAGCCTGCA CCCGTACAAG
CTGAGAGAGG GCGTTTACTA CTTTCAGGAC CCCAGGAGCG GCAGGCGCCT GCATAACGTG
ATCTTCCAGC AGAAATATGC CCTGGCCAAT GAGGGGGCCG AGCTTTTGAG TTTCCAGCAC
CCCTATATGG TGGAACTGTT AGCCCACCTG GAGGATGCCC TGCGGGAGGA TACGTCGGCA
AAGCTTTTGG TACGTGAGAG AAAGTTCAGC GGGGAAAAAG GGTTTCTGTT TATATACCGG
CTGACCCTCA CAAACTACTT GGACCCTACT GTTTACTATC TGGTACCTTG TTTTGTTAGC
TTTGCTGGGG ATACGGGGCG GGTAAACGGC AGAATATCCC GCTATTTTCG CGATTGGGAG
CAGCTCATCT GCACCGACCT GGTAACCGGA GAGATACCGT ATAATCTAAA GGAGGCCTGG
CAACTGGCCC GGAAGGCTGT GCAGCAGGAA GCAGAGGTTC TCTTCTTTCA GGCAAAGGAA
CGTTTGGAGA AGAGGCTGCG GGATGAAGAG GAAAAGTTTG AGAAATACTA CAAGGACCGG
GAGGCGGCCA TCGAGAAGAT TGCCGTGGAC AATATCCGCG CAGCCAAGAA AAAAGAGCTG
GAAGAGGACC GAAAGACTAG GCGGCAAGAA TGGTTGCGCC GCCGGCAGCT GGTGCCCAGC
CTAAGCCTAG AGCAAGTTGC CTACGTGGAG TTTGCATGA
 
Protein sequence
MQPGILVRST IYPEKGIGLV LGNEEFFDQV YVHVFFEKTR ERLTLPLADL SPLHDPLAKM 
EAGSFSTASR FQLRWLVEQI LAENSGEGLL AAGGFKIIPL PHQLLAVSFV LDQFKPRVLI
ADEVGLGKTI EAALIYEELK ARGMVKRVLV VAPSGLCLQW REEMKTKFGE DFIIYDRSTV
HSLKQLHGEM TNVWTLADRV ITSLDFIKPK KITADLDERA ARARRWHNEQ VFAAAAAAWF
DMVIFDEAHK LTKDMTGEET ARYKAGHALV QAAPIVLLLT ATPHQGDQHK FRNLLRLIDP
YLFSGEGRIT AEDVKKVTVR NNKRAVVDFH GNRLFKQRVA TVCLIHRDEV ADQVELDLYR
AVTDYVTTFY ELARQQNNFT MMFLLLIYQR MVSSSSPAIL KSLSARLAAL EELRRRAADQ
EPESEREEPD WDDLQELTAE EQLAELTRAS AAPRAGIVIV PAALAAEIAA LKKCLALAER
ATAGRNDIKF TRLLEIINEL RIQENNPRLK FIIFTEFRET QAYLEERLTS LGYRTALING
AMSTTERIAQ VERFRREADF LISTDAGGEG INLQFCHILI NYDLPWNPMR LEQRIGRIDR
IGQEHDVKVI NLQLADTVEN RVREVIENKL DTIRREFCAG EDKLADILGV LQDEFDFEKV
YIEALLKQGR KAANLDALSW QIFERAREIV EEERLALPIS NLAPEYVLAS QRDLEKRAKR
VQRLVEQYLQ VYGASLHPYK LREGVYYFQD PRSGRRLHNV IFQQKYALAN EGAELLSFQH
PYMVELLAHL EDALREDTSA KLLVRERKFS GEKGFLFIYR LTLTNYLDPT VYYLVPCFVS
FAGDTGRVNG RISRYFRDWE QLICTDLVTG EIPYNLKEAW QLARKAVQQE AEVLFFQAKE
RLEKRLRDEE EKFEKYYKDR EAAIEKIAVD NIRAAKKKEL EEDRKTRRQE WLRRRQLVPS
LSLEQVAYVE FA