Gene Moth_1918 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1918 
Symbol 
ID3830842 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1988691 
End bp1990499 
Gene Length1809 bp 
Protein Length602 aa 
Translation table11 
GC content60% 
IMG OID637829851 
ProductDEAD/DEAH box helicase-like 
Protein accessionYP_430761 
Protein GI83590752 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG1061] DNA or RNA helicases of superfamily II 
TIGRFAM ID[TIGR00603] DNA repair helicase rad25 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0105236 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGACA TGTCAGACGC TCCCCTGGTA GTCCAGAGCG ACAGGACCAT TCTGCTGGAG 
GTTGATAACC CCCTTTACCC GGAGGCGCGG GACGCCCTGG CCCGTTTTGC CGAACTGGTG
AAGAGTCCCG AACATATTCA CACCTATCGC CTGACGCCCC TTTCCCTCTG GAACGCGGCT
GCCGGCGGGC TGGATGCGGC CACTATCATC CAGGTGTTGG CGGATTACAG CAAGTATCCC
CTCCCTGCCA ATGTGGTTGC CGATATCCGG GAATATGTCG GCCGTTACGG CAAAGTCAAG
CTGGTAGCCC GGGGAACTGG ATTGCGGCTG GTCACAGCCG ACCCGGGGAT AGCCGCGGAA
ATCTCGAATA ACAAGCGCAT CCAGCCGTAT ATCAAGGAGC GTCAAGATGC CTGCACCCTG
GCCATCGACC CCTGGCAGCG GGGACCGGTC AAACAGGCCC TGATCAAGAT CGGCTATCCG
GTGGAAGACT TGGCTGGCTA TATTCCCGGG GCACCATTAC CATTTAGCCT GCGCGAAAGG
ACTTTGAACG GGGAGACCTT CAGCCTGCGC CCTTACCAGG CGGAGGCGGC GCGGGTCTTT
TATGCCGGAG GTAGTTCCCG GGGCGGGAGC GGGGTAATCG TCTTACCCTG CGGTGCGGGT
AAGACCGTTG TCGGTATTGC CGCCATGGCC CTCTGCCAGT GTTACACTTT AATCCTGGTG
ACCAGCGTCA CGGCTGCCCG GCAGTGGCTG GCGGAGATCC GGGATAAGAC GGACCTGCCC
CCGGAGATGC TGGGCGAATA TACCGGGGAG AAAAAGGAAA TAAAGCCTGT GACCGTGGCT
ACCTATCAAA TCATCACTCA CCGCCGCCGG CGCAACGAGG ACTACCCCAA TTTCCAGCTT
TTCAACCAGC AGGACTGGGG CTTGATAATT TACGACGAAG TCCACCTGTT GCCGGCCCCC
ATTTTCCGCA TTACGGCCGA ACTCCAGGCG CGCCGGCGCC TGGGCCTGAC GGCCACCTTG
ATCCGGGAAG ACGGCCACGA AGACGACGTC TTTTCCTTAA TCGGTCCCAA GAAATATGAT
TTACCCTGGA AGCAGCTCGA GGCCCAGGGA TGGATCGCCA AAGCCACGTG CTATGAGGTG
AGGCTAAATC TACCGCCGGA GATGCGCCTG GACTACGCCT CCGCCGGTGA GCGGGACAAG
TACCGCATCG CCGCCACCAA CCCGGTAAAA GAGGCTGTGG TTGAGAACAT TATAAAACGC
CACGAGGGCG AACAGGTCCT GGTAATCGGC CAGTATCTCG AGCAACTGGA ACGCCTGGCC
CGGCGGCTGG GGGTACCCAT GATAACCGGG CAGACCAGCA ACCGGGAACG CGAGAGGCTC
TATCAGGCTT TCCGCGAGGG GACTCTGAAG TGCCTGGTGG TTTCCAAGGT GGCAAATTTT
GCCATCGACC TGCCGGAGGC CAGCGTGGCC GTCCAGGTTT CGGGAGCCTT CGGCTCGCGC
CAGGAAGAGG CCCAGCGCTT GGGCCGGATT TTAAGGCCCA AGAAGGGGGG CCTACCCGCC
AGCTTTTATA CCCTGGTTAC CCGGGAGACG GTGGAGCAGG AGTTTGCCGT CCACCGGCAG
CTCTTTCTCA CAGAGCAGGG TTACCGCTAT GTGATAATTG GGCCGGATCT GGAGCAGGAA
GGAGATAAGG TTTATCCGTT GAAGACGCCC ACGGGAAGCG AGCCGGCCGT GGGGGCGGTA
ATAAAGCAGG AAAATCTCGA TGGCAAGGTG ATCGATTTAA TGGCCTGGCG CCAGAAGGCC
GGCCGTTAA
 
Protein sequence
MADMSDAPLV VQSDRTILLE VDNPLYPEAR DALARFAELV KSPEHIHTYR LTPLSLWNAA 
AGGLDAATII QVLADYSKYP LPANVVADIR EYVGRYGKVK LVARGTGLRL VTADPGIAAE
ISNNKRIQPY IKERQDACTL AIDPWQRGPV KQALIKIGYP VEDLAGYIPG APLPFSLRER
TLNGETFSLR PYQAEAARVF YAGGSSRGGS GVIVLPCGAG KTVVGIAAMA LCQCYTLILV
TSVTAARQWL AEIRDKTDLP PEMLGEYTGE KKEIKPVTVA TYQIITHRRR RNEDYPNFQL
FNQQDWGLII YDEVHLLPAP IFRITAELQA RRRLGLTATL IREDGHEDDV FSLIGPKKYD
LPWKQLEAQG WIAKATCYEV RLNLPPEMRL DYASAGERDK YRIAATNPVK EAVVENIIKR
HEGEQVLVIG QYLEQLERLA RRLGVPMITG QTSNRERERL YQAFREGTLK CLVVSKVANF
AIDLPEASVA VQVSGAFGSR QEEAQRLGRI LRPKKGGLPA SFYTLVTRET VEQEFAVHRQ
LFLTEQGYRY VIIGPDLEQE GDKVYPLKTP TGSEPAVGAV IKQENLDGKV IDLMAWRQKA
GR