Gene Information |
Locus tag | Moth_1918 |
Symbol | |
ID | 3830842 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1988691 |
End bp | 1990499 |
Gene Length | 1809 bp |
Protein Length | 602 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637829851 |
Product | DEAD/DEAH box helicase-like |
Protein accession | YP_430761 |
Protein GI | 83590752 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1061] DNA or RNA helicases of superfamily II |
TIGRFAM ID | [TIGR00603] DNA repair helicase rad25 |
|
|
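The length fields in the record above are consistent with the listed coordinates, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values copied from the record for Moth_1918.
length_bp = gene_length(1988691, 1990499)
print(length_bp, protein_length(length_bp))
```

With the values from this record the two results agree with the Gene Length (1809 bp) and Protein Length (602 aa) fields.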
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0105236 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGACA TGTCAGACGC TCCCCTGGTA GTCCAGAGCG ACAGGACCAT TCTGCTGGAG GTTGATAACC CCCTTTACCC GGAGGCGCGG GACGCCCTGG CCCGTTTTGC CGAACTGGTG AAGAGTCCCG AACATATTCA CACCTATCGC CTGACGCCCC TTTCCCTCTG GAACGCGGCT GCCGGCGGGC TGGATGCGGC CACTATCATC CAGGTGTTGG CGGATTACAG CAAGTATCCC CTCCCTGCCA ATGTGGTTGC CGATATCCGG GAATATGTCG GCCGTTACGG CAAAGTCAAG CTGGTAGCCC GGGGAACTGG ATTGCGGCTG GTCACAGCCG ACCCGGGGAT AGCCGCGGAA ATCTCGAATA ACAAGCGCAT CCAGCCGTAT ATCAAGGAGC GTCAAGATGC CTGCACCCTG GCCATCGACC CCTGGCAGCG GGGACCGGTC AAACAGGCCC TGATCAAGAT CGGCTATCCG GTGGAAGACT TGGCTGGCTA TATTCCCGGG GCACCATTAC CATTTAGCCT GCGCGAAAGG ACTTTGAACG GGGAGACCTT CAGCCTGCGC CCTTACCAGG CGGAGGCGGC GCGGGTCTTT TATGCCGGAG GTAGTTCCCG GGGCGGGAGC GGGGTAATCG TCTTACCCTG CGGTGCGGGT AAGACCGTTG TCGGTATTGC CGCCATGGCC CTCTGCCAGT GTTACACTTT AATCCTGGTG ACCAGCGTCA CGGCTGCCCG GCAGTGGCTG GCGGAGATCC GGGATAAGAC GGACCTGCCC CCGGAGATGC TGGGCGAATA TACCGGGGAG AAAAAGGAAA TAAAGCCTGT GACCGTGGCT ACCTATCAAA TCATCACTCA CCGCCGCCGG CGCAACGAGG ACTACCCCAA TTTCCAGCTT TTCAACCAGC AGGACTGGGG CTTGATAATT TACGACGAAG TCCACCTGTT GCCGGCCCCC ATTTTCCGCA TTACGGCCGA ACTCCAGGCG CGCCGGCGCC TGGGCCTGAC GGCCACCTTG ATCCGGGAAG ACGGCCACGA AGACGACGTC TTTTCCTTAA TCGGTCCCAA GAAATATGAT TTACCCTGGA AGCAGCTCGA GGCCCAGGGA TGGATCGCCA AAGCCACGTG CTATGAGGTG AGGCTAAATC TACCGCCGGA GATGCGCCTG GACTACGCCT CCGCCGGTGA GCGGGACAAG TACCGCATCG CCGCCACCAA CCCGGTAAAA GAGGCTGTGG TTGAGAACAT TATAAAACGC CACGAGGGCG AACAGGTCCT GGTAATCGGC CAGTATCTCG AGCAACTGGA ACGCCTGGCC CGGCGGCTGG GGGTACCCAT GATAACCGGG CAGACCAGCA ACCGGGAACG CGAGAGGCTC TATCAGGCTT TCCGCGAGGG GACTCTGAAG TGCCTGGTGG TTTCCAAGGT GGCAAATTTT GCCATCGACC TGCCGGAGGC CAGCGTGGCC GTCCAGGTTT CGGGAGCCTT CGGCTCGCGC CAGGAAGAGG CCCAGCGCTT GGGCCGGATT TTAAGGCCCA AGAAGGGGGG CCTACCCGCC AGCTTTTATA CCCTGGTTAC CCGGGAGACG GTGGAGCAGG AGTTTGCCGT CCACCGGCAG CTCTTTCTCA CAGAGCAGGG TTACCGCTAT GTGATAATTG GGCCGGATCT GGAGCAGGAA GGAGATAAGG TTTATCCGTT GAAGACGCCC ACGGGAAGCG AGCCGGCCGT GGGGGCGGTA ATAAAGCAGG AAAATCTCGA TGGCAAGGTG ATCGATTTAA TGGCCTGGCG CCAGAAGGCC 
GGCCGTTAA
|
Protein sequence | MADMSDAPLV VQSDRTILLE VDNPLYPEAR DALARFAELV KSPEHIHTYR LTPLSLWNAA AGGLDAATII QVLADYSKYP LPANVVADIR EYVGRYGKVK LVARGTGLRL VTADPGIAAE ISNNKRIQPY IKERQDACTL AIDPWQRGPV KQALIKIGYP VEDLAGYIPG APLPFSLRER TLNGETFSLR PYQAEAARVF YAGGSSRGGS GVIVLPCGAG KTVVGIAAMA LCQCYTLILV TSVTAARQWL AEIRDKTDLP PEMLGEYTGE KKEIKPVTVA TYQIITHRRR RNEDYPNFQL FNQQDWGLII YDEVHLLPAP IFRITAELQA RRRLGLTATL IREDGHEDDV FSLIGPKKYD LPWKQLEAQG WIAKATCYEV RLNLPPEMRL DYASAGERDK YRIAATNPVK EAVVENIIKR HEGEQVLVIG QYLEQLERLA RRLGVPMITG QTSNRERERL YQAFREGTLK CLVVSKVANF AIDLPEASVA VQVSGAFGSR QEEAQRLGRI LRPKKGGLPA SFYTLVTRET VEQEFAVHRQ LFLTEQGYRY VIIGPDLEQE GDKVYPLKTP TGSEPAVGAV IKQENLDGKV IDLMAWRQKA GR
|
| |
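The protein sequence can be reproduced from the gene sequence by translating it with NCBI translation table 11, the table listed in the record. The following is a minimal sketch, not the database's own pipeline. It assumes the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand) and that the spaces between 10-base groups are only formatting. The names `translate` and `gc_percent` are hypothetical helpers. Table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons, so the start codon is simply translated as written here.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G,
# with the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove whitespace used for display grouping and upper-case the bases."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-orientation sequence up to the first stop codon."""
    seq = clean(dna)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGGCCGACA TGTCAGACGC TCCCCTGGTA"))
print(translate("GGCCGTTAA"))
```

The two fragments translate to the first ten residues (MADMSDAPLV) and the last two residues (GR) of the protein sequence listed above. Passing the full gene sequence to `gc_percent` gives a value to compare against the GC content field.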