Gene Moth_0224 details

Gene Information

Locus tag: Moth_0224
Symbol:
ID: 3832552
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 221141
End bp: 222502
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 66%
IMG OID: 637828160
Product: DEAD/DEAH box helicase-like
Protein accession: YP_429102
Protein GI: 83589093
COG category: [L] Replication, recombination and repair
COG ID: [COG4098] Superfamily II DNA/RNA helicase required for DNA uptake (late competence protein)
TIGRFAM ID:
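
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; neither convention is stated in the record. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa of the product, assuming the CDS ends with one stop codon."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode an amino acid


if __name__ == "__main__":
    length = gene_length(221141, 222502)
    print(length, protein_length(length))
```

Under these assumptions the start and end positions reproduce the listed gene length of 1362 bp, and 1362 bp corresponds to 454 codons, that is 453 amino acids plus a stop codon.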


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.0854119
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCACACCC CCCTGGAGGA CCTCTGTCAC TGGCTTTATT TGGAAGGTGA AATAAAGCTT 
CTGCCCGGGG TGGGTTATGA CCCTGACGGT CGCCCCCGGT GCCGGCGCTG CGGCCAGGCT
ACCGGGCTTT TAAAGGTCAA CTGCGCCGCC TGCAGCCGGG AGGATTGCCT GCTCTGCGAG
GAGTGCCTGG CTATGGGCCA GTCCCGCCGC TGCCGTCCTC TGTACGCCAG GCCCTGGCCC
TTTGCCGCGG GTTCTCCGGC CAGGCCGGCG GCAGTGGTGC GCCCCCTGCT CCGGTTTGAC
CTCACGCCGG CCCAGGCGGA CGCCTACCGG GAGGCGGAAG GGTTTGCCAG CCAGGATAAG
GAAAAGGAGT GCCTCCTCTG GGCCGCCTGT GGCGCCGGAA AAACTGAGGT GGCCTATGGC
GCCATTGCTG CCGCCCTGGC CCGCGGGCGT AAAGTCCTTT ATGCCTGCCC CCGGAAGGAG
GTTATCCGGG AACTCCACCC GCGCCTGCAA GCCGTCTGGC CGGGCCTGCG GATCCAGGCT
CTATATGGTG GCAGCCAGGG CAAATACGGC GAGGCCGACC TCATCCTGGC CACCACCCAC
CAGGCCTTAC GTTTCTACCG CCGTTTTGAC CTGGTGATCC TCGATGAAGT GGACGCCTTC
CCCCTGGCGG GGGACCCCAT GCTCTACTAT GCCGTCGAGC GGGCGCGCCG GGAACATGGT
CAGATCCTGT GGTTAACGGC CACCCCGCCC CCGGAGATGG TGGCCAGGGT CAGGAAGGGC
AAGCTGGCCG TTATTTACTT GCCAGCCCGG TACCACGGCC ACCCCCTCCC GGAACCCGAG
TTCGTCCGGG AACCATTTCT TAGGCCGCCG GGGACAGGCC CCCTGCCTCG CTCCATGGTT
AACTGTATAA ATACTACCCT GGGGGCGGGG CTCCAGCTCC TGCTCTTCGT CCCGGCCGTT
TCCCTGGTGG AGGGGGTGGC TGCATGGTTG CTGGACTCCT GGCCCGGCCA GGCCCCCGGC
GGGGCCTGGG TCCGGGGCTG TCATGCCGCC CACCCCAGGC GGGAGGAAGT TATCGCTGCC
TTTCGCCGGG GAGAATTCCC GGTTCTGGTG ACCACTACCG TTATGGAGCG GGGGGTTACC
ATTCCCCGCC TGAACGTCCT TGTCCTTTAC GCTGAGGAGG GCAGGGTCTT TACGGCCAGC
ACCCTGGTGC AGATCGCCGG CCGGGCCGGG CGTTCGGCGG CTTATCCCAC CGGGAGGGTA
TGGTTTATAG GCCGGCACTT GAGCCCCGCC ATTGCAGAGG CTGCCCGCCA GATCCGGGAA
TTCAACCGCC TGGCCCGCCG GCGGGGTTAC TTGACGCGGT AA
 
Protein sequence
MHTPLEDLCH WLYLEGEIKL LPGVGYDPDG RPRCRRCGQA TGLLKVNCAA CSREDCLLCE 
ECLAMGQSRR CRPLYARPWP FAAGSPARPA AVVRPLLRFD LTPAQADAYR EAEGFASQDK
EKECLLWAAC GAGKTEVAYG AIAAALARGR KVLYACPRKE VIRELHPRLQ AVWPGLRIQA
LYGGSQGKYG EADLILATTH QALRFYRRFD LVILDEVDAF PLAGDPMLYY AVERARREHG
QILWLTATPP PEMVARVRKG KLAVIYLPAR YHGHPLPEPE FVREPFLRPP GTGPLPRSMV
NCINTTLGAG LQLLLFVPAV SLVEGVAAWL LDSWPGQAPG GAWVRGCHAA HPRREEVIAA
FRRGEFPVLV TTTVMERGVT IPRLNVLVLY AEEGRVFTAS TLVQIAGRAG RSAAYPTGRV
WFIGRHLSPA IAEAARQIRE FNRLARRRGY LTR
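
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It uses the codon assignments of translation table 11, which match the standard code, and it assumes that an initial ATG, GTG or TTG codon is read as methionine. Table 11 permits further start codons, so this is a simplification. The helper names `clean`, `gc_content` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)
# Assumption: only the most common bacterial start codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS, reading the first codon as M and stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate("GTGCACACCC CCCTGGAGGA CCTCTGTCAC"))  # MHTPLEDLCH
```

The first thirty bases of the gene sequence translate to MHTPLEDLCH, which matches the first ten residues of the protein sequence. The leading GTG codon is read as methionine because it is the start codon.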