Gene Moth_0923 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0923 
Symbol 
ID3832924 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp957089 
End bp959131 
Gene Length2043 bp 
Protein Length680 aa 
Translation table11 
GC content61% 
IMG OID637828854 
ProductATP-dependent DNA helicase RecG 
Protein accessionYP_429783 
Protein GI83589774 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG1200] RecG-like helicase 
TIGRFAM ID[TIGR00643] ATP-dependent DNA helicase RecG 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.452189 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATACTCG ACCATCCGGT AGCAACTTTA AAATATGTGG GCCACCAGAG GGCGGCCCGC 
CTTTCCCGGC TGGGCATCCA GACGGTTGGT GACTTATTAT GGCACTTCCC CCGGCGTTAT
GAAGACCGCC GCCAGCTTAA AGATCTGGCG GCAGTGGCTC CCGGCGAGGT GGTCACGGTA
CAGGTAACCA TCAGGGCCTG GGAAGAAAGG GAAGTCCGCC CTCGCCTGCG GTTGATCCGG
GCCCTGATTC AGGGCCGGCA GGGAACAGGC TTTGCCGTTT GGTTTAACCA GCCATACCTC
AAGCGCCAGA TGCCGCCGGG AACCGGGGTA ATCCTCACCG GGAAGGTTAG ATACAGGGAC
TACCGGCCGG AAATCCAGGT CAGTGATTAC GAAGCCCTGG GTGAAGGGGA TCCGGGTCTC
CATACAGGGC GGATTGTACC CTTTTATGCT TTGACGGCCG GCCTTAGCCA GCGCTGGCTG
CGCCTGGTAA TCCATCTGGC CCTGGAAGCC GTTGCCGGGG ATTTACCGGA GGTTTTACCC
CTATCCTTAT GCCGGCGTTA CCGCCTTATA CCCCGCCTCC AGGCTTTAAA ATATATCCAT
TTCCCCCCCG ATGCCGCCGG CCTTCACCAG GCCCGGCGCC GGCTGAAATA CGAAGAACTG
CTTATCTGGG AACTGGGTTT AAACCTGCAC CGGGTGCAGC AGGAGCAAGG CAGGCAGGGT
ATAGCCCATA CCCCGGCTAA CAATCTGGTT AACCGGCTGG TTGACAGCCT GCCCTTTAAA
TTAACCTCAG CCCAGGCCAG GGCGCTGGCG GAAATCCTGG CCGATATGGA AGCCCCCCGG
CCTATGGCCC GGCTGCTCCA GGGGGATGTC GGCTCCGGGA AAACGGTTGT TGCTGCGGCC
GCTATGGTAA AGGCGGTGGC CGGCGGCTGG CAGGCTGCCC TCATGGCTCC TACGGAAGTC
CTGGCTGAGC AACACGGCAG GACCCTGGGG CAACTACTGG CGCCCCTGCG GCTCCCGGTA
GTTACCCTTA CCGGCAGTAC CCCCAGGACT GAACGGGAAA ACATCCTGGC CGGCCTGGCC
AGCGGCCAGT TGCCCCTGGT GGTGGGCACC CACGCCCTTA TTCAGGACGA TGTAAGCTTT
AAGTCTCTGG GCCTGGTGGT AATCGACGAA CAGCACCGTT TTGGCGTTGA CCAACGGGCT
GCCCTCCAGA CCAAGGGCGA GTGCCCCGAT TTACTGGTCA TGACGGCGAC GCCCATTCCC
CGGACCCTGG CCCTGGCCAT CTATGGTGAC CTGGATATCT CCGTCCTGGA TGAGCTGCCT
CCGGGCCGGC AACCGGTGGC TACATATGTA ATCACAGAAA AACAGCGTCC CCGGGCTTAC
CGGTTAATCG AACGGGAAAT CCGGGCCGGT CACCAGGCCT ATGTCATCTG CCCCGTAATC
GACGCCAACG ACGGCGTGGC CGTGGAAGCC GCCACCGCCA TGGCCCGGAA ACTGCAGGAG
GAGGTTTTCC CGGGATACCC GGTAGGGCTG GTGCACGGCC GCCTGCGACC GGCGGAAAAA
GAAGAGGTTA TGAACGCCTT CCGGGAAGGG AAGATTGCCA TCCTGGTAGC TACTACGGTG
GTTGAGGTGG GGGTTGATGT CCCCAATGCT ACGGTGATGT TAATCGAAGG AGCGGAAAGG
TTGGGTCTGG CCCAACTGCA CCAGTTGCGG GGGCGGGTGG GCCGGGGCAC GGCAGCAGCT
TACTGCTTCC TGGTTACCAG AGGCAGCCAG GCAGCCCGGG AGCGACTGGC GATCCTGACT
ACCAACCGGG ACGGCTTTGC CATTGCCGAG GCCGACCTGC GCCTGCGGGG ACCGGGCGAG
TTCTTCGGTA CCCGCCAGCA CGGGTTACCT GAATTTCACC TGGCCCAGCT CCCCGGGGAC
AGCCACATCC TGGAACAGGC CCGCCAGGAT GCCAGGGAGA TTTGCCGCCA GGAGGGTCCT
TCACTTGAAT ATGAGGCCCT TTACCAGGCT GCCAGGGAAA AGCTGGCCGG TTTACATTTT
TAA
 
Protein sequence
MILDHPVATL KYVGHQRAAR LSRLGIQTVG DLLWHFPRRY EDRRQLKDLA AVAPGEVVTV 
QVTIRAWEER EVRPRLRLIR ALIQGRQGTG FAVWFNQPYL KRQMPPGTGV ILTGKVRYRD
YRPEIQVSDY EALGEGDPGL HTGRIVPFYA LTAGLSQRWL RLVIHLALEA VAGDLPEVLP
LSLCRRYRLI PRLQALKYIH FPPDAAGLHQ ARRRLKYEEL LIWELGLNLH RVQQEQGRQG
IAHTPANNLV NRLVDSLPFK LTSAQARALA EILADMEAPR PMARLLQGDV GSGKTVVAAA
AMVKAVAGGW QAALMAPTEV LAEQHGRTLG QLLAPLRLPV VTLTGSTPRT ERENILAGLA
SGQLPLVVGT HALIQDDVSF KSLGLVVIDE QHRFGVDQRA ALQTKGECPD LLVMTATPIP
RTLALAIYGD LDISVLDELP PGRQPVATYV ITEKQRPRAY RLIEREIRAG HQAYVICPVI
DANDGVAVEA ATAMARKLQE EVFPGYPVGL VHGRLRPAEK EEVMNAFREG KIAILVATTV
VEVGVDVPNA TVMLIEGAER LGLAQLHQLR GRVGRGTAAA YCFLVTRGSQ AARERLAILT
TNRDGFAIAE ADLRLRGPGE FFGTRQHGLP EFHLAQLPGD SHILEQARQD AREICRQEGP
SLEYEALYQA AREKLAGLHF