Gene Moth_1788 details


Gene Information

Locus tag: Moth_1788
Symbol:
ID: 3832454
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1841436
End bp: 1844141
Gene Length: 2706 bp
Protein Length: 901 aa
Translation table: 11
GC content: 60%
IMG OID: 637829713
Product: DNA repair ATPase-like protein
Protein accession: YP_430632
Protein GI: 83590623
COG category: [L] Replication, recombination and repair
COG ID: [COG0419] ATPase involved in DNA repair
TIGRFAM ID:
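
The coordinates and lengths in this record are arithmetically linked, and the relationship can be sketched in a few lines of Python. This is a minimal illustration, not part of the database record: the function name `cds_lengths` is hypothetical, and it assumes 1-based inclusive coordinates and a terminal stop codon counted inside the gene span (both are consistent with the values listed here).

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and that the last codon of the
    span is a stop codon, so it does not contribute an amino acid.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("span is not a whole number of codons")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Values taken from the record for Moth_1788.
print(cds_lengths(1841436, 1844141))  # (2706, 901)
```

The result agrees with the Gene Length (2706 bp) and Protein Length (901 aa) fields.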


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.284461
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTTCCTGC ACTCCCTGAC TGTTGCCGGT TTGCGTCGTT TCCGCAACCC CGTGGCCCTG 
ACAGGTTTCG ACCCGGGGAT CAATATCATT TACGGCCCCA ATGAATGCGG TAAGTCAACC
CTTATCTGGG GGTTGATCCT GGCCTTTTTG AACCGCCATA ACGTGAGCGG CGAGGAGATT
GCCAGGTTCC GTCCCTGGGG TACGGCGCTG AATCCCATCA TTAAGGTGGA GTTCACCGCC
GGCGGCAAGC GCTACCGGCT GGACAAGGGC TTTTTGGACG AAGCCCGGTG CACCTTGTCC
GAATGGACAG GAGGGGGCTA CCAGGATTTG GCCCGGGGTA AAGCCGCCGA CGAGATGGTC
TTCCAGATGA TGCAGGCTCA GATGGCGGGC CGTGGCTTGA CCAAAAGCTC CCAGTGGGGC
CTGGCCCATC TCCTCTGGAT GCCCCAGGAC AGGGAACGCC TGACGGCCCC GGCCCTAACG
GGCAATGTCC AGGACTACTT CCGTGAGTCT CTGGGCTCCA CCATTTTTAC CCCTGCTGAT
GACCGCCTGC TGGCGGTAAT CGACAGGAAA TACAGTGAGA TTTATACCCA GAAGCGGGGC
GTCCTCCAGG CCCGCTCCCC CGTGCATCAA GCCCGGCAGA GATTGGAGGA GATCAAGGAG
CAGCTGGCTA CCCTCCGGCA AGCCCTGGAG CAGGTTAACC AGGTGGTGGC AGAACTAGAA
ACGCAGCAGG CACGCCATGA GGCGCTGCAA CAGGAACTGG CCGGCCTCCG GGCAAGGGAG
AAGGGATTAA AGGAACAGGT CGAGGCCATT AAAACCTTGC AGGCCGGCTA CCGGCAGGCG
GAAGCCGAAG TAAAAACGGC CACTACCATC TGGGAAAAAT TAAAAACAGA ATGGTGCCGG
GTTAACGAGC TGGAAGAAAG CATTAACCTG GCTAACGCCG AAATAAATCA GGAGAAAGAT
TCCCTGGCGT CCCTGGGTAA CGCCTGGCAG GAGGCGGGTC AAAAGCTGGG TGCCGCTGAA
GAACGCCTGG CAACCCTGGA AAATGAACTG GGTCGGGCCC GCAAAGCCCT GCAGCAATCC
TATGACCGGC AGCAGGCCCG GGAGCGCCTA GTGCGAATCG GCGAGCTGTC CCGGCGCCTT
AAGAAAGCCC GCCAGGTTAC CGCCCGGATT GCCGCCCTGG AGGGCGAGCT GGCCCGGCAA
CCCCTGCCCG GCCCGGAAGA GGTCAAGACC GTCACAGAAA CCTGGAACCG TATTAATTCC
CTGGAGGCCG AGGCCCGGGC CATGGGCCTG AGCATCAATT TCCGGGCCTA TCGCGACCTG
GAGCTTAGCG TTACCACTGC AGCCGGCCGG GAGAAGCGCC GGGTGGGCCC GGATCAACCC
CTTCTCCTGG ATGCCACCGA CCGCCTCGCC CTGGAGATAC CCGGCGCCGG CCTCCTGGAG
GTACGTTCGG GTTCCAGGGA CCTGCATAAG CTTTTACAGG AGATTGCCGG CCTGCGGCAA
GATCTGGAGC ATACCCTGGC CGCTTTTGGC GTTGCCAGTG TGGACGAATT GCAGGAAAGG
TATAACTGGA GCGTCCGCCA AAGGACGGAG TTGAAGGCTC TGGGGGACAA CCTGGGGCAG
GTCCTCGGCC CGGGTAATAC CCTGGAGGCC ATGGAAAAGC AGGTCTTAAC GGAACAGACT
GCCCTGGCGG AGCAATGTGC CGCCCTGGGG ATTACCGGGG AGGAACTTCT GGCCCTACCG
GGACCGGATA TCACTCCCCT GAAGGTGCAG GTGAAAAACC TCCAGGAAGA GGAGGAAAAA
CAGAAACAGG TGGTAACCCA GCTGAGGCAG CAGTTTCAGG ACCTGGAACG AAGGCTAGCA
GAAGAGCAGA AGAAGCTGGA GACCCTGCAA GCCCGGATCA GGGCCGCCAG GGAAGAACGT
GCCCGGCGCC TGGAGGCCTG GGGCGGCTCC GCCGCCAAAT TGCAGGCCGA ACTGGTGGCT
GCGGAGACGG CCAGGAGCAA CAAAACCCTG ACCTTGGAAG AGCTTAAGCA TCGCCTCCCC
GCCAATGCTG CTGAAATTGA ACAAGAAGCA AACCGGGTGG AAAAAAAGAG GGAAGAACTC
GAACGGGAAA TCCTGGTGGT CCGTGACAAG CTTACCAGCC TGAAAACACA GCTGGATCTT
AGCAGTGGCC AGGGTCTCTA TTCCCGGTTG GCCGGCCTGG AAGAGGAATA TGAACTGGCA
GGTGCTGAAT ACCAGCACGC GGCCCGGTAC GCCTGGGCTA TCTGGTTATT ACACCTGATC
ATGCACAACC GCCGGGACCA GATGCTGAAC AGCCTCACCG GGCCGGTGAG GCAGGAAGTC
AGCGACCTCT TTCAGCAGAT TACCGGCCGC CGGGAACGTA GCGTCGAACT GAATGCCGAT
CTTTCCCTGG CCGGCCTAAA GGTGGGAGTG GCAGACCCGC AACCCCTGGA TGTCTTCTCG
GTAGGCACCC AGGAACAGCT TATGGTCGCC ATCCGCCTGG CCCTTGGGCG TTTTTTGGGC
GCCGGCGAAC GACAGCTGGT GGTCCTGGAC GACGCCCTGG TAAATACAGA CGCCGGCCGC
CGTAGCCGCA TTCTGGATTT GTTGGCCGCC GCAGCAGAGA AACTGCAGAT AATAATCCTT
ACCTGTCACC CAGAGAACTA CAGCGGACTA AAGGGAAGGC TGTTTAACGT AGAAGAGCTC
GGGTAG
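
The gene sequence above is laid out in space-separated blocks of 10 bases, so it has to be cleaned before any computation. The following is a small sketch of how one might strip the layout and compute the GC fraction reported in the Gene Information section. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the code assumes the input contains only the letters A, C, G, T plus whitespace.

```python
def clean_sequence(text):
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C (seq must be non-empty)."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = ("GTGTTCCTGC ACTCCCTGAC TGTTGCCGGT "
              "TTGCGTCGTT TCCGCAACCC CGTGGCCCTG")
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Applied to the full sequence, `len(clean_sequence(...))` should match the listed gene length, and `gc_fraction` can be compared against the listed GC content.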
 
Protein sequence
MFLHSLTVAG LRRFRNPVAL TGFDPGINII YGPNECGKST LIWGLILAFL NRHNVSGEEI 
ARFRPWGTAL NPIIKVEFTA GGKRYRLDKG FLDEARCTLS EWTGGGYQDL ARGKAADEMV
FQMMQAQMAG RGLTKSSQWG LAHLLWMPQD RERLTAPALT GNVQDYFRES LGSTIFTPAD
DRLLAVIDRK YSEIYTQKRG VLQARSPVHQ ARQRLEEIKE QLATLRQALE QVNQVVAELE
TQQARHEALQ QELAGLRARE KGLKEQVEAI KTLQAGYRQA EAEVKTATTI WEKLKTEWCR
VNELEESINL ANAEINQEKD SLASLGNAWQ EAGQKLGAAE ERLATLENEL GRARKALQQS
YDRQQARERL VRIGELSRRL KKARQVTARI AALEGELARQ PLPGPEEVKT VTETWNRINS
LEAEARAMGL SINFRAYRDL ELSVTTAAGR EKRRVGPDQP LLLDATDRLA LEIPGAGLLE
VRSGSRDLHK LLQEIAGLRQ DLEHTLAAFG VASVDELQER YNWSVRQRTE LKALGDNLGQ
VLGPGNTLEA MEKQVLTEQT ALAEQCAALG ITGEELLALP GPDITPLKVQ VKNLQEEEEK
QKQVVTQLRQ QFQDLERRLA EEQKKLETLQ ARIRAAREER ARRLEAWGGS AAKLQAELVA
AETARSNKTL TLEELKHRLP ANAAEIEQEA NRVEKKREEL EREILVVRDK LTSLKTQLDL
SSGQGLYSRL AGLEEEYELA GAEYQHAARY AWAIWLLHLI MHNRRDQMLN SLTGPVRQEV
SDLFQQITGR RERSVELNAD LSLAGLKVGV ADPQPLDVFS VGTQEQLMVA IRLALGRFLG
AGERQLVVLD DALVNTDAGR RSRILDLLAA AAEKLQIIIL TCHPENYSGL KGRLFNVEEL
G
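
The gene sequence begins with GTG while the protein begins with M, which reflects the use of an alternative start codon under translation table 11. The sketch below shows one way to reproduce the protein from the gene sequence using only the standard library. It is an illustration under stated assumptions: `translate` and `START_CODONS` are hypothetical names, the codon assignments are the standard genetic code (which table 11 shares for elongation), and the start-codon set is a simplified subset of the initiators table 11 allows.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: simplified set of bacterial start codons read as Met
# when they occupy the first codon position.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna):
    """Translate a coding sequence up to (not including) the first stop."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]  # KeyError on non-ACGT input
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # initiator codon is read as methionine
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = ("GTGTTCCTGC ACTCCCTGAC TGTTGCCGGT "
              "TTGCGTCGTT TCCGCAACCC CGTGGCCCTG")
print(translate(first_line))  # MFLHSLTVAGLRRFRNPVAL
```

The output matches the first 20 residues of the protein sequence listed above.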