Gene Moth_0290 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0290 
Symbol 
ID3832952 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp294371 
End bp295858 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content50% 
IMG OID637828225 
Productintegrase catalytic subunit 
Protein accessionYP_429167 
Protein GI83589158 
COG category[L] Replication, recombination and repair 
COG ID[COG4584] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.196451 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACAAAT GGCAGCGCAT CAAGGCACTG CACGCTCAAG GGGTCGGCAT CAGGCAAATA 
GCAAGGGATG TTGGGGTGTC CAGGAATACC GTCAGGAAGT ACCTTAAAGA AGCCGGCCCT
CCCCAGTTTA AAGCCAGGGA GTATACCAAA GAACTGGACA AGTTTCTGGA AGAAATAAAG
GTTATGCTTG CCAAAGGATA TATCGGCACA AGGATTTACA AAGAACTGAA AGATAAGGGC
TATCAAGGCT CCCTGGCCAG CGTCCACCGT TATCTTAGGG CCATCAAGGA AGATGACAGG
ACCGCTAAAT TAGCCACCAC CCGGGTGGAA ACAGGCCCAG GTAAACAGAT GCAGTACGAT
TGGAAGGTGT GGACGCTACC AGTTGACGGG AAGCTCGTGA AAATATATCT CCACGAAGTG
GTCTTATCCT ACAGCCGGAT GAAATTCTAC ACCTTCTCTT TAAGCATCAC CACCGCCGAT
GTGATCCGGG TTCTGATTGA AGCCATTGAC TTCTTCGGCG GTTATGCCCC GGAGTTGGTG
ATAGACAACG GCAAGCAAAT GGTTATCACC CACCAGAAGG ACGGTATTGT CCGGTATAAT
GACGAGTTTT TAAAATTCTG CGGGATGTAT GGCATTGAGC CTTCGGCTTG CGAAAACTAC
CGTGCCCGGA CCAAGGGAAA GGTAGAACGC CCCTTTTACT ATGTCCAGGA ACACCTGCTG
CGGGGCCTGG AGGTGGGGAA CTTAAACGAA TTCGCTGTAA AGCTTTCCGA GTTCCAGGAA
GCCTACAACA AAAGGCCCCA CAGCACCTTA GGCCGGCCGC CGGAAGAAAT GTTTGCCGAG
GAAAAAGGGT GCCTTGTTAA AATACCGGCT GTCGAACCGG CCTTATTACA CCATAAAGAA
CCCCGGAAGG TGAGCAATGA CGGCTATATA TCCCATGACG GCAATCTCTA CCCCGTACCC
ATGCGCTACT GCTTAAGGAG GGTGTGGGTC GAAAACATCT ACGGCCGGCG CTTAAAGGTA
TATGACGAGG AAGGTGCGCT TTTAGCGGAG TTTGACCTTG ACCTTAAAAA ACAAACCGCC
CGTCCCCTTC ACCCCGAACA CGAAACCATC AACCGTCAAT ACCAGGAAAA GAAACTGAAG
CTACGCTCGG CCCTGGTGGA GAAGTTCACC AGCGCCTTTG GCGAGGATGG CCAAAGGTAT
CTGGAAGGCC TGCGTGATAA AAATGGCGCC AACCTGTACT GGCACCTGGC GGAAATCTTA
AGCTATCAGG AGATATATAC CCCAGAAGAT ATCATAGCAG CCATCAAAGA ATGCCTGAAA
ATCGGTTCTT ATCACAAAAA CAGCGTAAAA AGGCTTTTAG AGCGCAAGGA AATCGCTCCG
CTTTCTTGTG CCTGTGACCC GGCAAGTGTC AATATGCCGC CAGGTAAAAT CAAACGGGAC
CTCTCCTGTT ATGCCCTAAA GGAGAGCGAG GTGGCGGCAG TATCATGA
 
Protein sequence
MYKWQRIKAL HAQGVGIRQI ARDVGVSRNT VRKYLKEAGP PQFKAREYTK ELDKFLEEIK 
VMLAKGYIGT RIYKELKDKG YQGSLASVHR YLRAIKEDDR TAKLATTRVE TGPGKQMQYD
WKVWTLPVDG KLVKIYLHEV VLSYSRMKFY TFSLSITTAD VIRVLIEAID FFGGYAPELV
IDNGKQMVIT HQKDGIVRYN DEFLKFCGMY GIEPSACENY RARTKGKVER PFYYVQEHLL
RGLEVGNLNE FAVKLSEFQE AYNKRPHSTL GRPPEEMFAE EKGCLVKIPA VEPALLHHKE
PRKVSNDGYI SHDGNLYPVP MRYCLRRVWV ENIYGRRLKV YDEEGALLAE FDLDLKKQTA
RPLHPEHETI NRQYQEKKLK LRSALVEKFT SAFGEDGQRY LEGLRDKNGA NLYWHLAEIL
SYQEIYTPED IIAAIKECLK IGSYHKNSVK RLLERKEIAP LSCACDPASV NMPPGKIKRD
LSCYALKESE VAAVS