Gene Moth_0662 details

Gene Information

Locus tag: Moth_0662
Symbol:
ID: 3832149
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 691427
End bp: 692914
Gene Length: 1488 bp
Protein Length: 495 aa
Translation table: 11
GC content: 51%
IMG OID: 637828601
Product: integrase catalytic subunit
Protein accession: YP_429531
Protein GI: 83589522
COG category: [L] Replication, recombination and repair
COG ID: [COG4584] Transposase and inactivated derivatives
TIGRFAM ID:
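
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The Python sketch below is only an illustration: the variable names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 691427
end_bp = 692914

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which does not add a residue.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1488
print(protein_length_aa)  # 495
```

Under these assumptions the computed values agree with the Gene Length (1488 bp) and Protein Length (495 aa) fields listed for this gene.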


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0102982
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000317754
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTACAAAT GGCAGCGCAT CAAGGCACTG CACGCTCAAG GGGTCGGCAT CAGGCAAATA 
GCAAGGGATG TTGGGGTGTC CAGGAATACC GTCAGGAAGT ACCTTAAAGA AGCCGGCCCT
CCCCAGTTTA AAGCCAGGGA GTATACCAAA GAACTGGACA AGTTTCTGGA AGAAATAAAG
GTTATGCTTG CCAAAGGATA TATCGGCACA AGGATTTACA AAGAACTGAA AGATAAGGGC
TATCAAGGCT CCCTGGCCAG CGTCCACCGT TATCTTAGGG CCATCAAGGA AGATGACAGG
ACCGCTAAAT TAGCCACCAC CCGGGTGGAA ACAGGCCCAG GTAAACAGAT GCAGTACGAT
TGGAAGGTGT GGACGCTACC AGTTGACGGG AAGCTCGTGA AAATATATCT CCACGAAGTG
GTCTTATCCT ACAGCCGGAT GAAATTCTAC ACCTTCTCTT TAAGCATCAC CACCGCCGAT
GTGATCCGGG TTCTGATTGA AGCCATTGAC TTCTTCGGCG GTTATGCCCC GGAGTTGGTG
ATAGACAACG GCAAGCAAAT GGTTATCACC CACCAGAAGG ACGGTATTGT CCGGTATAAT
GACGAGTTTT TAAAATTCTG CGGGCTGTAT GGCATTGAGC CCTGTCCTTG CGCCAACTAC
CGTGCCCGGA CCAAGGGAAA GGTAGAACGC CCCTTTTACT ATGTCCAGGA ACACCTGCTG
CGGGGCCTGG AGGTGGGGAA CTTAAACGAA TTCGCTGTAA AGCTTTCCGA GTTCCAGGAA
GCCTACAACA AAAGGCCCCA CAGCACCTTA GGCCGGCCGC CGGAAGAAAT GTTTGCCGAG
GAAAAAGGGT GCCTTGTTAA AATACCGGCT GTCGAACCGG CCTTATTACA CCATAAAGAA
CCCCGGAAGG TGAGCAATGA CGGCTATATA TCCCATGACG GCAATCTCTA CCCCGTACCC
ATGCGCTACT GCTTAAGGAG GGTGTGGGTC GAAAACATCT ACGGCCGGCG CTTAAAGGTA
TATGACGAGG AAGGTGCGCT TTTAGCGGAG TTTGACCTTG ACCTTAAAAA ACAAACCGCC
CGTCCCCTTC ACCCCGAACA CGAAACCATC AACCGTCAAT ACCAGGAAAA GAAACTGAAG
CTACGCTCGG CCCTGGTGGA GAAGTTCACC AGCGCCTTTG GCGAGGATGG CCAAAGGTAT
CTGGAAGGCC TGCGTGATAA AAATGGCGCC AACCTGTACT GGCACCTGGC GGAAATCTTA
AGCTATCAGG AGATATATAC CCCAGAAGAT ATCATAGCAG CCATCAAAGA ATGCCTGAAA
ATCGGTTCTT ATCACAAAAA CAGCGTAAAA AGGCTTTTAG AGCGCAAGGA AATCGCTCCG
CTTTCTTGTG CCTGTGACCC GGCAAGTGTC AATATGCCGC CAGGTAAAAT CAAACGGGAC
CTCTCCTGTT ATGCCCTAAA GGAGAGCGAG GTGGCGGCAG TATCATGA
 
Protein sequence
MYKWQRIKAL HAQGVGIRQI ARDVGVSRNT VRKYLKEAGP PQFKAREYTK ELDKFLEEIK 
VMLAKGYIGT RIYKELKDKG YQGSLASVHR YLRAIKEDDR TAKLATTRVE TGPGKQMQYD
WKVWTLPVDG KLVKIYLHEV VLSYSRMKFY TFSLSITTAD VIRVLIEAID FFGGYAPELV
IDNGKQMVIT HQKDGIVRYN DEFLKFCGLY GIEPCPCANY RARTKGKVER PFYYVQEHLL
RGLEVGNLNE FAVKLSEFQE AYNKRPHSTL GRPPEEMFAE EKGCLVKIPA VEPALLHHKE
PRKVSNDGYI SHDGNLYPVP MRYCLRRVWV ENIYGRRLKV YDEEGALLAE FDLDLKKQTA
RPLHPEHETI NRQYQEKKLK LRSALVEKFT SAFGEDGQRY LEGLRDKNGA NLYWHLAEIL
SYQEIYTPED IIAAIKECLK IGSYHKNSVK RLLERKEIAP LSCACDPASV NMPPGKIKRD
LSCYALKESE VAAVS
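
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. The Python below is a minimal example, not the database's own pipeline: the function and variable names are hypothetical, and it relies on the fact that translation table 11 assigns codons to amino acids in the same way as the standard genetic code (the two differ in which start codons are permitted). Only the first 30 bases and the final block of the gene sequence are used as input.

```python
# Codon table built in the conventional T, C, A, G ordering.
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group bases in blocks of ten.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence (start of the coding region).
first_bases = "ATGTACAAAT GGCAGCGCAT CAAGGCACTG"
# Final block of the gene sequence; it begins at base 1441, so it is in frame.
last_bases = "CTCTCCTGTT ATGCCCTAAA GGAGAGCGAG GTGGCGGCAG TATCATGA"

print(translate(first_bases))  # MYKWQRIKAL
print(translate(last_bases))   # LSCYALKESEVAAVS
```

The two outputs correspond to the first ten and last fifteen residues of the protein sequence shown above; the terminal TGA codon is read as a stop and contributes no residue.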