Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0662 |
Symbol | |
ID | 3832149 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 691427 |
End bp | 692914 |
Gene Length | 1488 bp |
Protein Length | 495 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 637828601 |
Product | integrase catalytic subunit |
Protein accession | YP_429531 |
Protein GI | 83589522 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4584] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0102982 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000000317754 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTACAAAT GGCAGCGCAT CAAGGCACTG CACGCTCAAG GGGTCGGCAT CAGGCAAATA GCAAGGGATG TTGGGGTGTC CAGGAATACC GTCAGGAAGT ACCTTAAAGA AGCCGGCCCT CCCCAGTTTA AAGCCAGGGA GTATACCAAA GAACTGGACA AGTTTCTGGA AGAAATAAAG GTTATGCTTG CCAAAGGATA TATCGGCACA AGGATTTACA AAGAACTGAA AGATAAGGGC TATCAAGGCT CCCTGGCCAG CGTCCACCGT TATCTTAGGG CCATCAAGGA AGATGACAGG ACCGCTAAAT TAGCCACCAC CCGGGTGGAA ACAGGCCCAG GTAAACAGAT GCAGTACGAT TGGAAGGTGT GGACGCTACC AGTTGACGGG AAGCTCGTGA AAATATATCT CCACGAAGTG GTCTTATCCT ACAGCCGGAT GAAATTCTAC ACCTTCTCTT TAAGCATCAC CACCGCCGAT GTGATCCGGG TTCTGATTGA AGCCATTGAC TTCTTCGGCG GTTATGCCCC GGAGTTGGTG ATAGACAACG GCAAGCAAAT GGTTATCACC CACCAGAAGG ACGGTATTGT CCGGTATAAT GACGAGTTTT TAAAATTCTG CGGGCTGTAT GGCATTGAGC CCTGTCCTTG CGCCAACTAC CGTGCCCGGA CCAAGGGAAA GGTAGAACGC CCCTTTTACT ATGTCCAGGA ACACCTGCTG CGGGGCCTGG AGGTGGGGAA CTTAAACGAA TTCGCTGTAA AGCTTTCCGA GTTCCAGGAA GCCTACAACA AAAGGCCCCA CAGCACCTTA GGCCGGCCGC CGGAAGAAAT GTTTGCCGAG GAAAAAGGGT GCCTTGTTAA AATACCGGCT GTCGAACCGG CCTTATTACA CCATAAAGAA CCCCGGAAGG TGAGCAATGA CGGCTATATA TCCCATGACG GCAATCTCTA CCCCGTACCC ATGCGCTACT GCTTAAGGAG GGTGTGGGTC GAAAACATCT ACGGCCGGCG CTTAAAGGTA TATGACGAGG AAGGTGCGCT TTTAGCGGAG TTTGACCTTG ACCTTAAAAA ACAAACCGCC CGTCCCCTTC ACCCCGAACA CGAAACCATC AACCGTCAAT ACCAGGAAAA GAAACTGAAG CTACGCTCGG CCCTGGTGGA GAAGTTCACC AGCGCCTTTG GCGAGGATGG CCAAAGGTAT CTGGAAGGCC TGCGTGATAA AAATGGCGCC AACCTGTACT GGCACCTGGC GGAAATCTTA AGCTATCAGG AGATATATAC CCCAGAAGAT ATCATAGCAG CCATCAAAGA ATGCCTGAAA ATCGGTTCTT ATCACAAAAA CAGCGTAAAA AGGCTTTTAG AGCGCAAGGA AATCGCTCCG CTTTCTTGTG CCTGTGACCC GGCAAGTGTC AATATGCCGC CAGGTAAAAT CAAACGGGAC CTCTCCTGTT ATGCCCTAAA GGAGAGCGAG GTGGCGGCAG TATCATGA
|
Protein sequence | MYKWQRIKAL HAQGVGIRQI ARDVGVSRNT VRKYLKEAGP PQFKAREYTK ELDKFLEEIK VMLAKGYIGT RIYKELKDKG YQGSLASVHR YLRAIKEDDR TAKLATTRVE TGPGKQMQYD WKVWTLPVDG KLVKIYLHEV VLSYSRMKFY TFSLSITTAD VIRVLIEAID FFGGYAPELV IDNGKQMVIT HQKDGIVRYN DEFLKFCGLY GIEPCPCANY RARTKGKVER PFYYVQEHLL RGLEVGNLNE FAVKLSEFQE AYNKRPHSTL GRPPEEMFAE EKGCLVKIPA VEPALLHHKE PRKVSNDGYI SHDGNLYPVP MRYCLRRVWV ENIYGRRLKV YDEEGALLAE FDLDLKKQTA RPLHPEHETI NRQYQEKKLK LRSALVEKFT SAFGEDGQRY LEGLRDKNGA NLYWHLAEIL SYQEIYTPED IIAAIKECLK IGSYHKNSVK RLLERKEIAP LSCACDPASV NMPPGKIKRD LSCYALKESE VAAVS
|
| |