Gene Information |
Locus tag | Moth_0649 |
Symbol | |
ID | 3832045 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 679902 |
End bp | 680822 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 637828590 |
Product | integrase catalytic subunit |
Protein accession | YP_429520 |
Protein GI | 83589511 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4584] Transposase and inactivated derivatives |
TIGRFAM ID | |
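The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and assumes that the gene length counts the terminal stop codon while the protein length does not; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Moth_0649
length_bp = gene_length(679902, 680822)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the record's values this gives 921 bp and 306 aa, matching the Gene Length and Protein Length fields.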
|
|
Plasmid Coverage information |
Num covering plasmid clones | 46 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00201548 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTACAAAT GGCAGCGCAT CAAGGCACTG CACGCTCAAG GGGTCGGCAT CAGGCAAATA GCAAGGGATG TTGGGGTGTC CAGGAATACC GTCAGGAAGT ACCTTAAAGA AGCCGGCCCT CCCCAGTTTA AAGCCAGGGA GTATACCAAA GAACTGGACA AGTTTCTGGA AGAAATAAAG GTTATGCTTG CCAAAGGATA TATCGGCACA AGGATTTACA AAGAACTGAA AGATAAGGGC TATCAAGGCT CCCTGGCCAG CGTCCACCGT TATCTTAGGG CCATCAAGGA AGATGACAGG ACCGCTAAAT TAGCCACCAC CCGGGTGGAA ACAGGCCCAG GTAAACAGAT GCAGTACGAT TGGAAGGTGT GGACGCTACC AGTTGACGGG AAGCTCGTGA AAATATATCT CCACGAAGTG GTCTTATCCT ACAGCCGGAT GAAATTCTAC ACCTTCTCTT TAAGCATCAC CACCGCCGAT GTGATCCGGG TTCTGATTGA AGCCATTGAC TTCTTCGGCG GTTATGCCCC GGAGTTGGTG ATAGACAACG GCAAGCAAAT GGTTATCACC CACCAGAAGG ACGGTATTGT CCGGTATAAT GACGAGTTTT TAAAATTCTG CGGGATGTAT GGCATTGAGC CCTGTCCTTG CGCCAACTAC CGTGCCCGGA CCAAGGGAAA GGTAGAACGC CCCTTTTACT ATGTCCAGGA ACACCTGCTG CGGGGCCTGG AGGTGGGGAA CTTAAACGAA TTCGCTGTAA AGCTTTCCGA GTTCCAGGAA GCCTACAACA AAAGGCCCCA CAGCACCTTA GGCCGGCCGC CGGCGGAAAT GTTTGCCGAG GAAAAAGAAT GCCTTGTTAA AATACCGGCT GTCGAACCGG CCTTATTACA CCATAAAGAA CCTCTTAAGC CCCGAAATTA G
|
Protein sequence | MYKWQRIKAL HAQGVGIRQI ARDVGVSRNT VRKYLKEAGP PQFKAREYTK ELDKFLEEIK VMLAKGYIGT RIYKELKDKG YQGSLASVHR YLRAIKEDDR TAKLATTRVE TGPGKQMQYD WKVWTLPVDG KLVKIYLHEV VLSYSRMKFY TFSLSITTAD VIRVLIEAID FFGGYAPELV IDNGKQMVIT HQKDGIVRYN DEFLKFCGMY GIEPCPCANY RARTKGKVER PFYYVQEHLL RGLEVGNLNE FAVKLSEFQE AYNKRPHSTL GRPPAEMFAE EKECLVKIPA VEPALLHHKE PLKPRN
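The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the database's own pipeline. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TAG) and uses the standard codon assignments, which translation table 11 shares for elongation codons. The helper name `translate` is hypothetical.

```python
from itertools import product

# Standard codon assignments, enumerated in T, C, A, G order for each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in the record
print(translate("ATGTACAAAT GGCAGCGCAT CAAGGCACTG"))  # MYKWQRIKAL
print(translate("CCCCGAAATTA G"))                     # PRN
```

The two fragments reproduce the first ten residues (MYKWQRIKAL) and the last three residues (PRN) of the listed protein. Passing the full gene sequence to the same function would allow a complete comparison.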