Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1561 |
Symbol | |
ID | 3832194 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1603696 |
End bp | 1605336 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637829493 |
Product | resolvase-like protein |
Protein accession | YP_430413 |
Protein GI | 83590404 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1961] Site-specific recombinases, DNA invertase Pin homologs |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.571827 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGCCG TGATATATGC TAGGGTAAGC ACCCAGGACC AGGCTCTCGG TTTTTCCCTG GCCACCCAGA AAGAACTCTG CGAGAAGAAG GCCAGGGCTT TAGGGGCTTT AGAAGTCGAA GTTGTTGAAG ACGCTTACAC CGGGACGGAA TTGAACCGCC CTAGCCTGGA TTATGTGAGG CAGCTCGTAG CTGCCGGGAA AGTGGACCTT GTAGTTGTAT ATGACCCCGA CAGGCTTTCC CGCAACCTTA CAGATTTGCT CATCCTTTGC CGGGAATTTG ATAAGGCAGG AGTAAGGCTG GAGTTTGTTA ATTTTGACTG GCAAAAGACG CCCCAGGGGA TGCTCTTCTT GCAGATGCGG GGGGCCTTTG CCGAATTCGA GCATGCGCTC ATCAAGGAGC GTACCCAGAG GGGCAAGGCC AAAAAAGCGG CCAGCGGCAA GATCCGCTGC TACGCCAAGC CTTTTGGTTA CGACTGGGAC GCCGAGAGGG ATACCCTGGT AATTAATCCC CAGGAAGCCG CAATCGTCAA GCAGATGTTT GAGTGGCTAA CGGATCCCCT GGAACCCTTA ACTCCCTGGC AGATTACCCA GAAGCTGGGC CAGGTGTATC CCCAGGGTCC CAGGGGCAAG GGGTGGGTCC ACTCTTCGGT GGTACGGATG CTGAAAAATC CCGTATATAC AGGCCGGTTG AGGCGCAAGG ATGAGCAGCC TGATTGGAAA CCGGTCCTTG TGCCGGCCAT AATTGACCAG GAAACCTTTA TGAGAGCCCA GGAGATGCTT GCCAGGTCCA AGCGTTTCAA CCCTAAGACT ACCAGGAGAA GATTCCTGTT GCAGCGGTTG CTGGTATGCG GGGAGTGCGG GCGGAGGCTT ACAGTCGTCA CCCATAACAA TCCCCAGAGG GCGAAGTATA GCTATTATAC CTGTCCCGGG CGTTACCCTA CCAAGTTTGA CGACCAGGGC CGGGTAGGCA GATGCGGGCT CCCACCCTGG CGGACGGAAG AAATAGATAA GACAGTATGG GATACTATAG CTTCCATAAT TAAGAACCCG GAGCTTTTCT ACCAGTATAT TACCAGCGAA AAGCTTGAAA CAGCCAGCAT TCCCCGGAGG CGCCTGGAGG AAGCGCGGAA GAGGCTAGAG CAGGTGCAAC GGGTGATCGA GCGGATTGAC CGGGCCTATT TTATCCTGGA GGCGTTGCCC GAAGAAGACT ACAAGCGTTA CCGGGCGGAA CAAGAAGGAG AGCTAGTAAG AATAGAAGAA GATATCAAGA GGCTTGAAGC CGTAATTAAC GCCCAGGAAC AAGTGCAAAA AGGCGTAGAA TTCCTGCGCC AGTACGCTGA AAACCTGGCA AGGTCGGTGG ATGAACTAAA CTTTTTTCAA AAGCAGAATA TCACCCGCGA GCTAGTGCAG AGGGTAAAAA TATATGCTGA CGGCAGCCTG GAGATAGAAG GGTACTTTAA CATCCCCTTT ACCGGGCCGG GTAAAAATGT TTCGGATCAT TGCGAAGAAC CTACAACATT AACCACACAG AAAAACTTAA AAGAACCACT TTCAAGTAGT GTTTTAACTT GCACACATCG AAAAAAAGGC CATCCAAAAG TTACTGAAGG AATTATCCAT CTGACGATTT TAGCTGTATA A
|
Protein sequence | MKAVIYARVS TQDQALGFSL ATQKELCEKK ARALGALEVE VVEDAYTGTE LNRPSLDYVR QLVAAGKVDL VVVYDPDRLS RNLTDLLILC REFDKAGVRL EFVNFDWQKT PQGMLFLQMR GAFAEFEHAL IKERTQRGKA KKAASGKIRC YAKPFGYDWD AERDTLVINP QEAAIVKQMF EWLTDPLEPL TPWQITQKLG QVYPQGPRGK GWVHSSVVRM LKNPVYTGRL RRKDEQPDWK PVLVPAIIDQ ETFMRAQEML ARSKRFNPKT TRRRFLLQRL LVCGECGRRL TVVTHNNPQR AKYSYYTCPG RYPTKFDDQG RVGRCGLPPW RTEEIDKTVW DTIASIIKNP ELFYQYITSE KLETASIPRR RLEEARKRLE QVQRVIERID RAYFILEALP EEDYKRYRAE QEGELVRIEE DIKRLEAVIN AQEQVQKGVE FLRQYAENLA RSVDELNFFQ KQNITRELVQ RVKIYADGSL EIEGYFNIPF TGPGKNVSDH CEEPTTLTTQ KNLKEPLSSS VLTCTHRKKG HPKVTEGIIH LTILAV
|
| |