Gene Moth_1561 details

Gene Information

Locus tag: Moth_1561
Symbol:
ID: 3832194
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1603696
End bp: 1605336
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table: 11
GC content: 52%
IMG OID: 637829493
Product: resolvase-like protein
Protein accession: YP_430413
Protein GI: 83590404
COG category: [L] Replication, recombination and repair
COG ID: [COG1961] Site-specific recombinases, DNA invertase Pin homologs
TIGRFAM ID:
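
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that the start and end positions are 1-based and inclusive (which is consistent with the reported gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 1603696
end_bp = 1605336

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: an unspliced CDS whose final codon is a stop codon
# that does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1641 0 546
```

Under these assumptions the computed values agree with the Gene Length (1641 bp) and Protein Length (546 aa) fields.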


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.571827
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGCCG TGATATATGC TAGGGTAAGC ACCCAGGACC AGGCTCTCGG TTTTTCCCTG 
GCCACCCAGA AAGAACTCTG CGAGAAGAAG GCCAGGGCTT TAGGGGCTTT AGAAGTCGAA
GTTGTTGAAG ACGCTTACAC CGGGACGGAA TTGAACCGCC CTAGCCTGGA TTATGTGAGG
CAGCTCGTAG CTGCCGGGAA AGTGGACCTT GTAGTTGTAT ATGACCCCGA CAGGCTTTCC
CGCAACCTTA CAGATTTGCT CATCCTTTGC CGGGAATTTG ATAAGGCAGG AGTAAGGCTG
GAGTTTGTTA ATTTTGACTG GCAAAAGACG CCCCAGGGGA TGCTCTTCTT GCAGATGCGG
GGGGCCTTTG CCGAATTCGA GCATGCGCTC ATCAAGGAGC GTACCCAGAG GGGCAAGGCC
AAAAAAGCGG CCAGCGGCAA GATCCGCTGC TACGCCAAGC CTTTTGGTTA CGACTGGGAC
GCCGAGAGGG ATACCCTGGT AATTAATCCC CAGGAAGCCG CAATCGTCAA GCAGATGTTT
GAGTGGCTAA CGGATCCCCT GGAACCCTTA ACTCCCTGGC AGATTACCCA GAAGCTGGGC
CAGGTGTATC CCCAGGGTCC CAGGGGCAAG GGGTGGGTCC ACTCTTCGGT GGTACGGATG
CTGAAAAATC CCGTATATAC AGGCCGGTTG AGGCGCAAGG ATGAGCAGCC TGATTGGAAA
CCGGTCCTTG TGCCGGCCAT AATTGACCAG GAAACCTTTA TGAGAGCCCA GGAGATGCTT
GCCAGGTCCA AGCGTTTCAA CCCTAAGACT ACCAGGAGAA GATTCCTGTT GCAGCGGTTG
CTGGTATGCG GGGAGTGCGG GCGGAGGCTT ACAGTCGTCA CCCATAACAA TCCCCAGAGG
GCGAAGTATA GCTATTATAC CTGTCCCGGG CGTTACCCTA CCAAGTTTGA CGACCAGGGC
CGGGTAGGCA GATGCGGGCT CCCACCCTGG CGGACGGAAG AAATAGATAA GACAGTATGG
GATACTATAG CTTCCATAAT TAAGAACCCG GAGCTTTTCT ACCAGTATAT TACCAGCGAA
AAGCTTGAAA CAGCCAGCAT TCCCCGGAGG CGCCTGGAGG AAGCGCGGAA GAGGCTAGAG
CAGGTGCAAC GGGTGATCGA GCGGATTGAC CGGGCCTATT TTATCCTGGA GGCGTTGCCC
GAAGAAGACT ACAAGCGTTA CCGGGCGGAA CAAGAAGGAG AGCTAGTAAG AATAGAAGAA
GATATCAAGA GGCTTGAAGC CGTAATTAAC GCCCAGGAAC AAGTGCAAAA AGGCGTAGAA
TTCCTGCGCC AGTACGCTGA AAACCTGGCA AGGTCGGTGG ATGAACTAAA CTTTTTTCAA
AAGCAGAATA TCACCCGCGA GCTAGTGCAG AGGGTAAAAA TATATGCTGA CGGCAGCCTG
GAGATAGAAG GGTACTTTAA CATCCCCTTT ACCGGGCCGG GTAAAAATGT TTCGGATCAT
TGCGAAGAAC CTACAACATT AACCACACAG AAAAACTTAA AAGAACCACT TTCAAGTAGT
GTTTTAACTT GCACACATCG AAAAAAAGGC CATCCAAAAG TTACTGAAGG AATTATCCAT
CTGACGATTT TAGCTGTATA A
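
The gene sequence above is printed in space-separated blocks of ten bases. A minimal sketch of how such text could be cleaned and its GC percentage computed is given below. The function names `clean_sequence` and `gc_percent` are hypothetical, and the sketch does not claim to reproduce how the database derived or rounded its reported GC content of 52%.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Return the percentage of G and C bases in a blocked sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# Example on the first ten-base block of the gene sequence only.
first_block = "ATGAAAGCCG"
print(len(clean_sequence(first_block)), gc_percent(first_block))
```

To apply this to the whole gene, pass the full multi-line sequence text to `gc_percent`.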
 
Protein sequence
MKAVIYARVS TQDQALGFSL ATQKELCEKK ARALGALEVE VVEDAYTGTE LNRPSLDYVR 
QLVAAGKVDL VVVYDPDRLS RNLTDLLILC REFDKAGVRL EFVNFDWQKT PQGMLFLQMR
GAFAEFEHAL IKERTQRGKA KKAASGKIRC YAKPFGYDWD AERDTLVINP QEAAIVKQMF
EWLTDPLEPL TPWQITQKLG QVYPQGPRGK GWVHSSVVRM LKNPVYTGRL RRKDEQPDWK
PVLVPAIIDQ ETFMRAQEML ARSKRFNPKT TRRRFLLQRL LVCGECGRRL TVVTHNNPQR
AKYSYYTCPG RYPTKFDDQG RVGRCGLPPW RTEEIDKTVW DTIASIIKNP ELFYQYITSE
KLETASIPRR RLEEARKRLE QVQRVIERID RAYFILEALP EEDYKRYRAE QEGELVRIEE
DIKRLEAVIN AQEQVQKGVE FLRQYAENLA RSVDELNFFQ KQNITRELVQ RVKIYADGSL
EIEGYFNIPF TGPGKNVSDH CEEPTTLTTQ KNLKEPLSSS VLTCTHRKKG HPKVTEGIIH
LTILAV
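
The protein sequence can be related to the gene sequence by translating it codon by codon. The sketch below is an illustration, not the database's own method. It builds the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard genetic code (the two tables differ in which codons may act as start codons, which this sketch ignores). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a blocked DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" marks unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three ten-base blocks of the gene sequence.
print(translate("ATGAAAGCCG TGATATATGC TAGGGTAAGC"))  # prints: MKAVIYARVS
```

The output matches the first ten residues of the protein sequence above. Translating the final gene line, `CTGACGATTT TAGCTGTATA A`, in the same way yields `LTILAV`, the last six residues.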