Gene Moth_1796 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1796 
Symbol 
ID3832392 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1850618 
End bp1851790 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content53% 
IMG OID637829724 
ProductPhage integrase 
Protein accessionYP_430640 
Protein GI83590631 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAAAAA GGAGAGGCCA CGGGGAGGGT ACGGTTTACC AGCGGCCTGA CGGGCGATGG 
ACGGCGCAAA TAACTGTAGG CATTGACCAT GAGGGGAAGC AGAGAAGGTT AACCCTTTAC
GGTAAGAGCA AGAAGGAGGT CCTGGAAAAA CTGACCCAGG CCCTTTACCA GCAGCAAAAC
GGCGGCTTTA TTGAGCCGTC GAAGATTACC GTCGAGCAAT GGCTAAATCG CTGGCTTACT
GATTATGCAA AGCCTCATTT GCGGCAGAGC ACATGGGAAA GCTACGAAAC GGTTTTAAGG
CTCCACGTTA TTCCTACTCT GGGCAGTATT CCGCTGAAGA AACTCCAACC GGCTGATATA
CAACGGCTTT ATGCGTCTAA ACTCGAAAGC GGCTTATCGC CGACCAGGGT ACGCTATATC
CACGTTGTTC TCCATGAGGC CATGAGCCAG GCAAGGGAGA GTGGGCTGCT TTTGCAGAAC
CCGACCGAGG CCGCTAAACC GCCACGGCAC CCGAAGAAGA AGGTCCAACC GTTAAACCCG
GAGCAGGTAA AACGATTCCT GGAAACAGCT AAACAGGACC CCCTTTACCC GGCGTTTTTG
CTGGCCCTGG GGACGGGCCT GCGGCGGGGT GAAATATTAG GTTTGCGCTG GCAAGACCTC
GATTTGCAGA AAGGCATCCT TCAGGTACGC CAATCCTTGA TACGCACAAG GGAAGGGCTT
AAATTCGAGG AGCCGAAAAC TGAAAAGAGC CGGAGGCAAA TACCCCTGCC GCCGAGTGTA
GTCGCTGCAT TGAAACGACA TAAAGCCTGG GTTAACCAGA ATAAACTCAT TTTAGGTCCT
GACTACGAAG ACCACGACTT GGTTTTCCCA GTGGAAAACG GCAGGCCTCG TGATCCTAAA
GGGTTTGCCG AGTACTTCAA CCGCTTGCTG GATAAGGCTG GCCTGCCGCA TATCCGGCTG
CATGACCTGC GGCACACCCA CGCTACTTTA CTCCTTCTTG AGGGTGTTCA TCCCAAAGTC
GTTCAAGAAA GATTGGGCCA CTCCACGGTT AGCATAACCC TGGACATTTA TTCCCATATC
CTGCCGGGGC TACAGGAGAA GGCGGCTGAA AGGATTGACG GTCTGCTGCA ACCAAAAGAA
AACCCTTCCC CAAAGGAAGG GACCATAAAG TAA
 
Protein sequence
MPKRRGHGEG TVYQRPDGRW TAQITVGIDH EGKQRRLTLY GKSKKEVLEK LTQALYQQQN 
GGFIEPSKIT VEQWLNRWLT DYAKPHLRQS TWESYETVLR LHVIPTLGSI PLKKLQPADI
QRLYASKLES GLSPTRVRYI HVVLHEAMSQ ARESGLLLQN PTEAAKPPRH PKKKVQPLNP
EQVKRFLETA KQDPLYPAFL LALGTGLRRG EILGLRWQDL DLQKGILQVR QSLIRTREGL
KFEEPKTEKS RRQIPLPPSV VAALKRHKAW VNQNKLILGP DYEDHDLVFP VENGRPRDPK
GFAEYFNRLL DKAGLPHIRL HDLRHTHATL LLLEGVHPKV VQERLGHSTV SITLDIYSHI
LPGLQEKAAE RIDGLLQPKE NPSPKEGTIK