Gene Moth_1028 details


Gene Information

Locus tag: Moth_1028
Symbol:
ID: 3832648
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1058475
End bp: 1059416
Gene Length: 942 bp
Protein Length: 313 aa
Translation table: 11
GC content: 60%
IMG OID: 637828956
Product: tyrosine recombinase XerD subunit
Protein accession: YP_429885
Protein GI: 83589876
COG category: [L] Replication, recombination and repair
COG ID: [COG4974] Site-specific recombinase XerD
TIGRFAM ID: [TIGR02224] tyrosine recombinase XerC
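
The coordinates and lengths in this record agree with each other if the start and end positions are read as 1-based and inclusive, and if the coding sequence ends in a single stop codon. Both are assumptions, since the record does not state its coordinate convention. A minimal sketch of that arithmetic in Python follows; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1058475, 1059416)
print(length, protein_length(length))  # 942 313
```

The result does not depend on the strand, which this record leaves blank.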


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.00856798
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000000634873
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGCCCGGAG TTACCTTTGG GGAAGCCCTG GAGGGTTTTC TTTTGTATCT AAAAGGCGAA 
AGGCAGGCTT CGCCCTGTAC CGTCGATGCC TACCGGGCTG ATATCGAACA ATTCGCCGCC
TTCGTAGCAG GACGCGCGGG CCAGGAAGCA GGCCCTGCAG CAGTCGATAT CTGGATGGTG
CGGCGCTACT TGGGCTGGCT GAACCAGCTG GGCCAGCAGC GGTCAAGCAT GAACCGTAAA
CTGGCCGCGT TGCGTGCTTT TTATCGCTTC CTTCTACGGG CGGGGCAGGT ACAGAGCAGC
CCCGTCGCCC TGTTATCCGG CCCCCGCCGG GAGAAAAGAT TGCCCGGCTG TCTGAGCCAT
GCTGAAATGG AAAAACTCTT AAGTATCCCG GCGACTACTC CCCTGGGTTT GAGGGACCGA
GCTATTCTGG AGACGCTCTA CGCCTCCGGT ATCCGGGTGG CTGAACTGGT AGGCATGGAC
CAGGATGACC TGGATCTGGT AGCAGGTTAT GCCAGGGTCC TGGGTAAAGG CCGGCGGGAA
AGGGTGGTAC CCCTTGGTCG CTATGCTGTT AAGGCCCTGG AGAATTATTT ACATCGGGCC
CGTCCGGAAC TGGCCGCCCG GCGTATCCCT CCTGAACCCA GGGCCCTTTT CTTGAATCAC
CTGGGGGGGC GGTTAACAAC CCGGGGAGTC CGGGAACGCC TGAGCCACTA CGTAGAAAAG
GCCGCCCTGC GGAGGGGGGT TTCCCCCCAT ACTATCCGCC ACACCTTTGC TACCCACCTG
CTGGAGGGAG GGGCGGATCT GAGGGTGGTC CAGGAACTCC TGGGCCATAT CCGCCTGGCG
ACGACCCAGA TTTACACCCA CATCAGCCAG TCCCAGCTGC GTGAGGTTTA CCGCCAGTTC
CACCCGCGGG CCAGCCGTGA TAATATAGAT AATCGAAGGT GA
 
Protein sequence
MPGVTFGEAL EGFLLYLKGE RQASPCTVDA YRADIEQFAA FVAGRAGQEA GPAAVDIWMV 
RRYLGWLNQL GQQRSSMNRK LAALRAFYRF LLRAGQVQSS PVALLSGPRR EKRLPGCLSH
AEMEKLLSIP ATTPLGLRDR AILETLYASG IRVAELVGMD QDDLDLVAGY ARVLGKGRRE
RVVPLGRYAV KALENYLHRA RPELAARRIP PEPRALFLNH LGGRLTTRGV RERLSHYVEK
AALRRGVSPH TIRHTFATHL LEGGADLRVV QELLGHIRLA TTQIYTHISQ SQLREVYRQF
HPRASRDNID NRR
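
The protein sequence and GC content can be checked against the gene sequence. The sketch below is a plain-Python illustration rather than the method the database used. It assumes the sequence is given on the coding strand, strips the 10-base grouping, and uses the standard codon assignments, which translation table 11 shares for every codon (table 11 differs only in which codons may act as start codons). The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments, enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the whitespace grouping and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 1, stopping at the first stop."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence listed above.
first_row = "ATGCCCGGAG TTACCTTTGG GGAAGCCCTG GAGGGTTTTC TTTTGTATCT AAAAGGCGAA"
print(translate(first_row))  # MPGVTFGEALEGFLLYLKGE
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` extends the same check to the whole record.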