Gene Moth_1079 details


Gene Information

Locus tag: Moth_1079
Symbol:
ID: 3833192
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1109977
End bp: 1111023
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 57%
IMG OID: 637829007
Product: recA protein
Protein accession: YP_429936
Protein GI: 83589927
COG category: [L] Replication, recombination and repair
COG ID: [COG0468] RecA/RadA recombinase
TIGRFAM ID: [TIGR02012] protein RecA


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAACGCG TGGTCATAAA TGAAAAACAA CGGGCTCTGG AAATGGCCCT GAGCCAGATT 
GAGCGCCATT TCGGTAAAGG TTCCATCATG CGGCTGGGTG AAACCGGCGC CCGCCTCAAT
GTTGAGGCCA TCTCCACCGG AGCCCTGCCC CTGGATCTGG CCCTGGGAGT AGGGGGGTTG
CCCCGGGGGC GGGTAATAGA GATCTTTGGC CCGGAATCCT CCGGTAAAAC TACTGTAGCC
CTACATGTCA TTGCCGAAGC CCAGCGGGCC GGGGGTACAG CAGCCTTTAT CGATGCCGAA
CATGCCCTGG ACCCAGTCTA CGCCCACAAC CTGGGAGTAG ATACGGATAA CCTGCTGGTG
TCCCAGCCCG ATACCGGGGA ACAGGCCCTG GAGATAGCTG AAGCCTTGGT ACGCAGCGGG
GCTATTGACG TTATCGTCAT CGACTCGGTG GCCGCCCTGG TACCCCGGGC CGAACTGGAG
GGAGAGATGG GCGATGCCCA TGTAGGTCTC CAGGCGCGGT TAATGTCCCA GGCTTTGCGT
AAATTGGCAG GGATTATCTC CAAATCGCGG ACGGTGGCCA TTTTCATCAA CCAGCTACGG
GAAAAGGTGG GAGTCCTCTT CGGCAACCCT GAGACTACCC CCGGTGGCCG TGCCCTGAAG
TTTTATGCTT CCGTACGTCT GGATGTCCGT AAAGTAGAAC AGCTAAAAGC CGGGACAGAG
ATAGTCGGCA ATCGAACCAG GGTCAAGGTT GTTAAGAATA AGGTAGCACC ACCTTTTCGC
CAGGCCGAAT TTGACATTAT CTACGGCCGG GGAATCGACC GCGAGGGCTG CCTCCTGGAT
ATGGGGACTG AACTGGATAT CATTAAAAAG AGCGGTGCCT GGTATTCCCT GGGGGAAGAC
CGCCTGGGAC AGGGACGCGA AGCCGCCAAG GATTTCCTCC GAGAACACCC CGATCTGGCT
GCCGCCCTCG AGACCAAGAT CCGGGAAAAA GCAGGCTTAA TTAACTTTAC GGCCGGGAAA
GAAGATGCCA CTTCGGGGGA AGACTGA
 
Protein sequence
MQRVVINEKQ RALEMALSQI ERHFGKGSIM RLGETGARLN VEAISTGALP LDLALGVGGL 
PRGRVIEIFG PESSGKTTVA LHVIAEAQRA GGTAAFIDAE HALDPVYAHN LGVDTDNLLV
SQPDTGEQAL EIAEALVRSG AIDVIVIDSV AALVPRAELE GEMGDAHVGL QARLMSQALR
KLAGIISKSR TVAIFINQLR EKVGVLFGNP ETTPGGRALK FYASVRLDVR KVEQLKAGTE
IVGNRTRVKV VKNKVAPPFR QAEFDIIYGR GIDREGCLLD MGTELDIIKK SGAWYSLGED
RLGQGREAAK DFLREHPDLA AALETKIREK AGLINFTAGK EDATSGED
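
Consistency check (illustrative sketch)

The lengths in this record agree with each other: the coordinate span gives the gene length, and the gene length in codons, minus one stop codon, gives the protein length. The Python sketch below shows one way such a check might be written. It is not part of the database record. The function names are hypothetical, and it assumes inclusive 1-based coordinates and a CDS that ends in a single stop codon.

```python
def clean_sequence(text: str) -> str:
    """Strip the spaces and line breaks used in block-formatted sequences."""
    return "".join(text.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count minus one, assuming the CDS ends in a single stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # Values taken from the record above.
    gene_length = span_length(1109977, 1111023)
    print(gene_length)                           # 1047
    print(expected_protein_length(gene_length))  # 348

    # First two 10-base blocks of the gene sequence, as formatted in the record.
    fragment = clean_sequence("ATGCAACGCG TGGTCATAAA")
    print(len(fragment), gc_percent(fragment))
```

To compare against the reported GC content, the full gene sequence could be pasted in as a multi-line string and passed through clean_sequence before calling gc_percent.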