Gene Cthe_1050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1050 
Symbol 
ID4811348 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1253935 
End bp1254984 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content45% 
IMG OID640106472 
ProductrecA protein 
Protein accessionYP_001037475 
Protein GI125973565 
COG category[L] Replication, recombination and repair 
COG ID[COG0468] RecA/RadA recombinase 
TIGRFAM ID[TIGR02012] protein RecA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000195685 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTGAGA AGAAAAAAGC TCTGGAGATG GTTTTGGGAC AAATCGAGAA GCAGTTTGGC 
AAAGGGGCGG TTATGAAGCT GGGTGAAAAC ACCCACATGA ATATAGAGAC CATTCCCACC
GGTGCTTTAG GGCTTGATAT AGCTTTGGGA GTCGGAGGAG TGCCGAGAGG AAGAATAGTT
GAAATATTTG GGCCTGAATC ATCGGGTAAG ACGACGGTCG CTCTGCATAT TATCGCCGAG
GCGCAAAAAG CAGGCGGCGA GGCGGCTTTC ATAGATGCGG AGCATGCTTT GGACCCTGTG
TATGCCAAGA ACCTGGGAGT TGATATAGAA AATCTTATTG TTTCCCAGCC GGATACCGGA
GAACAGGCTT TGGAAATTGC AGAGGCACTT GTAAGAAGCG GAGCGATTGA CGTTATTGTA
ATTGACTCGG TTGCGGCTTT GGTTCCCAAA GCGGAAATAG ACGGTGAAAT GGGAGATTCC
CATGTAGGTC TTCAGGCAAG GCTTATGTCC CAGGCATTAA GGAAACTTGC CGGTGTTATC
AACAAATCCA GGACGACGGC AATATTTATT AACCAACTCA GAGAAAAGGT AGGAGTTATG
TTTGGCAATC CTGAAACCAC GCCGGGCGGA CGGGCTTTGA AATTCTATGC TTCGGTAAGA
CTCGATGTAA GAAGGCTTGA GTCCATCAAA CAGGGAAATG AGGTAATAGG AAGCAGGACA
AAGGTCAAAG TTGTTAAGAA CAAAGTTGCT CCTCCTTTTA AAGAAGCAGA GTTTGATATT
ATTTACGGCC AGGGTATTTC AAAGGAAGGC AATATTCTCG ATGTTGCGGT AAATCTCGAT
ATTGTAAACA AAAGCGGCGC GTGGTTCTCC TACAACGGTC AGAAGATTGG CCAGGGAAGG
GAGAATGCAA AGCAATTCCT GAAGGAAAAT CCTGAAATAG CGAAGGAAAT TGAGACAAAG
ATTAGAGAAA ACTATAACCA GACATTTCTA ATGAGTATTA ATGCCCAGGT TGATACTGAT
GATGAAGTGG ATTTGGAAGA TGAAGAATAA
 
Protein sequence
MIEKKKALEM VLGQIEKQFG KGAVMKLGEN THMNIETIPT GALGLDIALG VGGVPRGRIV 
EIFGPESSGK TTVALHIIAE AQKAGGEAAF IDAEHALDPV YAKNLGVDIE NLIVSQPDTG
EQALEIAEAL VRSGAIDVIV IDSVAALVPK AEIDGEMGDS HVGLQARLMS QALRKLAGVI
NKSRTTAIFI NQLREKVGVM FGNPETTPGG RALKFYASVR LDVRRLESIK QGNEVIGSRT
KVKVVKNKVA PPFKEAEFDI IYGQGISKEG NILDVAVNLD IVNKSGAWFS YNGQKIGQGR
ENAKQFLKEN PEIAKEIETK IRENYNQTFL MSINAQVDTD DEVDLEDEE