Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1050 |
Symbol | |
ID | 4811348 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1253935 |
End bp | 1254984 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640106472 |
Product | recA protein |
Protein accession | YP_001037475 |
Protein GI | 125973565 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0468] RecA/RadA recombinase |
TIGRFAM ID | [TIGR02012] protein RecA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000000195685 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTGAGA AGAAAAAAGC TCTGGAGATG GTTTTGGGAC AAATCGAGAA GCAGTTTGGC AAAGGGGCGG TTATGAAGCT GGGTGAAAAC ACCCACATGA ATATAGAGAC CATTCCCACC GGTGCTTTAG GGCTTGATAT AGCTTTGGGA GTCGGAGGAG TGCCGAGAGG AAGAATAGTT GAAATATTTG GGCCTGAATC ATCGGGTAAG ACGACGGTCG CTCTGCATAT TATCGCCGAG GCGCAAAAAG CAGGCGGCGA GGCGGCTTTC ATAGATGCGG AGCATGCTTT GGACCCTGTG TATGCCAAGA ACCTGGGAGT TGATATAGAA AATCTTATTG TTTCCCAGCC GGATACCGGA GAACAGGCTT TGGAAATTGC AGAGGCACTT GTAAGAAGCG GAGCGATTGA CGTTATTGTA ATTGACTCGG TTGCGGCTTT GGTTCCCAAA GCGGAAATAG ACGGTGAAAT GGGAGATTCC CATGTAGGTC TTCAGGCAAG GCTTATGTCC CAGGCATTAA GGAAACTTGC CGGTGTTATC AACAAATCCA GGACGACGGC AATATTTATT AACCAACTCA GAGAAAAGGT AGGAGTTATG TTTGGCAATC CTGAAACCAC GCCGGGCGGA CGGGCTTTGA AATTCTATGC TTCGGTAAGA CTCGATGTAA GAAGGCTTGA GTCCATCAAA CAGGGAAATG AGGTAATAGG AAGCAGGACA AAGGTCAAAG TTGTTAAGAA CAAAGTTGCT CCTCCTTTTA AAGAAGCAGA GTTTGATATT ATTTACGGCC AGGGTATTTC AAAGGAAGGC AATATTCTCG ATGTTGCGGT AAATCTCGAT ATTGTAAACA AAAGCGGCGC GTGGTTCTCC TACAACGGTC AGAAGATTGG CCAGGGAAGG GAGAATGCAA AGCAATTCCT GAAGGAAAAT CCTGAAATAG CGAAGGAAAT TGAGACAAAG ATTAGAGAAA ACTATAACCA GACATTTCTA ATGAGTATTA ATGCCCAGGT TGATACTGAT GATGAAGTGG ATTTGGAAGA TGAAGAATAA
|
Protein sequence | MIEKKKALEM VLGQIEKQFG KGAVMKLGEN THMNIETIPT GALGLDIALG VGGVPRGRIV EIFGPESSGK TTVALHIIAE AQKAGGEAAF IDAEHALDPV YAKNLGVDIE NLIVSQPDTG EQALEIAEAL VRSGAIDVIV IDSVAALVPK AEIDGEMGDS HVGLQARLMS QALRKLAGVI NKSRTTAIFI NQLREKVGVM FGNPETTPGG RALKFYASVR LDVRRLESIK QGNEVIGSRT KVKVVKNKVA PPFKEAEFDI IYGQGISKEG NILDVAVNLD IVNKSGAWFS YNGQKIGQGR ENAKQFLKEN PEIAKEIETK IRENYNQTFL MSINAQVDTD DEVDLEDEE
|
| |