Gene Information |
Locus tag | Moth_0005 |
Symbol | |
ID | 3831877 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 3229 |
End bp | 4341 |
Gene Length | 1113 bp |
Protein Length | 370 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637827932 |
Product | DNA replication and repair protein RecF |
Protein accession | YP_428888 |
Protein GI | 83588879 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1195] Recombinational DNA repair ATPase (RecF pathway) |
TIGRFAM ID | [TIGR00611] recF protein |
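
The length fields above follow from the coordinates. With 1-based inclusive positions, the gene length is End bp minus Start bp plus one, and the protein length is the number of codons minus the terminal stop codon. Below is a minimal Python sketch of that arithmetic. The function names `gene_length` and `protein_length` are illustrative only and do not belong to any database API, and the inclusive-coordinate convention is an assumption that happens to agree with the values in this record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len: int, has_stop_codon: bool = True) -> int:
    """Residue count for an unspliced CDS; the stop codon encodes no residue."""
    codons, remainder = divmod(gene_len, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


# Values taken from this record (Start bp 3229, End bp 4341).
length = gene_length(3229, 4341)
print(length)                  # 1113
print(protein_length(length))  # 370
```

Both results match the Gene Length (1113 bp) and Protein Length (370 aa) fields.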
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000000369508 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000000145417 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | GTGCCTGCTT TAAGCTTACA ACAGCTCCAG CTGATTAATT TCCGCAGTTA TAAATGCCTT ACCTGGGATT GCCGGCCCGG GCTAAATATT ATTTTTGGGC CCAATGCTGC CGGTAAAACT AATCTCCTGG AAGCCATTGG TTACCTGGCC CTCGCCCGAT CCTTCAGGCA GCAACAGGAC CAGCAATTGT TGACCTGGGG AGCAAGCTCC TTCCAGGTGC GAGGATTATG CCACAGTAAT GGTGAAAAAA TTGAGCTGGT AATTAACTAC CAGCAGCACA ATAAAAGGTT GACAATTAAC GGCAACCGCA ACCGCTTAAT CGAACTCCTG GGTATATTCC CCGTCATTTA CTTCGGACCT GATGACCTGC ACCTCCTTAA GGGCGGCCCG GCCTACCGGC GCCATTTCCT GGATCGGGAG ATCAGTATGG GCGATCGCCT TTACTGCCGC AATCTGCAAG ATTACCGGCG CATTCTCTTC CAGCGTAATC TTTTGTTGCG GGCCATTAAG GCCGGCCGGG GGAAGGAGGG GGAGCTGGAA CCCTGGGATA TCCAGTTATT AGCAACAGGT AAGGCTATCT GCGAAAAGCG ATCTTGTTTT TTACAATCTT TGGCACCGAG GGTTGCCGCT ACCTACCGGG ATATGGCCGG GGGGGAAGAA CTGGCCCTCA TTTATCGGCC CGGGGTTGCC AGCCAGGAAG AGTGGGCGGA AAGGCTAAAG GTCGGCCGCG AAAGGGAGGT CCAGGCCGGT ATGACCCTCT GGGGTCCCCA CCGGGATGAC TTTACCTTTA CCCTGGACGG TCACGAGGCC CGTTATTTTG CCTCCCAGGG CCAGCAAAGA GCTATCGTTC TGGCCTTGAA ACTGGCTGAG GCCCGGTATT ACCGGGAACT TTTGCATGTA ATGCCAGTTT TGCTCCTGGA TGACGTTTTT TCCGAACTTG ATGAAGCGCA CCAGGGGGCG TTACTGGAGT TACTGGCAGG GGCCGACCAG GCTTTTTTAA CGACCACGGA GGTCGGCTTA TTACCAGCCA GGCTTATACA ACGCTCCCAT CTCTGGGAAT TAGCCCGGGG AAGGGAACCC CGGCTCACCT CCGGGCCTGT TGAGGCACAG TAA
Protein sequence | MPALSLQQLQ LINFRSYKCL TWDCRPGLNI IFGPNAAGKT NLLEAIGYLA LARSFRQQQD QQLLTWGASS FQVRGLCHSN GEKIELVINY QQHNKRLTIN GNRNRLIELL GIFPVIYFGP DDLHLLKGGP AYRRHFLDRE ISMGDRLYCR NLQDYRRILF QRNLLLRAIK AGRGKEGELE PWDIQLLATG KAICEKRSCF LQSLAPRVAA TYRDMAGGEE LALIYRPGVA SQEEWAERLK VGREREVQAG MTLWGPHRDD FTFTLDGHEA RYFASQGQQR AIVLALKLAE ARYYRELLHV MPVLLLDDVF SELDEAHQGA LLELLAGADQ AFLTTTEVGL LPARLIQRSH LWELARGREP RLTSGPVEAQ
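
The gene sequence begins with GTG while the protein begins with M. Under translation table 11, GTG is an accepted start codon and is read as methionine at initiation, so the two are consistent. The sketch below shows how the sequence field could be cleaned and sanity-checked in Python. It is an illustration under stated assumptions, not the database's own procedure: the helper names (`clean_sequence`, `gc_fraction`, `has_valid_ends`) are hypothetical, and the codon sets are the table 11 start and stop codons as commonly listed by NCBI.

```python
# Start and stop codons for translation table 11 (assumed from the NCBI listing).
TABLE11_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}
TABLE11_STOPS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Drop the whitespace that groups the sequence into 10-base blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def has_valid_ends(raw: str) -> bool:
    """True if the length is a multiple of 3 and the CDS starts and stops on table 11 codons."""
    seq = clean_sequence(raw)
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in TABLE11_STARTS
        and seq[-3:] in TABLE11_STOPS
    )


# Toy inputs built from the first block and the last codon of the gene sequence above.
print(clean_sequence("GTGCCTGCTT TAAGCTTACA"))  # GTGCCTGCTTTAAGCTTACA
print(gc_fraction("GTGCCTGCTT"))                # 0.6 (first 10 bases only, not the whole gene)
print(has_valid_ends("GTG CCT TAA"))            # True
```

Passing the full Gene sequence field to `gc_fraction` would give the whole-gene value to compare against the GC content field.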