Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0004 |
Symbol | |
ID | 7407239 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 2761 |
End bp | 3822 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 643714418 |
Product | DNA replication and repair protein RecF |
Protein accession | YP_002571943 |
Protein GI | 222528061 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1195] Recombinational DNA repair ATPase (RecF pathway) |
TIGRFAM ID | [TIGR00611] recF protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000383314 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGATAA AAAGCATTTA CGTTGAAAAT TTCAGGGGTT ACAAACAAAG GTTTTTTGAG TTCAAAGATA AAATGAATTT AATTGTCGGT AATAATGCCT CAGGAAAGAC TTCTCTTCTT GAGGCTCTAT ATTTTTGTAT GTGTGGAAAA TCTTTTAAAA GTCGAGATAT AGATGCAATA AACTTTGATT CTTTCTATTT CAAGCTTGAG ATGTTGGCCG AGGTTGGAGA CACTGAATAT AATGTTTTCT GTTATGTAGA TAAAGCCTTA GATAAGAGAA TAATGATAAA TGATAAGAAA ATAAAGAAAT TGTCTGAGCT GATAAGTACA TTTAAATTTG TCTTTTTTGA GCCAGATGCA ACAGAACTTA TAAAACATCA GCCAAAACTG AGGCGCAGAT TTTTAGATAT GGAAGTTACA AAGCTTTACC CTTACATGAC AAAAGTTTAT TCAGAGTACC ACAGAGCACT TCTTTCACGA AACGCGTTTT TGAAAAGTTA TGATAAAAAG GATATAATAG ATGTGTACGA TATGCAAATA AGTCAGCTTG GATTTTTAAT TTTTCAAAAA CGACAGGAGG TTATAAATAA ACTATCCATT GAAGCGCAAA AGATATTTAG TTTAGTGTTT GAAAACAAAT CAATGCTTGA ACTTAAATAT ATGCCATCAA TTGCTGCTTC AACCGAGAAA GAATATTACA AAGAGATAAA AAAGAATATT GAGAAAGATT TGAGTCTTGG ATATACAACA AAAGGTGTTC ACAGAGATGA CTTTGAGATT TTGATAGATA AAAAACCGGC AATAAATTTT GCATCTGAAG GACAGATAAA ACTTGCGGCA GTCTCAGTTG TGCTTGCAAC TTCTCTTCTT TATTCGGAGC CTGTGCTAAT TTTAGACGAT GTGTTTTCTG AGCTTGATAG TTTTAAAAGA AAAAATCTTG TGAAATTTAT AAGCCAATAC CAGTCGTTTG TGACATCTGC AGAAGATTTA AGTGTTCTTG AAAGAGAAGA AATACTGGAG TTTGGCAGTG CAAACTTGAT TTTTCTTGAA AGAAGTATGT AA
|
Protein sequence | MKIKSIYVEN FRGYKQRFFE FKDKMNLIVG NNASGKTSLL EALYFCMCGK SFKSRDIDAI NFDSFYFKLE MLAEVGDTEY NVFCYVDKAL DKRIMINDKK IKKLSELIST FKFVFFEPDA TELIKHQPKL RRRFLDMEVT KLYPYMTKVY SEYHRALLSR NAFLKSYDKK DIIDVYDMQI SQLGFLIFQK RQEVINKLSI EAQKIFSLVF ENKSMLELKY MPSIAASTEK EYYKEIKKNI EKDLSLGYTT KGVHRDDFEI LIDKKPAINF ASEGQIKLAA VSVVLATSLL YSEPVLILDD VFSELDSFKR KNLVKFISQY QSFVTSAEDL SVLEREEILE FGSANLIFLE RSM
|
| |