Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_0004 |
Symbol | recF |
ID | 7978438 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 3017 |
End bp | 4141 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 644796953 |
Product | recombination protein F |
Protein accession | YP_002948212 |
Protein GI | 239825588 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1195] Recombinational DNA repair ATPase (RecF pathway) |
TIGRFAM ID | [TIGR00611] recF protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00000414234 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTTTTTAA CACATTTATC GCTAAAAAAT TATCGTAATT ATGAGAGTGA AACGATAGAG TTTGCCAATA ACGTAAATAT TATTTTAGGG GAGAATGCTC AAGGGAAAAC AAACATGATG GAGGCCATCT ATGTATTGGC GATGGCGAAG TCGCACAGGA CAACAAACGA CAAAGATCTC ATTCGCTGGG ATGAAGACTA TGCTAAAATA GAAGGAAAGG CAATGAAAAA AAATGGGGCG CTATCGCTTG AACTTATTAT TTCAAAAAAA GGAAAAAAAG CGAAATGCAA CCATATTGAA CAGCAGCGAT TAAGTCAATA TGTTGGTCAT TTGAATATCG TGATGTTTGC CCCGGAAGAT TTAAATTTAG TAAAAGGAAG CCCGCAAGTG AGACGGCGTT TTGTCGATAT GGAAATTGGG CAAGTATCCC CTGTTTATAT ACATGATTTG AGCCAATACC AAAAGCTTTT GCAGCAGCGA AACCATTATT TAAAAATGTT GCAAACGCGC GAGCAGCAAG ATGAAACAGT ACTAGATATC TTAACAGAAC AGCTCATTCC GCTGGCAGCT AAAATCACGC TAAAGCGGTA TGAATTTTTG CTGTTGCTGC AAAAATGGGC GGCGCCAATT CATCACGAAA TTAGCCGCGG ATTAGAAACA TTACAAATTC AATATCGACC TTCTGTTGAT GTATCAGAAA AGATAGAATT GTCTAGAATA ATAGAAGCAT ATAGTGAAAA GTTTGCTACA ATAAAAGAAA GAGAAATCCA GCGAGGAATG ACGCTTGCTG GTCCGCATCG TGATGATATT GCATTTAGCG TGAACGGAAA AGATGTCCAA ATATTTGGTT CTCAAGGGCA ACAGCGCACA ACAGCGTTAT CCATCAAGTT GGCAGAAATC GAATTGATTT TTTCCGAAAT TGGCGATTAC CCCATTCTTT TACTTGATGA TGTGCTATCT GAACTAGATG ACTTTCGGCA AACCCACCTT CTTGATACAA TTCGGAAAAA AGTGCAAACG TTTGTGACGA CAACAAGCAT TGAAGGAATC GAACATGACA TCATCAAGGA AGCAGCAATT TACAAAGTCC ATTCCGGTCA CATTACCGCC CCTCTTTGTG ATTAA
|
Protein sequence | MFLTHLSLKN YRNYESETIE FANNVNIILG ENAQGKTNMM EAIYVLAMAK SHRTTNDKDL IRWDEDYAKI EGKAMKKNGA LSLELIISKK GKKAKCNHIE QQRLSQYVGH LNIVMFAPED LNLVKGSPQV RRRFVDMEIG QVSPVYIHDL SQYQKLLQQR NHYLKMLQTR EQQDETVLDI LTEQLIPLAA KITLKRYEFL LLLQKWAAPI HHEISRGLET LQIQYRPSVD VSEKIELSRI IEAYSEKFAT IKEREIQRGM TLAGPHRDDI AFSVNGKDVQ IFGSQGQQRT TALSIKLAEI ELIFSEIGDY PILLLDDVLS ELDDFRQTHL LDTIRKKVQT FVTTTSIEGI EHDIIKEAAI YKVHSGHITA PLCD
|
| |