Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0003 |
Symbol | |
ID | 4269534 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 2816 |
End bp | 3880 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638124730 |
Product | DNA replication and repair protein RecF |
Protein accession | YP_740852 |
Protein GI | 114319169 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1195] Recombinational DNA repair ATPase (RecF pathway) |
TIGRFAM ID | [TIGR00611] recF protein |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00180653 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.423581 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCACTGG CGGCCGAAGG GGTGCGCAAC CTCCAGCCGT TTGAGCTTAC GCCTGGCGCC GGAATCAACG TGGTGGTGGG TGCCAACGCC GCGGGGAAGA CGAGCCTGTT GGAGGCCATC TACTTCGTGG CGCGGACCCG GTCCTTCCGC GCCACGCGGA CGGCGCAAAT GATCGGGAAC GGTCATGAGG CCTTGTGGGT TCGGGCTCAG ACTCAGGGGC ATACCATCGG GGTCGCCCGG GACAGCCAGG AGACGCAGGT CCGTCTGGAC GGTCGGGACG GGCGAAGCCT TTCCGAACTG GCCCGCTACC TACCGGTGCA GGTCATCAAT AGTGAGCACC AGCGGTTATT GCTGGACGGC CCTGCCGTGC GGCGCAGTTT TTTGAACTGG GCGGTGTTCC ACGTGGAACC ACAGTTCTCA ACTGTGTGGG GCCGTTACGT CCGCGCCCTC CGGCAGCGCA ATGCTGCCCT CAAGGCGGGC GAGTCCCGCT TGGCGTGGGC CTACGACGAG GGCCTCATCG AAACGGCCGA CACCATCGAC CGTAACCGGC GCCACCTCAT CGACGCCCTG GAGCCGCGCT GGTCGGCCTT GGTCCGCCGC TGGCTGCCGG ACGAACCGGT GGCCCTCCAT TACCGCCCGG GCTGGCGGAG CGACGAGCCG CTCGCTGATC GCCTTGAAGC CCAGCGGGAG TTGGATCGTC AGCGGGGGTT CACCAACAGT GGGCCGCATC GCGCCGATCT GAGTTTCCGC GTGGCCGGTG TCGAGGCCCA GCACCGGCTC TCGCGCGGAC AGCAAAAGCT TCTGGTGCTC GCCCTGCTGC TCGCCCAGGC AGCGGTTACC CACACCCTGA CCGGCCAGTC GCTCACCTTG TTGGTGGATG ACCTCGCTGC GGAGCTCGAC CCGGCGCGGC GTGCCGCCGT GGTGGAGGCG ATCGCCTCCA GTGGCAATCA GGCCTTCCTG ACCGCCATTG AGCCGGGCGA TATCCCCCTC GCCCCCGACG CCGCGCAATG GTTCCACGTG GAACAGGGCC GTATTCACTC CGGCCCGCCG CCGGCGGCCG GCTGA
|
Protein sequence | MSLAAEGVRN LQPFELTPGA GINVVVGANA AGKTSLLEAI YFVARTRSFR ATRTAQMIGN GHEALWVRAQ TQGHTIGVAR DSQETQVRLD GRDGRSLSEL ARYLPVQVIN SEHQRLLLDG PAVRRSFLNW AVFHVEPQFS TVWGRYVRAL RQRNAALKAG ESRLAWAYDE GLIETADTID RNRRHLIDAL EPRWSALVRR WLPDEPVALH YRPGWRSDEP LADRLEAQRE LDRQRGFTNS GPHRADLSFR VAGVEAQHRL SRGQQKLLVL ALLLAQAAVT HTLTGQSLTL LVDDLAAELD PARRAAVVEA IASSGNQAFL TAIEPGDIPL APDAAQWFHV EQGRIHSGPP PAAG
|
| |