Gene Information |
Locus tag | Elen_0004 |
Symbol | |
ID | 8414279 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 4350 |
End bp | 5612 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645022976 |
Product | DNA replication and repair protein RecF |
Protein accession | YP_003180388 |
Protein GI | 257789782 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1195] Recombinational DNA repair ATPase (RecF pathway) |
TIGRFAM ID | [TIGR00611] recF protein |
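The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the CDS length includes the terminal stop codon. Neither assumption is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(4350, 5612)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1263 420
```

Under these assumptions the computed values agree with the Gene Length (1263 bp) and Protein Length (420 aa) fields.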
| |
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000011529 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.000000444236 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGATCTGG CCATCGCCCA TATCTCGTTT CTGAACTTCC GCAGCTACGA GGCGTTCGAT CTCGACGGCA TCGGTCCGCT CACGGTGCTC GTGGGTCCGA ACGCCGCGGG GAAGACGAAC GTCGTCGAGG GAATAGGCCT GCTGACGGCG CAATCGTCGT TTCGACATGC GCCCGTCGAC CAATTGGTGC GCGCCGGGGC TCCCTTCGCG CGCTTGACGG CCGACGTCAC CGACGGCAGC CGTCAGCTTG AGCTGGCGGT ACAGATGGCC GAGGGCAAGA AGAAGCACCT GCTGAACGGG AAGCCCAAGC GCACAGCCGA CCTGAAGGGC CTCGTACCGT CCGTGACGTT CACGCCCGAC GACCTGGAGC TGGCGAAAGG GGCCATGTCG GTGCGCCGCG CCGCGCTGGA CGCGCTGGGC TCGCAGCTGT CGGCCAACCA CTACCTCATC CGGCGCGATT ACGAGAAGGT GCTGCGCCAC AAGAACCGCC TGCTGAAGGA CGAGGCGCCG GCTGCGCTGG TGGGGGCGAT GAACGAGACG CTGGTCACGT GCGGCGCGCA GCTTTCGTGC TACCGCGCCG CGCTGTTCGA GAAGCTGGCG GCTTCGATGG CGTCGTACTA CGCTGAGATC ACCGACGGGC ATGAGCGGCT GGACGCGGGC TTCGTCCCTT CGTGGGAAGA GCACGATCCC CTTTCGTTCG CCACGCGCAC GTTCGGGCGC GACGAGGCGC GCGAGGCGCT GGCGGATGCG CTTGCGCGCC GCGGCGGCGA GGAGCGCGTG CGAAAGCGGG CGCTCGTGGG CCCGCATGCC GACCGCATCG AGTTCTTCAT CGACGGCAAG AACGCCGCCC TGTTCGGCAG CCAGGGGCAG CAGCGTTCGG TGGTGCTGGC GTTCAAGCTG GCCGAGGCCA CGCTGATCCA GGACATTTTG CGCCAAAAGC CGGTGCTGCT GCTGGACGAC GTGATGAGCG AGCTGGACGC CGCGCGCCGC CGCGCGCTGG TGGCGTTCAT CTCGGGCGAC ATCCAGACGT TCATCACCAC GACGAACCTC GCCTACTTCG ACGACGACCT GCTGGGCGGC GCTCGCATCG TAGAGCTGGA GAAACCGCGC AAAACCGGCG AAATGCCTGA CGAGGATCGA ATGTTTCACG TGAAACATTC GCCGATCGAC GCGCAACCGG CGCGCGGGGA GGCCGAAGAG CGCGACGGGC AGCCGTCCGA TCCGGCCGAA CAACCTTCGG CGCGAATCGA GGGGGACTCA TGA
|
Protein sequence | MDLAIAHISF LNFRSYEAFD LDGIGPLTVL VGPNAAGKTN VVEGIGLLTA QSSFRHAPVD QLVRAGAPFA RLTADVTDGS RQLELAVQMA EGKKKHLLNG KPKRTADLKG LVPSVTFTPD DLELAKGAMS VRRAALDALG SQLSANHYLI RRDYEKVLRH KNRLLKDEAP AALVGAMNET LVTCGAQLSC YRAALFEKLA ASMASYYAEI TDGHERLDAG FVPSWEEHDP LSFATRTFGR DEAREALADA LARRGGEERV RKRALVGPHA DRIEFFIDGK NAALFGSQGQ QRSVVLAFKL AEATLIQDIL RQKPVLLLDD VMSELDAARR RALVAFISGD IQTFITTTNL AYFDDDLLGG ARIVELEKPR KTGEMPDEDR MFHVKHSPID AQPARGEAEE RDGQPSDPAE QPSARIEGDS
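The gene and protein sequences above can be related with a short translation sketch. It builds the standard codon assignments, which translation table 11 shares for elongation codons, and treats an initiating GTG or TTG as methionine. This is a simplified assumption about start-codon handling, not the portal's own pipeline. The helper names are hypothetical, and the spaces that group the sequence into blocks of ten are stripped before processing.

```python
from itertools import product

# Codon assignments in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: alternative bacterial start codons are read as Met at position 0.
ALT_STARTS = {"GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        codon = s[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in ALT_STARTS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
prefix = "GTGGATCTGG CCATCGCCCA TATCTCGTTT"
print(translate(prefix))  # MDLAIAHISF
```

The translated prefix matches the first ten residues of the protein sequence. Passing the full gene sequence to gc_fraction gives the value to compare with the GC content field.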