Gene Elen_0004 details

Gene Information

Locus tag: Elen_0004
Symbol:
ID: 8414279
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 4350
End bp: 5612
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 67%
IMG OID: 645022976
Product: DNA replication and repair protein RecF
Protein accession: YP_003180388
Protein GI: 257789782
COG category: [L] Replication, recombination and repair
COG ID: [COG1195] Recombinational DNA repair ATPase (RecF pathway)
TIGRFAM ID: [TIGR00611] recF protein
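
The coordinate and length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4350
end_bp = 5612

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not encode a residue.
codons = gene_length_bp // 3
protein_length_aa = codons - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.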


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000011529
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.000000444236
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGGATCTGG CCATCGCCCA TATCTCGTTT CTGAACTTCC GCAGCTACGA GGCGTTCGAT 
CTCGACGGCA TCGGTCCGCT CACGGTGCTC GTGGGTCCGA ACGCCGCGGG GAAGACGAAC
GTCGTCGAGG GAATAGGCCT GCTGACGGCG CAATCGTCGT TTCGACATGC GCCCGTCGAC
CAATTGGTGC GCGCCGGGGC TCCCTTCGCG CGCTTGACGG CCGACGTCAC CGACGGCAGC
CGTCAGCTTG AGCTGGCGGT ACAGATGGCC GAGGGCAAGA AGAAGCACCT GCTGAACGGG
AAGCCCAAGC GCACAGCCGA CCTGAAGGGC CTCGTACCGT CCGTGACGTT CACGCCCGAC
GACCTGGAGC TGGCGAAAGG GGCCATGTCG GTGCGCCGCG CCGCGCTGGA CGCGCTGGGC
TCGCAGCTGT CGGCCAACCA CTACCTCATC CGGCGCGATT ACGAGAAGGT GCTGCGCCAC
AAGAACCGCC TGCTGAAGGA CGAGGCGCCG GCTGCGCTGG TGGGGGCGAT GAACGAGACG
CTGGTCACGT GCGGCGCGCA GCTTTCGTGC TACCGCGCCG CGCTGTTCGA GAAGCTGGCG
GCTTCGATGG CGTCGTACTA CGCTGAGATC ACCGACGGGC ATGAGCGGCT GGACGCGGGC
TTCGTCCCTT CGTGGGAAGA GCACGATCCC CTTTCGTTCG CCACGCGCAC GTTCGGGCGC
GACGAGGCGC GCGAGGCGCT GGCGGATGCG CTTGCGCGCC GCGGCGGCGA GGAGCGCGTG
CGAAAGCGGG CGCTCGTGGG CCCGCATGCC GACCGCATCG AGTTCTTCAT CGACGGCAAG
AACGCCGCCC TGTTCGGCAG CCAGGGGCAG CAGCGTTCGG TGGTGCTGGC GTTCAAGCTG
GCCGAGGCCA CGCTGATCCA GGACATTTTG CGCCAAAAGC CGGTGCTGCT GCTGGACGAC
GTGATGAGCG AGCTGGACGC CGCGCGCCGC CGCGCGCTGG TGGCGTTCAT CTCGGGCGAC
ATCCAGACGT TCATCACCAC GACGAACCTC GCCTACTTCG ACGACGACCT GCTGGGCGGC
GCTCGCATCG TAGAGCTGGA GAAACCGCGC AAAACCGGCG AAATGCCTGA CGAGGATCGA
ATGTTTCACG TGAAACATTC GCCGATCGAC GCGCAACCGG CGCGCGGGGA GGCCGAAGAG
CGCGACGGGC AGCCGTCCGA TCCGGCCGAA CAACCTTCGG CGCGAATCGA GGGGGACTCA
TGA
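
A GC content figure such as the one in the record is usually the share of G and C bases in the nucleotide sequence. The following is a minimal sketch of that calculation, assuming the sequence is pasted in the space-separated, ten-base layout used above. The function name `gc_content` is hypothetical, and this page does not state how its own value was computed or rounded.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide string."""
    # Drop the spaces and line breaks that are only there for display.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Example on the first ten bases of the gene sequence above.
first_block = "GTGGATCTGG"
print(f"{gc_content(first_block):.0f}%")  # prints "60%"
```

Passing the full gene sequence, with its spaces and line breaks, to the same function gives the value for the whole gene.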
 
Protein sequence
MDLAIAHISF LNFRSYEAFD LDGIGPLTVL VGPNAAGKTN VVEGIGLLTA QSSFRHAPVD 
QLVRAGAPFA RLTADVTDGS RQLELAVQMA EGKKKHLLNG KPKRTADLKG LVPSVTFTPD
DLELAKGAMS VRRAALDALG SQLSANHYLI RRDYEKVLRH KNRLLKDEAP AALVGAMNET
LVTCGAQLSC YRAALFEKLA ASMASYYAEI TDGHERLDAG FVPSWEEHDP LSFATRTFGR
DEAREALADA LARRGGEERV RKRALVGPHA DRIEFFIDGK NAALFGSQGQ QRSVVLAFKL
AEATLIQDIL RQKPVLLLDD VMSELDAARR RALVAFISGD IQTFITTTNL AYFDDDLLGG
ARIVELEKPR KTGEMPDEDR MFHVKHSPID AQPARGEAEE RDGQPSDPAE QPSARIEGDS