Gene Elen_0801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_0801 
Symbol 
ID8415091 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp999484 
End bp1001178 
Gene Length1695 bp 
Protein Length564 aa 
Translation table11 
GC content49% 
IMG OID645023767 
ProductSMC domain protein 
Protein accessionYP_003181164 
Protein GI257790558 
COG category[L] Replication, recombination and repair 
COG ID[COG3593] Predicted ATP-dependent endonuclease of the OLD family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.779455 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATAG ATAAACTGAC AATTAAAAAC TATAGGAGCG TCCGTGATTT GGAACTCAGC 
CTGTCGCCGC GCATCAATGT CTTCATTGGG GCAAATAACG TTGGCAAGAG CAATATCCTC
TCTGCAATGG AATATCTGCT GGGTCCGTCC TATCCAACAG CCAATCGGCT TGAGCGGTGG
GACTTCTACC AGGGCGATGA GGAGCTTCCC CTCAAAATAG CCCTTGATTT CGATGACGGG
GCTCACCTTT CATTCGATTC AACCTGGCAC GATGGTTATG GAAGAGAAAA ACACGGCTTA
AATTACAACG GTAGCTACAT TTCGGATGAA GTGCGTAGTC GCTATATTTC AGCTTCGATT
GGGCCCGACA GGCGCGTTCT CGACAATCCG GCGTCCAGCC AATGGAGCCT GTTGGGCAGG
ATGCTCAAGG AATTCAACGA ACGTCTTAGC GAGGAGACGA TCTCGTCTGC CGATGGGCAT
ACGGTCACCA AAGCCGAAGC GTTTAAACAG AGCATGCAGG AGATTCGTGA TCAAATACTC
TTCTCCATTA CCGACCAAGA CGGTACGAAC CTTATGGGCG AGCTTAGCCG CATTATGCAG
CAGGAAACTG CGAATCAGCT CAATTGCTCG CCTAACGATT TGACTGTCGA CTTGAATGCC
TACGACCCGT GGAACCTGTA CAAAACACTG CAGATTTTCG TGACCGAGCA GGAGACCGGT
GTTCAGATGC GGGCATCTGA CATGGGCATG GGGGTGCAGG CAAGCCTCAC TATAGCTATC
CTCCGTGCCT ATTCGAAGCT CAAGTTGAAG AACCAAACGC CGCTGTTTAT CGACGAGCCA
GAACTGTATT TACATCCTCA GGCAAGGCGA AAGTTTTATC GCGTGATTGA AGAGCTCGCA
GATTCGGGAA CCCAGATATT CCTTACGACT CATTCCACTG AGTTCATTGA TCTGGGCAAC
TTTGATCAGA TATACCTTGT GCGCAAGAAC GCCGAGCGAG GGACCTATGT TAGAAAAGCA
GATCCCCAGA GTTTTGTAGA TGACCTACAA AACAGGCTCA ATATAAGAAC GGACGCAAAC
AGATTGATGC TCGAATACCG CAATGCTTTC GAGAACACGG GCGACTCTCA AAAAGCTGCC
GAAGGCCTTT TCGCTTCGAA GGTGTTACTA GTCGAGGGAG AGAGTGAGTC GCTTATCCTG
CCGTTTTGCT TCGATAGGAT AGGCTTCGAC TACGATGGAA AAGGCATCTC CATAGTACGC
TGCGGCGGCA AAAATGAGCT TGACCGTTTC TATCGCTTAT ACAGCGAATT CGGCATCCCT
TGCTTCATCC TTTTCGACGG GGACTTTCAG AATTTCCAAA CCGAAGATCA AGCACACACC
ATTAAAGCCA ATAAGAGCAT CCTTTCGCTC TTCGGTTGCT TGGACGATTT CCCTGACGGA
AATGTGCATG AGTCATATTT TGGTTTCCGG ACGCTACTCG AGGATAATCT GGGGCTCAAC
GGTATTGGCT CAAAAACGAA AGGCCTTCGG CTGTTCGTTA GGTTCAAGAA TGCCGTTTCC
CGCGAGGAAG CAGCTGTTCC GTTCTGGGTT AAAGAGATTG CCGACAAGCT TGACGGTTTG
CCTAACGAGG CGCGCTCCGT CCTAACTTGC AAATGTGAAC CCCTTGCATG GGATGATGAC
TACATCCCTT TTTAG
 
Protein sequence
MKIDKLTIKN YRSVRDLELS LSPRINVFIG ANNVGKSNIL SAMEYLLGPS YPTANRLERW 
DFYQGDEELP LKIALDFDDG AHLSFDSTWH DGYGREKHGL NYNGSYISDE VRSRYISASI
GPDRRVLDNP ASSQWSLLGR MLKEFNERLS EETISSADGH TVTKAEAFKQ SMQEIRDQIL
FSITDQDGTN LMGELSRIMQ QETANQLNCS PNDLTVDLNA YDPWNLYKTL QIFVTEQETG
VQMRASDMGM GVQASLTIAI LRAYSKLKLK NQTPLFIDEP ELYLHPQARR KFYRVIEELA
DSGTQIFLTT HSTEFIDLGN FDQIYLVRKN AERGTYVRKA DPQSFVDDLQ NRLNIRTDAN
RLMLEYRNAF ENTGDSQKAA EGLFASKVLL VEGESESLIL PFCFDRIGFD YDGKGISIVR
CGGKNELDRF YRLYSEFGIP CFILFDGDFQ NFQTEDQAHT IKANKSILSL FGCLDDFPDG
NVHESYFGFR TLLEDNLGLN GIGSKTKGLR LFVRFKNAVS REEAAVPFWV KEIADKLDGL
PNEARSVLTC KCEPLAWDDD YIPF