Gene Elen_0799 details

Gene Information

Locus tag: Elen_0799
Symbol:
ID: 8415089
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 995631
End bp: 998864
Gene Length: 3234 bp
Protein Length: 1077 aa
Translation table: 11
GC content: 64%
IMG OID: 645023765
Product: helicase domain protein
Protein accession: YP_003181162
Protein GI: 257790556
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG0553] Superfamily II DNA/RNA helicases, SNF2 family
TIGRFAM ID:
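
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. Both conventions are assumptions about this record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(995631, 998864)
print(length, protein_length_aa(length))  # prints: 3234 1077
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.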


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.356327
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCAACCCG ACTTCTTCGA CAACAGGGCG CGCATCGTCA AGGACGACCT TGTCGCCAAT 
ATAGCCGACG GCGATCGCGT GGCGATAGCC GCGTCGGTGT TCTCCATGTA CGCCTATCAG
GAACTCGCAA CCCAGCTTGA AGGCATCGAC GAGCTGCGCT TCGTCTTCAC GTCGCAGGCG
TTCACCAAGC AGCGCCCCCC GCGCGAGAAG CGCGAATTCT ACATCCCGCG GCTTTCCCGA
GAGCAGGGCC TTTGCGGCAC CGACTTCGAG ATCAAGCTGC GCAACGAGCT CACGCAGAAG
GCCGTCGCGA CCGAGTGCGC CGACTGGATC CGCCGCAAGG CGCGTTTCCG TTCCTTCGAG
GGCGAGGGGC GTATGAGCGG CTTTCTGAAC GTGGAAAAGA CCGACGACAA CGTGGCCTAC
ATGCCCTTCG ACGGCTTCAC CACGCGCAAG CTCGGCTGCG ACAACTCCGC GGAAGCTCCC
GATGTGACCA TGCGCCTGGA CGCCTCCCAG TCGCGCGCCC TGCTCAAGCA GTTCGACGAC
GCCTGGGACT CGGGCGAGCT CCACGACGTC ACCGACGCCG TCATCGACGG CATCACGGCA
ATGTATCAAG AAAACGCGCC AGAGCTCATC TACTACATGG CGCTCTACCG CATATTCAGC
GAGTTCCTGG ACGACGTCTC CGAGGACGTG CTGCCCAACG AGGGGCTGGG CTTCCGCGAC
AGCCTCATCT GGAACAAGCT CTACGACTTC CAGAAGGACG CGGCGCTCGC CATCATCAAC
AAGCTCGAGA CCTACAACGG CTGCATCCTG GCAGACTCCG TGGGCCTTGG CAAGACGTTC
ACCGCGCTCG CCGTGATCAA GTACTACGAG AGCCGCAACA AGGACGTGCT CGTGCTGTGC
CCCAAGAAGC TGCGCGACAA CTGGATAACC TACAACTCGA ACGTCGTGAA CAACCCCATC
GCGGGCGACC GCCTGCAGTA CGACGTGCTC TACCACACCG ACCTCTCGCG CACGCGAGGC
ACGTCGGAGA CTGGGCTGCC CCTCGACCGC CTGAACTGGG GCGCGTACGG GCTTGTCGTC
ATCGACGAGA GCCACAACTT CCGAAACGGC GGCGACTCGG CGTCGGAAGA CAAGATGAAC
CGCTACCAGC TGCTTATGGA GAAGGTCATC AAGCAGGGCG TGAAGACCAA GGTGCTGATG
CTCTCGGCCA CGCCCGTGAA CAACCGCTTC CGCGACCTGC GCAACCAGCT CGCCCTGGCA
TACTGGGGCG ACCCCACGGG CTGGTCGGAG AAGCTGCGCC TGGAGAACGA CATCGAGACG
GTGTTCCGCA ACGCGCAGAC CGTTTACGCC CGCTGGTCGA AGCTGCCCGG CGAGCAGCGG
ACGACCCGCG CCCTCACCGA CATGCTCGAC TACGACTTCT TCGAGGTGCT CGACCAGGTG
ACCGTGGCGC GCAGCCGCAA GCACATCCAG CGCTACTACG ACATGAGCGC CATCGGGCCC
TTCCCGAAGC GCCTGCCGCC GATATCGAAG CGGCCCAAGC TCTCGACGCT GGCGAACGCG
ATCAACTACC GCGAGATATA CGAGGAGCTG GACTCCCTCG CGCTCGCCGT GTACATGCCT
AGCTCGTACG TGCACCCCAG CAAGATGGAG AAGTACGCGA AGATGGGCGG CGGCGGAAAC
CTCACGCTGG GCGGTCGCGA GACGGGCGTG CGCCGCCTCA TGACGACGAA CCTGCTCAAG
CGCCTGGAGA GCTCGGTGTG CTCGTTCCGC CTGACGCTCG AACGCGTACT GGCGGCGATG
AGCGCCGCCC TCGAGACCAT AGACGACTAC CGCAGGGGCC TCGCCTCGGG CGCCAGCGTG
GCCGACGGCG ACCTGCCGGG CGGGTTCGAC TTCGACCCCG ACGACGAGAG CGGCTTCGAG
GTGGGCGGCG CCACGAAGAT CCTCGTCGAG GACATGGACT GGATGAGCTG GGAGCGCGAC
ATCGAGGCGG ACGTCGCCGT CATCGAGGTC CTGATCTCGA TGGTGCGCGA CATCGACCCC
GCCCACGACG CCAAGCTCAT CGAGCTGTGC GAGCAGATCC GCGAGAAGTC GCAAGGCCCC
ATCAACCCGG GCAACCGCAA GGTGCTCGTG TTCACCGCGT TCTCAGACAC GGCGGACTAC
CTCTACGAGA ACGTCTCGGC CTTCGCGAAG CGCGAGCTCG GCCTCGAGTG CGCCAAGGTC
ACGGGCGACG GCTGCGCCTG CACGATAAAG GCAGTGCCCC CGCACATGCA GGAGGTGCTC
GCCTGCTTCT CGCCCGCGTC GAAGGAGCGC GACGTGGTGG CGCCTCAGCT TTCCGGCTGC
GACGTGAACA TCGTTATCGC CACCGATTGC ATCTCGGAGG GCCAGAACCT CCAGGACTGC
GACTACCTGG TGAACTACGA CATCCACTGG AACCCCGTGC GCATCGTGCA GCGCTTCGGC
CGCGTTGACC GCATCGGCTC TAAGAATGAC CGCATCCAGC TGGTGAACTA CTGGCCGGAC
GTCGAGCTTG ACGAGTACAT CAAGCTCAAG GCGCGCGTCG AGGAGCGCAT GCGCATCACC
GTGATGACGT CCACCGGCGA CGACGACTAC ATCAACGCCG ACGAGACGGG GGACCTGGAG
TACCGCCGCC AGCAGCTCGA GCAGATGCGC GACGAGGTCG TGGACCTGGA GGACGTCTCG
GGCGGCGTGT CCATCACCGA CCTGGGGCTC AACGAGTTCC GCATGGACCT GGTAGGCTAT
TACCGCGACA ACCCCGACAT CGACCGCCTT CCGAGCGGCA TCAACGCCGT GGTGGAGGGC
GACGAGCCAG GCGTCATCTT CGTGCTGCGC AACGTCAACA GCAGGATCGA CCGCGACGGC
GCGAACCACC TGCACCCGTT CTACGTGGTT CGCATGGGGT CCGACGGCTC CGTAATTCAC
GGGCACCTGG AGCCCAAGGC GGTGCTCGAC GACATGCGCC TTTTGTGCCG CGGCAATTCC
GAGCCCGACA TGGCGCTGTG CCGTGCCTAC AACCGCGAGA CCAAGAACGG CCGTGACATG
CGCCGCGAGG CGGGGCTCTT GCGCGACGCG GTGGCTTCGA TCGTAGACTC CAAGGAGGCG
TCCGACATCG ACAGCTTCTT CGGCGGCGGC ACGACGAGCT TCTTGGAGAA CGACATCGAG
GGGCTCGACG ACTTCGAGCT GGTGTGCTTC CTGGTGGTGA AGCCCAGATG CTAG
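
A GC content figure like the one in the Gene Information section can be derived from a sequence laid out as above. This is a minimal sketch, assuming the sequence is pasted with its ten-base spacing and line breaks intact. How the database itself computes or rounds its reported percentage is not stated in the record, and `gc_percent` is a hypothetical helper name.

```python
def gc_percent(formatted_seq: str) -> float:
    """Percent G+C in a sequence that may contain spaces and line breaks."""
    # Remove the ten-base spacing and line breaks used in the listing.
    seq = "".join(formatted_seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First ten-base block of the gene sequence: 7 of 10 bases are G or C.
print(gc_percent("GTGCAACCCG"))  # prints: 70.0
```

Passing the full gene sequence text to `gc_percent` would give the gene-wide value for comparison with the reported field.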
 
Protein sequence
MQPDFFDNRA RIVKDDLVAN IADGDRVAIA ASVFSMYAYQ ELATQLEGID ELRFVFTSQA 
FTKQRPPREK REFYIPRLSR EQGLCGTDFE IKLRNELTQK AVATECADWI RRKARFRSFE
GEGRMSGFLN VEKTDDNVAY MPFDGFTTRK LGCDNSAEAP DVTMRLDASQ SRALLKQFDD
AWDSGELHDV TDAVIDGITA MYQENAPELI YYMALYRIFS EFLDDVSEDV LPNEGLGFRD
SLIWNKLYDF QKDAALAIIN KLETYNGCIL ADSVGLGKTF TALAVIKYYE SRNKDVLVLC
PKKLRDNWIT YNSNVVNNPI AGDRLQYDVL YHTDLSRTRG TSETGLPLDR LNWGAYGLVV
IDESHNFRNG GDSASEDKMN RYQLLMEKVI KQGVKTKVLM LSATPVNNRF RDLRNQLALA
YWGDPTGWSE KLRLENDIET VFRNAQTVYA RWSKLPGEQR TTRALTDMLD YDFFEVLDQV
TVARSRKHIQ RYYDMSAIGP FPKRLPPISK RPKLSTLANA INYREIYEEL DSLALAVYMP
SSYVHPSKME KYAKMGGGGN LTLGGRETGV RRLMTTNLLK RLESSVCSFR LTLERVLAAM
SAALETIDDY RRGLASGASV ADGDLPGGFD FDPDDESGFE VGGATKILVE DMDWMSWERD
IEADVAVIEV LISMVRDIDP AHDAKLIELC EQIREKSQGP INPGNRKVLV FTAFSDTADY
LYENVSAFAK RELGLECAKV TGDGCACTIK AVPPHMQEVL ACFSPASKER DVVAPQLSGC
DVNIVIATDC ISEGQNLQDC DYLVNYDIHW NPVRIVQRFG RVDRIGSKND RIQLVNYWPD
VELDEYIKLK ARVEERMRIT VMTSTGDDDY INADETGDLE YRRQQLEQMR DEVVDLEDVS
GGVSITDLGL NEFRMDLVGY YRDNPDIDRL PSGINAVVEG DEPGVIFVLR NVNSRIDRDG
ANHLHPFYVV RMGSDGSVIH GHLEPKAVLD DMRLLCRGNS EPDMALCRAY NRETKNGRDM
RREAGLLRDA VASIVDSKEA SDIDSFFGGG TTSFLENDIE GLDDFELVCF LVVKPRC
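
The gene sequence begins with GTG while the protein begins with M. This is consistent with translation table 11, where GTG is one of several alternative initiation codons that are read as methionine at the start of a CDS. The sketch below illustrates this with a hand-built codon table; `translate_cds` and its parameter are hypothetical names, and the code is not the database's own translation routine.

```python
BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG
# (the standard code, whose codon assignments table 11 shares).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(formatted_seq: str, starts_at_initiator: bool = True) -> str:
    """Translate a CDS, reading an initiation codon at position 0 as M."""
    seq = "".join(formatted_seq.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and starts_at_initiator and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGCAACCCG ACTTCTTCGA CAACAGGGCG"))  # prints: MQPDFFDNRA
```

The output matches the first ten residues of the protein sequence, and the final codons of the gene (AAG CCC AGA TGC TAG) translate to KPRC followed by a stop, matching the protein's last four residues.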