Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0799 |
Symbol | |
ID | 8415089 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 995631 |
End bp | 998864 |
Gene Length | 3234 bp |
Protein Length | 1077 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 645023765 |
Product | helicase domain protein |
Protein accession | YP_003181162 |
Protein GI | 257790556 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0553] Superfamily II DNA/RNA helicases, SNF2 family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.356327 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCAACCCG ACTTCTTCGA CAACAGGGCG CGCATCGTCA AGGACGACCT TGTCGCCAAT ATAGCCGACG GCGATCGCGT GGCGATAGCC GCGTCGGTGT TCTCCATGTA CGCCTATCAG GAACTCGCAA CCCAGCTTGA AGGCATCGAC GAGCTGCGCT TCGTCTTCAC GTCGCAGGCG TTCACCAAGC AGCGCCCCCC GCGCGAGAAG CGCGAATTCT ACATCCCGCG GCTTTCCCGA GAGCAGGGCC TTTGCGGCAC CGACTTCGAG ATCAAGCTGC GCAACGAGCT CACGCAGAAG GCCGTCGCGA CCGAGTGCGC CGACTGGATC CGCCGCAAGG CGCGTTTCCG TTCCTTCGAG GGCGAGGGGC GTATGAGCGG CTTTCTGAAC GTGGAAAAGA CCGACGACAA CGTGGCCTAC ATGCCCTTCG ACGGCTTCAC CACGCGCAAG CTCGGCTGCG ACAACTCCGC GGAAGCTCCC GATGTGACCA TGCGCCTGGA CGCCTCCCAG TCGCGCGCCC TGCTCAAGCA GTTCGACGAC GCCTGGGACT CGGGCGAGCT CCACGACGTC ACCGACGCCG TCATCGACGG CATCACGGCA ATGTATCAAG AAAACGCGCC AGAGCTCATC TACTACATGG CGCTCTACCG CATATTCAGC GAGTTCCTGG ACGACGTCTC CGAGGACGTG CTGCCCAACG AGGGGCTGGG CTTCCGCGAC AGCCTCATCT GGAACAAGCT CTACGACTTC CAGAAGGACG CGGCGCTCGC CATCATCAAC AAGCTCGAGA CCTACAACGG CTGCATCCTG GCAGACTCCG TGGGCCTTGG CAAGACGTTC ACCGCGCTCG CCGTGATCAA GTACTACGAG AGCCGCAACA AGGACGTGCT CGTGCTGTGC CCCAAGAAGC TGCGCGACAA CTGGATAACC TACAACTCGA ACGTCGTGAA CAACCCCATC GCGGGCGACC GCCTGCAGTA CGACGTGCTC TACCACACCG ACCTCTCGCG CACGCGAGGC ACGTCGGAGA CTGGGCTGCC CCTCGACCGC CTGAACTGGG GCGCGTACGG GCTTGTCGTC ATCGACGAGA GCCACAACTT CCGAAACGGC GGCGACTCGG CGTCGGAAGA CAAGATGAAC CGCTACCAGC TGCTTATGGA GAAGGTCATC AAGCAGGGCG TGAAGACCAA GGTGCTGATG CTCTCGGCCA CGCCCGTGAA CAACCGCTTC CGCGACCTGC GCAACCAGCT CGCCCTGGCA TACTGGGGCG ACCCCACGGG CTGGTCGGAG AAGCTGCGCC TGGAGAACGA CATCGAGACG GTGTTCCGCA ACGCGCAGAC CGTTTACGCC CGCTGGTCGA AGCTGCCCGG CGAGCAGCGG ACGACCCGCG CCCTCACCGA CATGCTCGAC TACGACTTCT TCGAGGTGCT CGACCAGGTG ACCGTGGCGC GCAGCCGCAA GCACATCCAG CGCTACTACG ACATGAGCGC CATCGGGCCC TTCCCGAAGC GCCTGCCGCC GATATCGAAG CGGCCCAAGC TCTCGACGCT GGCGAACGCG ATCAACTACC GCGAGATATA CGAGGAGCTG GACTCCCTCG CGCTCGCCGT GTACATGCCT AGCTCGTACG TGCACCCCAG CAAGATGGAG AAGTACGCGA AGATGGGCGG CGGCGGAAAC CTCACGCTGG GCGGTCGCGA GACGGGCGTG CGCCGCCTCA TGACGACGAA CCTGCTCAAG CGCCTGGAGA GCTCGGTGTG CTCGTTCCGC CTGACGCTCG AACGCGTACT GGCGGCGATG AGCGCCGCCC TCGAGACCAT AGACGACTAC CGCAGGGGCC TCGCCTCGGG CGCCAGCGTG GCCGACGGCG ACCTGCCGGG CGGGTTCGAC TTCGACCCCG ACGACGAGAG CGGCTTCGAG GTGGGCGGCG CCACGAAGAT CCTCGTCGAG GACATGGACT GGATGAGCTG GGAGCGCGAC ATCGAGGCGG ACGTCGCCGT CATCGAGGTC CTGATCTCGA TGGTGCGCGA CATCGACCCC GCCCACGACG CCAAGCTCAT CGAGCTGTGC GAGCAGATCC GCGAGAAGTC GCAAGGCCCC ATCAACCCGG GCAACCGCAA GGTGCTCGTG TTCACCGCGT TCTCAGACAC GGCGGACTAC CTCTACGAGA ACGTCTCGGC CTTCGCGAAG CGCGAGCTCG GCCTCGAGTG CGCCAAGGTC ACGGGCGACG GCTGCGCCTG CACGATAAAG GCAGTGCCCC CGCACATGCA GGAGGTGCTC GCCTGCTTCT CGCCCGCGTC GAAGGAGCGC GACGTGGTGG CGCCTCAGCT TTCCGGCTGC GACGTGAACA TCGTTATCGC CACCGATTGC ATCTCGGAGG GCCAGAACCT CCAGGACTGC GACTACCTGG TGAACTACGA CATCCACTGG AACCCCGTGC GCATCGTGCA GCGCTTCGGC CGCGTTGACC GCATCGGCTC TAAGAATGAC CGCATCCAGC TGGTGAACTA CTGGCCGGAC GTCGAGCTTG ACGAGTACAT CAAGCTCAAG GCGCGCGTCG AGGAGCGCAT GCGCATCACC GTGATGACGT CCACCGGCGA CGACGACTAC ATCAACGCCG ACGAGACGGG GGACCTGGAG TACCGCCGCC AGCAGCTCGA GCAGATGCGC GACGAGGTCG TGGACCTGGA GGACGTCTCG GGCGGCGTGT CCATCACCGA CCTGGGGCTC AACGAGTTCC GCATGGACCT GGTAGGCTAT TACCGCGACA ACCCCGACAT CGACCGCCTT CCGAGCGGCA TCAACGCCGT GGTGGAGGGC GACGAGCCAG GCGTCATCTT CGTGCTGCGC AACGTCAACA GCAGGATCGA CCGCGACGGC GCGAACCACC TGCACCCGTT CTACGTGGTT CGCATGGGGT CCGACGGCTC CGTAATTCAC GGGCACCTGG AGCCCAAGGC GGTGCTCGAC GACATGCGCC TTTTGTGCCG CGGCAATTCC GAGCCCGACA TGGCGCTGTG CCGTGCCTAC AACCGCGAGA CCAAGAACGG CCGTGACATG CGCCGCGAGG CGGGGCTCTT GCGCGACGCG GTGGCTTCGA TCGTAGACTC CAAGGAGGCG TCCGACATCG ACAGCTTCTT CGGCGGCGGC ACGACGAGCT TCTTGGAGAA CGACATCGAG GGGCTCGACG ACTTCGAGCT GGTGTGCTTC CTGGTGGTGA AGCCCAGATG CTAG
|
Protein sequence | MQPDFFDNRA RIVKDDLVAN IADGDRVAIA ASVFSMYAYQ ELATQLEGID ELRFVFTSQA FTKQRPPREK REFYIPRLSR EQGLCGTDFE IKLRNELTQK AVATECADWI RRKARFRSFE GEGRMSGFLN VEKTDDNVAY MPFDGFTTRK LGCDNSAEAP DVTMRLDASQ SRALLKQFDD AWDSGELHDV TDAVIDGITA MYQENAPELI YYMALYRIFS EFLDDVSEDV LPNEGLGFRD SLIWNKLYDF QKDAALAIIN KLETYNGCIL ADSVGLGKTF TALAVIKYYE SRNKDVLVLC PKKLRDNWIT YNSNVVNNPI AGDRLQYDVL YHTDLSRTRG TSETGLPLDR LNWGAYGLVV IDESHNFRNG GDSASEDKMN RYQLLMEKVI KQGVKTKVLM LSATPVNNRF RDLRNQLALA YWGDPTGWSE KLRLENDIET VFRNAQTVYA RWSKLPGEQR TTRALTDMLD YDFFEVLDQV TVARSRKHIQ RYYDMSAIGP FPKRLPPISK RPKLSTLANA INYREIYEEL DSLALAVYMP SSYVHPSKME KYAKMGGGGN LTLGGRETGV RRLMTTNLLK RLESSVCSFR LTLERVLAAM SAALETIDDY RRGLASGASV ADGDLPGGFD FDPDDESGFE VGGATKILVE DMDWMSWERD IEADVAVIEV LISMVRDIDP AHDAKLIELC EQIREKSQGP INPGNRKVLV FTAFSDTADY LYENVSAFAK RELGLECAKV TGDGCACTIK AVPPHMQEVL ACFSPASKER DVVAPQLSGC DVNIVIATDC ISEGQNLQDC DYLVNYDIHW NPVRIVQRFG RVDRIGSKND RIQLVNYWPD VELDEYIKLK ARVEERMRIT VMTSTGDDDY INADETGDLE YRRQQLEQMR DEVVDLEDVS GGVSITDLGL NEFRMDLVGY YRDNPDIDRL PSGINAVVEG DEPGVIFVLR NVNSRIDRDG ANHLHPFYVV RMGSDGSVIH GHLEPKAVLD DMRLLCRGNS EPDMALCRAY NRETKNGRDM RREAGLLRDA VASIVDSKEA SDIDSFFGGG TTSFLENDIE GLDDFELVCF LVVKPRC
|
| |