Gene Elen_1588 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1588 
Symbol 
ID8415887 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1885945 
End bp1888782 
Gene Length2838 bp 
Protein Length945 aa 
Translation table11 
GC content65% 
IMG OID645024557 
Productmolydopterin dinucleotide-binding region 
Protein accessionYP_003181945 
Protein GI257791339 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.0385421 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGAGAACA TGGCTGAAAC CACGTTGACA CGTCGAAGCT TCGTCAAGGC GTCCGCTCTG 
GTGGGCGCGA CGGCGGCGTT CGGCGCTTCG ATGGCCGGCT GTATGCAGGA GGCTCCGCAG
GAGCAGGCTC CGTCCGGCGG CGGCGCCGAC GAGGGTCTCG TGAGGATGAA GACGTCCTGC
CACGGCTGCA TCCAGATGTG CCCGGCTATC GCGTATCTGA AGGACGGCGT CGTCGTAAAG
CTGGAAGGCG ACCCCGACGC GCCGGTAAGT CGCGGAAGCC TGTGCATCAA GGGCCTCAAC
CAGCTGCACA CCATGTACAG CCCCCGCCGC GTGCTGCATC CGCTTCGGCG TGCCGGCGAG
CGCGGCGAGA ACAAATGGGA GGTCATCAGC TGGGACGAGG CCGTCGAGGA AGCGGCCACG
CATATCTGCG ACGCCATCGA CAAATACGGC CCCTATTCCT TCTTCGCCAG CGTGGGCGGC
GGCGGGGCCT ACTCGTTCAT GGAGGCCATG ACCCTGCCCA TGGCGTTCGG GTCGCCCACC
GTGTTCGAGC CCGGCTGCGC CCAGTGCTAT CTGCCGCGCT GGAGCATGTC GAAACTGTTC
TACGGCGGCA ACGACCAGTC CATCGCCGAC AACGCCGTGC AGGAGATATT CCGTCCCGAC
CCCGACAACA AGGCCGAGGT CGTGGTGCTG TGGGGCGCCC AGCCTTCCGT CAGCCAGACG
GCGGAATCGG GTCGCGGAAT GGCCGAGCTG CGCGCCAAGG GCGTGAAGAC CATCGTGGTC
GATCCCAACT TCTCGCCCGA CGCGGTGAAG GCCGACGTGT GGCTGCCGGT GCGCCCGGCC
ACCGACACGG GTCTGCTCCT GTGCTGGTTC CGCTACATCT TCGAGAACAA GCTCTACGAC
GAGCAGTTCA CGAAGTACTG GACGAACCTG CCCTTCCTTA TCGACCCCGA GACGAAGCTG
CCGGTGAAGG CGCAGGAGCT GTTCCCCGAC TTCCAGCAGA CCACGCCCGA GAACACCCCG
GCCTACGTCT GCTACGACCT CAAGACGAAC GCAGTGGCGC CCTTCGAGTT CTCCGCGCCC
GCCGACGCCG CAGTGGACCC CGAGATCTTC TGGACGGGCG ACTTCGAAGG GAAGACGTAC
AAGACCGCCG GCCAGATCTA CAAGGAAGAA GCCGATCCCT GGACGCTCGA GCACACCGCC
GAGAACTGCT GGCTCGATGC CGGCAAGATC GAAAAAGCCA TCAAGATCTA CGCCGAGGCC
TCCGTGGCCG GCATCGCCAA CGGCGTGGCG TCGGATATGA CCGAGTCCGC CTCGCAGGTG
CCGCTGGGCT GCATGGGCCT GGACTCCATC ATGGGCTACG TCAACAAGCC CGGCTGCACG
ATGACGCAGT ACGGCGCCGC CGGCGCGCCG CCGACGAAGC GCCCGGTCAC GTACAACAAC
GGCTTCGACG GCATGTTCTC GGACATGTAC GGCATCGGCG CGGTCATCGG CATGAGCGAT
GCCGAGAACG AGGCGCGCGC CAAGAAGCTG GGGGAGGAGA ACCCGCAGCA GAAGCTGGCG
AACCAGCTGC TCGTCGATCG ACTTGGCATG AAGGACCACA AGGGCCTGTA CGCATGGTGC
CACAGCCATA TCCCCACGGT GCGCGAGGCC ATCGCCACCG GCGAGCCGTA CAAGCCGCGC
GTGTGGTTCG ACATGTCGGG CAACAAGCTG GCCATGCTGG GCAATGCGAA GTCGTGGTAC
GACGTGTTCC CCGAGGTCGA CTACATCATC GGCCAGTACC CGATGCTCAC CTCCTTCCAC
ATCGAGGCCG CCGACCTCGT GTTCCCCGTG CGCGAGTGGC TGGAAGAGCC CATGGTGAAC
ATGACCCAGC TCAACACCCA GTGGCTGCAG AACGAGTGCG TGCACATCGG CGAGACGGTG
TCGCACTCCA TCCCAGCCGC GCAGGTGGTG GCGAAATGCG CGGAGAAGAT GGGCGGCGAG
CTGCCCGGTC TCAAGCCCGG ATACCTGGGC AGCGCCACCG AGGAGGAGGT CAAGGCCTCC
GTCGCCGAGA CGCTGCACGC GCCCAGCTGG GACGAGCTGG TGAAGGACGC CGACAAGTAC
GTGCCCTACG TCACGCCGGC CAGCGAGTAC TTCCATTACG ACCAACATGA AACCGTGGTG
GACGATGGCC TGCCTGCCGG CTTCGGCACC GAGTCCCGCA AGATCGAGGT GTACTGCCAG
ATCCTGCTCA AGCTGGCGCG CACGGGATAT CCGTTCTGCT ATCCGGAGCC GCAGGAGCCC
TGCGAGGACT ACAGCCCCAT CTGCTCGTAC ATCGAGCCGG CGGAGAGCCC GCTATCCGAC
GAGGAGTACC CCTTCGTGCT CACGTCGGGC CGCGTGCCGT ACTTCCACCA TGGCACCATG
CGCCACGCCG CGCTGTCGCG CGAGCTGTTC CCCACGGCCG AGATCCGCAT CAATCCGGCG
AGCGCCAAGG AGCTGGGCAT CGAGCATATG GATTGGGTGA AGGTGACCAG CCGCCGCGGC
GAGGTGCACG CGCGCGCCTA CCTCACCGAG GGCGTGCATC CGAAAACCGT GTGGATGGAG
CGCTTCTGGA ACCCGGAGTG CTACGACGAG TCCCAGACGA ACCCGACGGC CGGCTGGCGC
GAGTGCAACG TGAACGTGCT CACGAAGAAC GACGCCCCGT TCAACGAAGT GTACGGCTCC
TACACGAACC GCGGTTTCAC GGTGAAGATC GAGAAGTCCC AGAAACCTGC GAACGTATGG
GTGGAGCCCG AGGAGTTCGC GCCGTTCCTG CCGACTGAGG AAATGCTGTC CGAAGCTCAG
ACGAAGGATG TGTTCTGA
 
Protein sequence
MENMAETTLT RRSFVKASAL VGATAAFGAS MAGCMQEAPQ EQAPSGGGAD EGLVRMKTSC 
HGCIQMCPAI AYLKDGVVVK LEGDPDAPVS RGSLCIKGLN QLHTMYSPRR VLHPLRRAGE
RGENKWEVIS WDEAVEEAAT HICDAIDKYG PYSFFASVGG GGAYSFMEAM TLPMAFGSPT
VFEPGCAQCY LPRWSMSKLF YGGNDQSIAD NAVQEIFRPD PDNKAEVVVL WGAQPSVSQT
AESGRGMAEL RAKGVKTIVV DPNFSPDAVK ADVWLPVRPA TDTGLLLCWF RYIFENKLYD
EQFTKYWTNL PFLIDPETKL PVKAQELFPD FQQTTPENTP AYVCYDLKTN AVAPFEFSAP
ADAAVDPEIF WTGDFEGKTY KTAGQIYKEE ADPWTLEHTA ENCWLDAGKI EKAIKIYAEA
SVAGIANGVA SDMTESASQV PLGCMGLDSI MGYVNKPGCT MTQYGAAGAP PTKRPVTYNN
GFDGMFSDMY GIGAVIGMSD AENEARAKKL GEENPQQKLA NQLLVDRLGM KDHKGLYAWC
HSHIPTVREA IATGEPYKPR VWFDMSGNKL AMLGNAKSWY DVFPEVDYII GQYPMLTSFH
IEAADLVFPV REWLEEPMVN MTQLNTQWLQ NECVHIGETV SHSIPAAQVV AKCAEKMGGE
LPGLKPGYLG SATEEEVKAS VAETLHAPSW DELVKDADKY VPYVTPASEY FHYDQHETVV
DDGLPAGFGT ESRKIEVYCQ ILLKLARTGY PFCYPEPQEP CEDYSPICSY IEPAESPLSD
EEYPFVLTSG RVPYFHHGTM RHAALSRELF PTAEIRINPA SAKELGIEHM DWVKVTSRRG
EVHARAYLTE GVHPKTVWME RFWNPECYDE SQTNPTAGWR ECNVNVLTKN DAPFNEVYGS
YTNRGFTVKI EKSQKPANVW VEPEEFAPFL PTEEMLSEAQ TKDVF