Gene Elen_0515 details


Gene Information

Locus tag: Elen_0515
Symbol:
ID: 8414799
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 663588
End bp: 666482
Gene Length: 2895 bp
Protein Length: 964 aa
Translation table: 11
GC content: 61%
IMG OID: 645023486
Product: molybdopterin dinucleotide-binding region
Protein accession: YP_003180889
Protein GI: 257790283
COG category: [C] Energy production and conversion
COG ID: [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
TIGRFAM ID:
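
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 663588
end_bp = 666482

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the final codon is a stop codon and is not counted
# in the reported protein length.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the reported gene length of 2895 bp and protein length of 964 aa.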


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.470196
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCGAAC TGCTCATGAC CCGACGTGCG TTTGCGAAGG TGATGGCCGT GACGGCTGCG 
GCCGCCGGTT TCACGGGGGC TCAGTCGGCG TTGGCGGATA CCGAGCCGGC TGCCTCGTCG
GGCGAAGTCA AGCGCATACG CTCGGCCTGT CGCGGATGCG GCAAGATGGA GTGCGGCGTT
TGGGTGACCG TTCAGGATGG GCGGGTTGTC AAAACGGAGG GCGATGAAAG CGCTTTCCAG
TCGGCGGGAA ACCATTGCGC GAAAGGGCAG GCGTCCTTGC AGGCGGCGTA TCATCCCGAC
CGTCTCATGT ACCCGCTCAA GCGCACGAAT CCCAAAGGGC AAGAGCCCGG TTGGGTGCGC
ATCAGCTGGG ATGAGGCGTA TCGATCCACC GTCGAGGCAA TCCATAAGAA CCAGGAAAAG
TACGGCAACG AGACGTGCTT CTTCATGGGC GGCACGTCGC GTATCTGGGC CATGGGCCCT
TATGGCGCGC TGAAGCAGTG CTTCGGGTCG CCGAACGGCA TACAGGCCAA CGAGATATGC
AAAGGCCCGC GCTTCTACGC TACGAAGCTG AACGATTCGA ACGCCTACAG CTGGATGGAA
GTGGTGGGGC GTCCGCGCGT GTACGTGCAA TGGGGCGGCG CGTCGGAGCT GTCCAATTAC
GACGATAGCT GTCGCACCAC GGTCGACGTG GCGACGCGCG CCGACAAGCA CATCCTGGTC
GATCCGCGCC AGACGAACTT GGGGAAAGAG GCGGATATCT GGGTGAACCT TCGTCCCGGA
ACCGATGGGG CAGTGGCGAA CTGCTGGGCC AACGTGATCA TCGAGAACGA GCTGTACGAC
GATTTGTACG TACGGAAGTG GATGAACGCT CCCATGTTGG TGGTGGAAGA TGAGAGCTTC
GAGCCAACCC CTTCCTCGTC GAGCGCGCAG ACGGCCAAGG TGCGTACGCG CCTTCTCAAA
GAGTCCGACC TCGTCGAGGG CGGGGCGGAT ACCCGCTTCA TGGTGCTCAA CGAGATCACA
AACGAGCTGA GCTGGTACGA TGCCGGCGGC GACAACCCCG GCTGGGAGGG CGAGGACTGG
GTTCCCGCAA CCGAGGGCAA AGAAGCTCAT CAGCCGGGAT TGGATCTGAC GGGTCAGACG
CAGGGCTTCG TGCTTGACTA CGTACCGTTT CCCGACGGGC TGCTTCCCGC GCTGCATACC
CCCGAAGGCG GCTTCGAGGT GGAGTTGAAA GACGGCTCCA GCGTGCACGT GCGCACGGTG
TGGGAGCGCT ATATCGAGTT CTTGGAAGAC TACACTCCCG AAAAGGTGTC GGAGATATCG
GGCGTTGACG TCGAAGTGCT AAAAGAGGCG GCCATCACGT ACGCAACGCG CGTCGACCCT
TCGACCGGGT ACGGCAATGG CGGAATCCAG TACATGCTGG CGTTGGAACA CGCGTGCAAT
TCGACGCAGA ACAACCGTGC CTGCGACCTG CTGGCGGGCA TAACGGGCAA CATGGACACT
CCGGGCGGCA TGCGCGGCTC GACTCCCGGT TGGCCCGTCT ACGACCTCGG CATGTGCGTG
CCCGACTCCG GCAAAACCAC CGAGTACACC ATCGAGAAGA TTTTGGGCAA AGAGCGTTTT
CCGATGATCG GCTCGGACTG CAACCCCAGC TGGGCGGATG CGACGTCGGT GTACGACGCT
ATCGAGAGCG GCGAGCCTTA TAACGTGACG TGCGGCATCG GGCAAACGGG CGACTTCATG
AACCAGTCGA ACTCCCTCTT TGCCGCGGAG CAGCTTCAGA AGCTCGACTT CTGGTGCTCG
ATCGATCTGT GGCACACCCC GTGCGTGGAT ATGATGGCCG ATATCGCAAT GCCCGCCGCT
CATTGGCTGG AGCTCGATTG CATTCGCAAA AGCCAGGGCT CGTCGGGTGC GTTCGGCGCT
ACGGTGAAAG CGGTGGAGCC TCCCGGAGAA GCGAAGAACG ATCTGGAGAT CGTGGTTGGC
CTGTATAAGG CCGCCGGGGT CCCGTACTTC GACGAGGAGT ATCACGGTGC GGCGTGGCTC
GAGGGGGATG AAGCCGTGGA CGCGTGCAAC AACGTTGCGC TCAAAAGCTT CCGCATTCCC
GATTGGAATG ACTACAAGAA GGAATTCCAA GAGAAGGGCT GGTTCGATTC CAAGGTGGAG
AAGCCCGACG ATTGGGGCGT CTACCGGCGC TATCAAACCG GAAACGGCCA CATCAACGGA
GGGTTCCCGC CCAATCCGAA CCAGCATCAA GGCTGGAACA CGACCACGCA CAAGCAGGAG
ATCTGGTCGA CGGTGCTGGA ATCATGGTTG CCCGGCGAGG GCGAGGAGTT CCCGAAATTC
GTGGAGGCTC CGCATGGCCC CGTCGCCGAC CCCGATTTGT TCACGGACGA CAACTCGTTC
TTGATGACCA CCGGGCGTCG TCAAGGAACC TACTTCCATT CCGAGCACCG GCAGCTGCCA
TGGTGCCGCG AGCTGTGGCC CGTTCCTCGG CTTGAGATGA ATCCCGTTGA CGCCGAGCGA
CTTGGTCTCG AGCAGGGGGA TTGGGTATGG ATCGAAACCG ATCAGCACAA GATTCGCGAA
GTGGTCGATC TGTACTACGG CATCGCCCCG GGTGTGGTGA ACGCCGAGCA CCAATGGTGG
TACCCCGAGC TCAATCAGCC CGATCACGGG TTCAAATTGT CCGGCGTGAA CTGTTTGATC
GATCGCCATG CCCAAGATCG CATCATCGGG TCGTCGAACC TGCGTGCTTA CGGTGTGAAG
GTGTACAAGG CCACGCCCGA GAATTCGCCG TTCGGCAACC CCGTGCCGTG CGGAGACGAC
GGGACGCCCA TCATCCACAC CTGCGACGAT CCGCGCCTGA AGGAATGGCA ACCGTTGTAC
GAGGGGAGGG AGTGA
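
The GC content field can be recomputed from a sequence listed in 10-base blocks like the one above. This is a minimal sketch: `gc_content` is a hypothetical helper, not a function of the source database, and it assumes the input contains only A, C, G, T and whitespace. It is shown here on the first row of the gene sequence only, so its result is not expected to match the whole-gene figure.

```python
def gc_content(dna_text):
    """Return the fraction of G and C bases in a whitespace-separated sequence."""
    # Drop the spaces and line breaks used for display, and normalise case.
    seq = "".join(dna_text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence, copied from the listing.
first_row = "ATGGGCGAAC TGCTCATGAC CCGACGTGCG TTTGCGAAGG TGATGGCCGT GACGGCTGCG"
print(f"{gc_content(first_row):.1%}")
```

Passing the full gene sequence as one string gives the value to compare against the reported GC content.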
 
Protein sequence
MGELLMTRRA FAKVMAVTAA AAGFTGAQSA LADTEPAASS GEVKRIRSAC RGCGKMECGV 
WVTVQDGRVV KTEGDESAFQ SAGNHCAKGQ ASLQAAYHPD RLMYPLKRTN PKGQEPGWVR
ISWDEAYRST VEAIHKNQEK YGNETCFFMG GTSRIWAMGP YGALKQCFGS PNGIQANEIC
KGPRFYATKL NDSNAYSWME VVGRPRVYVQ WGGASELSNY DDSCRTTVDV ATRADKHILV
DPRQTNLGKE ADIWVNLRPG TDGAVANCWA NVIIENELYD DLYVRKWMNA PMLVVEDESF
EPTPSSSSAQ TAKVRTRLLK ESDLVEGGAD TRFMVLNEIT NELSWYDAGG DNPGWEGEDW
VPATEGKEAH QPGLDLTGQT QGFVLDYVPF PDGLLPALHT PEGGFEVELK DGSSVHVRTV
WERYIEFLED YTPEKVSEIS GVDVEVLKEA AITYATRVDP STGYGNGGIQ YMLALEHACN
STQNNRACDL LAGITGNMDT PGGMRGSTPG WPVYDLGMCV PDSGKTTEYT IEKILGKERF
PMIGSDCNPS WADATSVYDA IESGEPYNVT CGIGQTGDFM NQSNSLFAAE QLQKLDFWCS
IDLWHTPCVD MMADIAMPAA HWLELDCIRK SQGSSGAFGA TVKAVEPPGE AKNDLEIVVG
LYKAAGVPYF DEEYHGAAWL EGDEAVDACN NVALKSFRIP DWNDYKKEFQ EKGWFDSKVE
KPDDWGVYRR YQTGNGHING GFPPNPNQHQ GWNTTTHKQE IWSTVLESWL PGEGEEFPKF
VEAPHGPVAD PDLFTDDNSF LMTTGRRQGT YFHSEHRQLP WCRELWPVPR LEMNPVDAER
LGLEQGDWVW IETDQHKIRE VVDLYYGIAP GVVNAEHQWW YPELNQPDHG FKLSGVNCLI
DRHAQDRIIG SSNLRAYGVK VYKATPENSP FGNPVPCGDD GTPIIHTCDD PRLKEWQPLY
EGRE
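
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below is an illustration, not the database's own pipeline: `translate` and `CODON_TABLE` are hypothetical names, and the table is the standard genetic code, which has the same codon assignments as translation table 11 (table 11 differs in which start codons it permits; this gene starts with ATG, so that difference does not matter here).

```python
from itertools import product

# Standard genetic code in NCBI order: bases T, C, A, G with the third
# codon position varying fastest. '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna_text):
    """Translate a whitespace-separated DNA string, stopping at the first stop codon."""
    seq = "".join(dna_text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last rows of the gene sequence, copied from the listing.
first_row = "ATGGGCGAAC TGCTCATGAC CCGACGTGCG TTTGCGAAGG TGATGGCCGT GACGGCTGCG"
last_row = "GAGGGGAGGG AGTGA"
print(translate(first_row))
print(translate(last_row))
```

The two outputs correspond to the first 20 residues and the final four residues of the protein sequence above. Translating a row in isolation only works when the row begins on a codon boundary, which holds for these two rows.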