Gene Elen_0452 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_0452 
Symbol 
ID8414736 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp575770 
End bp579033 
Gene Length3264 bp 
Protein Length1087 aa 
Translation table11 
GC content68% 
IMG OID645023425 
Productmolydopterin dinucleotide-binding region 
Protein accessionYP_003180828 
Protein GI257790222 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones72 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGCGC AGCATAAAGA CGGACGGCGA TGCGATGCCG CAAGCGAGCC TCAGCGTTCG 
AGCTTGTTCG GAACGCCTTC GCGGCGCGCC GTGGTCGGCG CCGGTTGCGC GGTCGCCGCA
GGCGCCTTGG TCGCCGGAGG CGGGCTGGGC GCGTACATGG CCGCTGATGA CCCGCTGACC
GACGAGCCTC ACGGGCGCGG CAACGCCACC GGCGCGTGCG CGGCCGACGA CGTCATCCTG
TCCATGTGCA ACAACTGCAA CAGCTACTGC ACCATCAAGG TGCGCGTGAC CGATGCGGCC
GACGGCAAGC AGGCGAACGA CGGCGCCACC GCGCTGGTGC GCAAGATCGC GGGCAATCCG
TACTCGCCGC TCAACTCGCA GCCCTACGCG CCGATCCCCT ACGCTACGCG GCCCGAGGAA
GCGCTCGCCC CCGGCGACGA CATGGCCGTG GCCGGTCGCG CTTCGAACGG GGGCATGATA
TGCCTCAAGG GCCAGGCGGG CATTCAGCTC GTGCACGACC GGTTCCGCAT CACGCAGCCT
TTGCGCCGCG TGGGGGAGCG CGGGTCGGAC GAGTGGGAGA CGGTCAGCTG GGACACGGCC
CTCGACGAGA TCGTGAACGG GTCGCCCGCG CTGGGCACGC CGGGCATCGC CGAATGGTAC
GCCTACGCGC CGAAGAAGCA GGTGGAGGCC GATGTCGCGC TTGTGGAGTC GGGCGAGATG
ACCAAGGACG CCTTCGCGGC GAAATGGGCC GACAAGCTGA TCGACGTGGA GCATCCCGAC
CTGGGCCCCA AGTCCAACCT GTTCTGCTCG GCCGGCGGCG ACCGCATGTT CCTCATCGGG
GACCGCTTGA CGCAGCTGGG GTTCGGCTCG GTCAACAACT TCAACCACGG CGGCGTGTGC
GGCATGACGG GCGTCATGGC CAACGTGCGC ACGCACCCCA CGACGAACCA TAAGCGCATG
TACGCCGACA TCGACCATTG CGAATGCCTT ATCATCTGGG GCACCGAGCC CATGACGGCG
AACAAGGGTC CGTCGTGGCT GGCTCCGCGC CTGTCGGTGG CGCGCGAGCG CGGCATGAAG
CTGTACGTGG TCGACCCGCG CCAGGGGCGT AGCGCCTCGA AGGCCGACGT GTGGCTGCCC
GTGATTCCCG GCAAGGACGC CGAGCTGGCG TTCGCCATGA TGTCGTGGAT CATCGCGAAC
GAGCGCTACG ATGCGGCCTA CCTGTCGGCG CCTTCGAAGA AGGCAGCCGC GGCGCTGGGC
GAGCCCACCT GGTCCGATGC GACGCACCTC GTGGCCGTCG ACCTGCCGAA CCGGCCCATC
GTCACGGCCA AGGCGCTGGG GCGCGCGGGC GAGGTCGGCG CGAACGGCGA GGCGCTGGCC
GACGACGCGC GCTTCGTGCT GGTGGACGGC GAGCTGACTC TCGCCGATGC CGCCGAGGGC
GCCGCCGATC TGCTAGTGGA CGCAGAGATC GACATCAAGG GAAAGCCTGC GCGCGTGAAG
AGCGTGTTCC AGCTTTTGAA AGAGCGCGTC GAGGAGCGAA CGCTGGAGGA GTACGCCGCC
GATGCGGGCA TCGAGCCCTC CGTGGTGGAA GAGGTGGCGC GCGAGTTCAC GTCGCACGGC
AAACGTGCCT GCGTCATGAG CTACCGCGGC CCGGCTATGC ACGCCAACGG CTTCGATGCG
GTGCGCGCCG TGGGGTATCT CAACTTCCTC ATCGGCAACC ACGATTGGAA GGGCGGTCAC
ATCGCCGCCG CGGCGAAGTT CGCGCCGTTC GAGGGCCGCT ACGACTTGAA GACCGTGCCG
GACGCGCATG CCGGATGGGG CATTCCCATC ACGCGCCAGA AGACCGAGTA CGAGAAGACC
AGCTACTTCA AGCAGGACGG CTATCCGGCG CCGCGTCCGT GGTACCCGCT GCCGGGCAAC
CTGTCGCACG AGATCGTGCC CACCTTGCGC GCGGGCTACG TGTACGATCA TTTGGGCGCG
CTGTTCATCC ATCGCCATTC GCTGGTGGAC TCCACGCCCG GCGGGCGGCG ACTGGCCGAC
GTGCTGGGCG ACCAGGACAA GATCAACCTG CTGGTGTCGT TCGACGTGGA GATAGGCGAC
ACGTCGCGCT TTGCCGACTT CGTGTTACCG GACAAGGTGT ACCTCGAGCG CTTCAGCCAG
GAATCCATCT ATCCGAACCA GCAGTACCAG CTCATCCAGC TGGGGCAGCC GGCGGTGCGC
GCCTTCGACG GCCCGCGCTC GGTGGAGGAC GTGTACTTCG ACATCATGCA ACGCCTCGGG
CTGCCGGGCG TCGGCGAGCA TGCCGTGCCG GTGGGCAAGG ACGGCGACGG CGGCACGGCG
GCCCTTTCGA CCGAGTACGA CTACTGGTTG AAGATGGCCG CGAACATCGC CTACGCCGGC
GAGAAGCCGG TGCCGGACGC CGACGACGAC GAGCTCGCCC TGTTCGAGCG CGCCCGCAAG
CGCGCGCTGG GCGAGGCGTT CGATCTGGAG GCGTGGAAAG CCGCCGTCAC CGAAGAGGAA
TGGCCGAAGG TGGTGTACGT GCTGAACCGC GGCGGGCGGT TCGCGTCGGC CGACCCTGCG
AAGGGCGACG GTTACGACGG AGATCTGATC AAGACGAAGT ACGCCGGCTT GTGCGCGTTC
TACGACCCGA AGACGGCTTC GCTGAAAGAC GCGTTGACCG GTGAGAACTT CGACGGATTG
GCCCATACGG CGCCGATCGC CTTCGCCGAC GGCACGCCGA TGGCGCGGCC GGCCGACCGA
CCGTTCGCGT TCATCAACTG GAAAGCGCGC ACGAACGGCA CGCACCGCAC CATCGCCGCG
TCGTGGCTGC GCGAGACGAC CACCGAGAAC TTCGTATGGA TGTCGCCGTC CGACGCTGCC
GAGCGCGGCT TGAAGAACGG CGATGCCGTG GAGGTCGTGG GGCCTGAAGG CACGCTTTCC
GGTCACGTGC GCGTGACCGA GGGCATTCGT CCCGGCGTGG TGGGGGCCAA CTACTCGTTC
GGCCAGCAGG GCTACGCCGC GCGCGCGGTC ACCATCGACG GCATGTTGAC GGGCCCGGCT
CCCGATTACC TCGAAGAAGA GGGCATCCTC GACGGCGACG AGCCCGGCAA GCAGAAGACC
GGCTTCGCCG GCGGGCGGGG GCGCGGCTTC TGCATGAACG AGCTGCTGCC CGAGGAAACG
CTTGCCGGAG GCGGTGGCGT GACCGATCCT ATTGGAGGCG GCGCCGCCCA GTTCGACCTC
TGGGTGGACG TGCGGAAGGT GTAG
 
Protein sequence
MEAQHKDGRR CDAASEPQRS SLFGTPSRRA VVGAGCAVAA GALVAGGGLG AYMAADDPLT 
DEPHGRGNAT GACAADDVIL SMCNNCNSYC TIKVRVTDAA DGKQANDGAT ALVRKIAGNP
YSPLNSQPYA PIPYATRPEE ALAPGDDMAV AGRASNGGMI CLKGQAGIQL VHDRFRITQP
LRRVGERGSD EWETVSWDTA LDEIVNGSPA LGTPGIAEWY AYAPKKQVEA DVALVESGEM
TKDAFAAKWA DKLIDVEHPD LGPKSNLFCS AGGDRMFLIG DRLTQLGFGS VNNFNHGGVC
GMTGVMANVR THPTTNHKRM YADIDHCECL IIWGTEPMTA NKGPSWLAPR LSVARERGMK
LYVVDPRQGR SASKADVWLP VIPGKDAELA FAMMSWIIAN ERYDAAYLSA PSKKAAAALG
EPTWSDATHL VAVDLPNRPI VTAKALGRAG EVGANGEALA DDARFVLVDG ELTLADAAEG
AADLLVDAEI DIKGKPARVK SVFQLLKERV EERTLEEYAA DAGIEPSVVE EVAREFTSHG
KRACVMSYRG PAMHANGFDA VRAVGYLNFL IGNHDWKGGH IAAAAKFAPF EGRYDLKTVP
DAHAGWGIPI TRQKTEYEKT SYFKQDGYPA PRPWYPLPGN LSHEIVPTLR AGYVYDHLGA
LFIHRHSLVD STPGGRRLAD VLGDQDKINL LVSFDVEIGD TSRFADFVLP DKVYLERFSQ
ESIYPNQQYQ LIQLGQPAVR AFDGPRSVED VYFDIMQRLG LPGVGEHAVP VGKDGDGGTA
ALSTEYDYWL KMAANIAYAG EKPVPDADDD ELALFERARK RALGEAFDLE AWKAAVTEEE
WPKVVYVLNR GGRFASADPA KGDGYDGDLI KTKYAGLCAF YDPKTASLKD ALTGENFDGL
AHTAPIAFAD GTPMARPADR PFAFINWKAR TNGTHRTIAA SWLRETTTEN FVWMSPSDAA
ERGLKNGDAV EVVGPEGTLS GHVRVTEGIR PGVVGANYSF GQQGYAARAV TIDGMLTGPA
PDYLEEEGIL DGDEPGKQKT GFAGGRGRGF CMNELLPEET LAGGGGVTDP IGGGAAQFDL
WVDVRKV