Gene Elen_0471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_0471 
Symbol 
ID8414755 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp602045 
End bp605098 
Gene Length3054 bp 
Protein Length1017 aa 
Translation table11 
GC content66% 
IMG OID645023442 
Productmolydopterin dinucleotide-binding region 
Protein accessionYP_003180845 
Protein GI257790239 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTAACC TGACCATGTC ACGACGCACG TTCGTCAAGA CCGCCGCCAT CACCGGTGCG 
GCAGCCGCGG CGTTCGGCGC TTCGACGCAC ACCGCGCTGG CCGAAGAGAC GTACAGCAGC
GTGTCCGGCA ACGACACCGT GGCCGTGAAA ACGTGCTGCC GCGGCTGCGG CAAGATGGAG
TGCGGCGTCA AGGTCATCGT CCAGAACGGG CGGGCCATAC GCGTCGAGGG CGACGAGGGA
GCGTTCCAGT CCATGGGCAA CTGCTGCACG AAGTCGCAGT CGTCCATCCA GGCGGCGTAC
CACCCCGACC GTCTGCACTA TCCCATGAAG CGCACGAACC CGAAGGGCGA GGAGCCCGGC
TGGCAGCGCA TCTCGTGGGA CGAGGCCATG CAGTCCATCG TGGACAACTT CATGGACATC
AAGGCGAAGC ACGGCGGCGA GGCCATCGCC TGCCAGGTGG GCACGTCGCG CATCTGGTGC
ATGCACTCGG AGTCCATCCT GAAGAACATG CTGGAAACGC CGAACAACGT TGAGGCCTGG
CAGATCTGCA AGGGCCCGCG CCACTTCGCC ACCACCATGG TGTCGCAGTT CGCCATGTCG
TGGATGGAAA CCATCACGCG CCCGAAAGTG TACGTGCAGT GGGGCGGCGC GTCCGAGCTG
TCCAACTACG ACGACTCCTG CCGCACCACG GTGGACGTGG CATCGCGCGC CGACGTGCAC
ATCAGCGTCG ACCCCCGCAT GGCCAACATG GGCAAGGAAG CCGACTACTG GCAGCACCTG
CGCCCCGGCA CCGACGGCGC CCTGGCCCTG GCCTGGACGA ACGTCATCAT CGAGAAGAAG
CTCTACGACG AGCTGTACGT GAAGAAGTGG ACGAACGCCC CGTTCCTCGT GTGCGAGGAC
ATGGAGCCCT CCGGCTTCCC CACCGTGCGC ACCGACGGTT CCTACTGGGA CGTGAAAACC
GCGCTGCTCA AGGAAAGCGA CATCAAGGAG GGCGGCAGCC CCTATAAGTT CCTCGTGTAC
GACAACAACT GGGAGAAGCT GAAGGCCGAG GGCGTCGAGC ACGAGTACGG CGCGTTCACC
TGGTTCAACG CCGACCAGGA AGGCGTCATC GACGAGACCG GCGGCTTCTG GGAGGGCGAG
AACTACGACT CCGAGAAGGC GCGCCAAGGC CGCGAGGCCG CGCAGGACAA CCTGCTGCCG
GGCCAGACGC AGGGCTGGCT GCCCGATCCC ATGCCGTTCG ACCCCGCCAT CGACCCCGCG
CTGGAAGGCG AGTTCGAGAT CACGCTGAAG GACGGCAAGA CCGTGAAGGT CAAGCCGGTG
TGGGAGCACT ACAAGGCCCG CGCCGCCGAG TACAAGCCCG AGGTGGCGGC CGAGATCACC
GGCATCCCCG CCTCCGAGAT CGAGGCGGCG GCCACGGCCT ACGGCACGCG CATCGACCCG
TCCACCGGCT ACGGCAACGG CGGCATCCAG TACATGCTGG CGGTCGAGCA CTTCTGCAGC
GCCATCCAGA ACTGCAGCGC CTTCGACAAC CTCGTGGGCA TCACCGGCAA CATGGACACC
CCGGGCGGCA ACCGCGGCCC GACCATCGTC CCCATCGACG GCGACCTCCA GGGCTTCAGC
GCCTGGGCCC CCGGCGCCAC CACCCCGCCG GAGGAAGTCA ACCGCAAGCA GATCGGCATC
GACAAGTTCC CGCTTCTGGG CTGGTGGCAG TACTGGTGCG ACAGCCATTC GCTGTGGGAC
GCCGTCATCA CGGGCGACCC CTACCCGGTG CGCGCCCTCT GGAACGAGTC CGGCAACTTC
ATGAGCCAGA CGAACACCAC GCGCGCCTGG GAGGCCCTGT GCTCGCTTGA CTTCTACGTG
GACCTCAACC TGTGGCACAC GCCGCAGAAC GACACCGCCG ACATCATCCT GCCGGTGGCC
CATTGGATCG AGCTCAACTC GCCGCGCGCC AGCCAAGGTT CCGCCGGCGC CATGGGCGCC
ACGGTCAAGT GCGTGCAGCC GCCCGCGGAA GCCAAGTACG ATCCCGAGAT CGTCATGGAC
CTCGCCCGCC GCATGAACTG GAAGTGGACC GACGAGCCGG GCAACGAGTG GCCCGACATC
AACTGGCAGC TGGACGACTC CATCAAGCTG CTCACCGACG ACGAGCTGAC CTACACCACG
TGGCACGTCG AGAACGGCAA GCCCACGTTC GAGCGCCACG GCGTTCCGAT GGCCGAGGTC
ACGCCCAAGT ACAAGACGTG GGACGAGTAC GTCAAGGCCT TCCAGGAGCA CGGCTGGTGG
CAGGCGAAGG ACATCGAGCC GCGCAACTGG GGCACGTACC GCCGCTACCA GACCGGCGCG
ATGCGCGCAC GCGACCGCGT GTGGGGCCGC CTCGACTACA CGGCCGGCAA GGGCATCGGC
GACTGGAAGC CGGGCTGGTT CACCCCGACG ATGAAGCAGG AGATCTGGTC CACCGTCATG
GAATCGCACC ATCCCGACCA TCCCGAGTGG AGGCTTCCCA CCTACACCGA GCCGCCTCAC
GGCCCGAAGG ACGGCGACCG CATCAAGGAG TACCCGCTGA CCGCCACCAC CGGCCGTCGC
ATCCCGGTGT ACTTCCACTC CGAGCATCGT CAGCTGCCCT GGTGCCGCGA GCTGTGGCCC
GTGCCGCGCG TGGAGATCAA CCCGAAGACG GCCGCCGAGT ACGGCATCGA GCAGGGCGAC
TGGGTGTGGA TCGAAACCGA ATGGGGCAAG ATCCGCGAAG TGGCCGACCT GTACTACGGC
GTGAAGGAAG ACGTCATCAA CCTCGAGCAC ACGTGGTGGT ACCCCGAGGT GAAGGACGCC
GGCCACGGCT GGCAGTTCTC CCAGGTGAAC CAGCTGATCG ACCACTACGC CCAGGATCCG
CACTCCGGCA CATCCAACCT GCGCGCCTAC CAGGTGAAGA TCTACAAGGC CACGCCCGAG
AACTCGCCGT TCAACAACCC CGTGCCGTGC GACTCCACCG GCACGCCCAT CATCCATACC
TCCGACGACC CCCGTCTGAA GGAATGGCTG CCTACCTACG AAGGGAGGGA GTAA
 
Protein sequence
MGNLTMSRRT FVKTAAITGA AAAAFGASTH TALAEETYSS VSGNDTVAVK TCCRGCGKME 
CGVKVIVQNG RAIRVEGDEG AFQSMGNCCT KSQSSIQAAY HPDRLHYPMK RTNPKGEEPG
WQRISWDEAM QSIVDNFMDI KAKHGGEAIA CQVGTSRIWC MHSESILKNM LETPNNVEAW
QICKGPRHFA TTMVSQFAMS WMETITRPKV YVQWGGASEL SNYDDSCRTT VDVASRADVH
ISVDPRMANM GKEADYWQHL RPGTDGALAL AWTNVIIEKK LYDELYVKKW TNAPFLVCED
MEPSGFPTVR TDGSYWDVKT ALLKESDIKE GGSPYKFLVY DNNWEKLKAE GVEHEYGAFT
WFNADQEGVI DETGGFWEGE NYDSEKARQG REAAQDNLLP GQTQGWLPDP MPFDPAIDPA
LEGEFEITLK DGKTVKVKPV WEHYKARAAE YKPEVAAEIT GIPASEIEAA ATAYGTRIDP
STGYGNGGIQ YMLAVEHFCS AIQNCSAFDN LVGITGNMDT PGGNRGPTIV PIDGDLQGFS
AWAPGATTPP EEVNRKQIGI DKFPLLGWWQ YWCDSHSLWD AVITGDPYPV RALWNESGNF
MSQTNTTRAW EALCSLDFYV DLNLWHTPQN DTADIILPVA HWIELNSPRA SQGSAGAMGA
TVKCVQPPAE AKYDPEIVMD LARRMNWKWT DEPGNEWPDI NWQLDDSIKL LTDDELTYTT
WHVENGKPTF ERHGVPMAEV TPKYKTWDEY VKAFQEHGWW QAKDIEPRNW GTYRRYQTGA
MRARDRVWGR LDYTAGKGIG DWKPGWFTPT MKQEIWSTVM ESHHPDHPEW RLPTYTEPPH
GPKDGDRIKE YPLTATTGRR IPVYFHSEHR QLPWCRELWP VPRVEINPKT AAEYGIEQGD
WVWIETEWGK IREVADLYYG VKEDVINLEH TWWYPEVKDA GHGWQFSQVN QLIDHYAQDP
HSGTSNLRAY QVKIYKATPE NSPFNNPVPC DSTGTPIIHT SDDPRLKEWL PTYEGRE