Gene Elen_3031 details

Gene Information

Locus tag: Elen_3031
Symbol:
ID: 8417365
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 3518954
End bp: 3522535
Gene Length: 3582 bp
Protein Length: 1193 aa
Translation table: 11
GC content: 65%
IMG OID: 645026010
Product: molydopterin dinucleotide-binding region
Protein accession: YP_003183363
Protein GI: 257792757
COG category: [C] Energy production and conversion
COG ID: [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
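
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TAA). The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and does not encode an amino acid.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
start_bp, end_bp = 3518954, 3522535
length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.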


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGAACC TATCACTGAG TCGTCGAAGC TTCTTGAAGG CCTCTGCCAT GGCTGCCGCG 
GCAACCACGG TGGGCTTCGC AGCAACGCCT TCGACGGCGT TGGCCGAAGG TGAAGACGCC
TCAGCGGGCG AGATCAAGCG CATCCGCTCC TGCTGCCGCG CGTGCGGCAA GGTGGAGTGC
GGCGTGTGGG TGACCGTTCA GGACAACAAG GTCATCAAAG TCGAGGGCGA CGAGTCCAAC
GCTCACAGCC GCGGTCACTG CTGCGCGAAG TCGCAGTCGT CCATGCTGGC TCTGTACCAT
CCCGATCGTC TGCGCTACTG CATGAAGCGC ACGAACCCCA AAGGCGAGGA CGACCCCGGT
TGGGTGCGCA TCACGCTTGC CGAGGCGTTC GACGAGGCGG GTGCCAAGTT CAACGAGATC
GTGGAGAAGT ACGGCGGGGA AGCCAACTTC GCCATGGGCG GCACCTCGCG TGTGTGGGCG
CAGCCGCCGT ACGGCACGCT GAAGTCCATC TTCCCCACGC CGAACGCGCA CCTGGCCTAC
GAGATCTGCA AGGGCCCGCG CCACTTCGGC GGCATCCTCA CCGACGAGAT CGGCTCGCCG
TGGATGGAGG TGGAGCAGGG CCCTCTGGTG TACGTGCAGT GGGGCACCGC GTCGGAGTAC
TCGAACTACG ACTCCACGAA CCGTACGGTG GTGGACTGCT CGCAGCGCGC CTACAAGCAC
ATCCTGGTGG ATCCCCGCAT GACGCCGCTG GGCAAGGAAG CCGACGTGTG GCTGCCGCTG
CGCGTGGGCA CCGACCTGTG CCTGTCGCTG GGTTGGCTCA AGTGGATCCT CGACAACGAG
GCGTACGACG ACGCGTTCGT GCGCCGTTGG ACGAACGCCC CGTTCTTGTG GAACCCCGAG
AAGGACGGCC GCACGGCGAA GGGTTGGTTC ATGGAAATGA ACGGCGGCAT CGACATGGAG
AGCCGCATCC TGACCGAGGC CGACTGCGAC CCCGAGTGGA TCGGCCAGTA CTGGGACTAC
GAGGGCCGCT ACCAGCGCTT CATCTGCTGG GACGAGAACA ACAACAAGCC TACGTACTGG
GATGCCGAGG CCTGCCAGTG GGAGGGCGAG AAGCACAAGA TCCCGACGAC GGGCACCTGG
ATCGAGCATC CGTACAAGCC CATCATCGCG GACGCCTGGC TGCCCGACCC CTCCAAGTTC
GCCAATCCGG CCGACGTGCA GTACGACGCC TACTGGGACG AGGGCAACGA GGGCGGCAAG
CGCTCGAACC CGGCCGGGCT GCCCAAGAAC CCGGCGCTGT TCCCCGGCGG CGTGGAGGTG
AAGCTGAAGG ACGGCTCCAC CATCAGCGCC GACACGGTGT GGGAGGCGTT CTCCGACAGC
CTGGAGCAGT ACACGCTGGA GTACGTGTCC GAGGTCACCG AGGTGCCGGT CGACAAGATC
GAGGAGGGCG TGCGCATCTA CACGACGCGC CTGAACCCGC TGCACGGCAA CGGCGGCATC
CACTACCAGC TTGCTCCCGA CCAGACGGGC CACGCGGTGC AGAACACCCG CGCGCTGCAG
CTGATCGCCT GCATCACCGG CAACTCCGAC GAGCCCGCCG GCAACCGCGG TTCGTCGAAG
GCCCAGGTGG ACGGCTGCTG CGGCCGCGCC AACATGCTGG TGACCGACCA TGAGCCGAAG
GACTGGGGCT TGGACGTGGG CACCATGGAG CTGGGCAACA CCCCGCGCGA CCTGTCCGTG
GAAGACCAGA TCCCGCTGAT CCAGAACTTC GTGCAGTACC TCATCGACGA GAAATCGCCC
TTGGCCGAGC GCTACGGCAA CAAGGTTCCC ACCGCCGAAG AGGCGCGCTG GATCGCCGAG
CGTAAGGGCG GCGCCTACAA GCCCAGCCGC TCGTGGCCGG CGCTCAAGAC GTCGTTCGAC
AGCAACGAGA AGCAGATTTC CGCCGAGCGC TTCCCGCTGC TGCGCTACTG GAACCGATGG
GCCGACTCCG CTGCCATCTG GGACTCCATC AACGGCATCG ACACGCCGTA CCAGATCCAC
GGCGGCGTGT GCATGTCGGG CGACTTCATG AACGAGTCCA ACCTGCTGGA GGCTTGGGAG
GCGCTCACCC GTCTGGACTT CTGGCTGGAC TTCAACCTGT GGTCCTGCCC GAACAACGGC
TGCGCCGACA TCGTGATCCC CGTGCTGCAC TGGCTTGAAG TGAACACGGG CCGCGTGTCG
CAGGGCGCGG GCGGCATCTT CGGCGCCGGG CAGCGCTGCG TGGAGCCGAT GGGCGATTGC
ATCTACGACC CGGTTGCCGT CATCTGCCTG TACAAGGCCA TGGGCGTGGT GTGGAACAAC
CGCGACCCCG AATACGACGA ATGGAACAAC CTGGACTACC GCGACTTCGT GCAGATGGGC
GGTAGCGTCG GTTACGAGGA GCAGGAGTAC CGCGTGCTCA AGGACGCCAC CGACTGGTGG
AAGACCGAGG AGTTCCCCGA CGGCCCCGAC TTCCCGCAGT ACGCGGCGAA GTTCCAGGAA
GAAGGCTGGT TCGACTGCCG CAAGTGGCAC CCCGAGCGTT GGGGCACGTA CCGCCGTTGG
GAGATGGGCT ACCGTCGCCA GCAGGGTGGC TACAACCTGT ACGCGGCCAT CGACGAGAAG
TGCGGCTTCA TGACCCCCAC CGCCAAGGTG GAGGTGTGGT CCACCATCGC CGAAACGTAC
ATCCCCGACG GCGCGGCCAC CTTCGCCAGC ACGAACACGG TCGATCCGAA CATCCCCGAC
ATCGACAAGT TCCCGCACTG GGTGGAGCCG AAGAACTCGC GCGTATCGAA TCCCGAGTAC
TTCGACGCCT CGCTGGTCGA CCAGATCAAG ACGTCGGACG CCTACATCAA CGACAACTAC
CAGGGCGATC ATCTGGTGGA GGAGTACAAG GAGGCGCTGA CGGCGCATCC GGACAGCGCG
TTCATCATGA CGACGGGCTC CCGCCAGCCG GTGTACTTCC ACTCCGAGCA TCGCCAGCTG
CCGTGGTGCC GCGAGCTGTG GCCCAGCCCT CGTTTGGAGA TGAACCCCAA CGACGCGGCG
CGCCTGGGGT TGGAGCAGGG GCAGTGGATT TGGATCCGCA GCCCGTGGGG CGCCATCCGC
GAGGTCGTGG ACTTGTACTA CGGCATCAAG GAGGGCACGG TCAACGCGAA CCACGCCTGG
TGGTATCCCG AGATCGACAC CGCTTCGCAC GGCTTCGAGC TGGTGAACAT CAACTGCACG
ATGGACAAGT ACGCGCAGTG TTGGATTTGC GGCGCGTCCC AGCTGCGCGG CGTGCCGGTG
CTCGTGTACC CCGCGACGCC CGAGAACTCG CCGCATGGCA ACCCGGTTCC GTGCGACCCG
CAAGGCAATC CGGTCATCAC GAACGCAAAC GATCCGCGCC TCAAGGAATG GTTGGCGAAC
GATCCGCGCC TGGAGGATTC CAAGGTGGAG CTCACGTTCG CGAACATGGC GGCGGTCGGC
TGCCAGCCGA GCGTCCAAAG CCCCGACCTG CTCTCCGGCG GCAAGCTGGC CGTCGGCAGC
GTGGGCGGCG ATGCGCTGGG CGCCTATTCG AAATCGATGT AA
 
Protein sequence
MANLSLSRRS FLKASAMAAA ATTVGFAATP STALAEGEDA SAGEIKRIRS CCRACGKVEC 
GVWVTVQDNK VIKVEGDESN AHSRGHCCAK SQSSMLALYH PDRLRYCMKR TNPKGEDDPG
WVRITLAEAF DEAGAKFNEI VEKYGGEANF AMGGTSRVWA QPPYGTLKSI FPTPNAHLAY
EICKGPRHFG GILTDEIGSP WMEVEQGPLV YVQWGTASEY SNYDSTNRTV VDCSQRAYKH
ILVDPRMTPL GKEADVWLPL RVGTDLCLSL GWLKWILDNE AYDDAFVRRW TNAPFLWNPE
KDGRTAKGWF MEMNGGIDME SRILTEADCD PEWIGQYWDY EGRYQRFICW DENNNKPTYW
DAEACQWEGE KHKIPTTGTW IEHPYKPIIA DAWLPDPSKF ANPADVQYDA YWDEGNEGGK
RSNPAGLPKN PALFPGGVEV KLKDGSTISA DTVWEAFSDS LEQYTLEYVS EVTEVPVDKI
EEGVRIYTTR LNPLHGNGGI HYQLAPDQTG HAVQNTRALQ LIACITGNSD EPAGNRGSSK
AQVDGCCGRA NMLVTDHEPK DWGLDVGTME LGNTPRDLSV EDQIPLIQNF VQYLIDEKSP
LAERYGNKVP TAEEARWIAE RKGGAYKPSR SWPALKTSFD SNEKQISAER FPLLRYWNRW
ADSAAIWDSI NGIDTPYQIH GGVCMSGDFM NESNLLEAWE ALTRLDFWLD FNLWSCPNNG
CADIVIPVLH WLEVNTGRVS QGAGGIFGAG QRCVEPMGDC IYDPVAVICL YKAMGVVWNN
RDPEYDEWNN LDYRDFVQMG GSVGYEEQEY RVLKDATDWW KTEEFPDGPD FPQYAAKFQE
EGWFDCRKWH PERWGTYRRW EMGYRRQQGG YNLYAAIDEK CGFMTPTAKV EVWSTIAETY
IPDGAATFAS TNTVDPNIPD IDKFPHWVEP KNSRVSNPEY FDASLVDQIK TSDAYINDNY
QGDHLVEEYK EALTAHPDSA FIMTTGSRQP VYFHSEHRQL PWCRELWPSP RLEMNPNDAA
RLGLEQGQWI WIRSPWGAIR EVVDLYYGIK EGTVNANHAW WYPEIDTASH GFELVNINCT
MDKYAQCWIC GASQLRGVPV LVYPATPENS PHGNPVPCDP QGNPVITNAN DPRLKEWLAN
DPRLEDSKVE LTFANMAAVG CQPSVQSPDL LSGGKLAVGS VGGDALGAYS KSM