Gene Elen_1190 details


Gene Information

Locus tag: Elen_1190
Symbol:
ID: 8415481
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 1426167
End bp: 1429031
Gene Length: 2865 bp
Protein Length: 954 aa
Translation table: 11
GC content: 67%
IMG OID: 645024153
Product: molybdopterin oxidoreductase
Protein accession: YP_003181549
Protein GI: 257790943
COG category: [C] Energy production and conversion
COG ID: [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
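
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record; the helper names are hypothetical.
START_BP = 1426167
END_BP = 1429031
GENE_LENGTH_BP = 2865
PROTEIN_LENGTH_AA = 954


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 2865
    print(expected_protein_length(GENE_LENGTH_BP))  # 954
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields: 1429031 - 1426167 + 1 = 2865 bp, and 2865 / 3 - 1 = 954 aa.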


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.537056
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCCTGC TCGACAAAAG CCTGAAGCGA CGTGATTTCC TGAAGGGGAC GGCGGCGGCC 
GCTGCGGCGA CGGCGGCGTT CGGGCTGTCC GGCTGCTCGA CGGGCTCTGC GCAGGCCGCC
GGGTCCGACG GCGCGCACGT CGTGGCATCC GACGCCAGCA TCCTGGCCGA CGCGGGCGAG
TGGATGCCCA TCCACTGCCA CCAGAACTGC AATCAGATGT GCCTGAACAT GGGCTATGTG
GTGGACGGCG TGGTCGTCCG CCAGAAGACC GACGACGCGC GCGAGGACAG CTTCGACTGC
CCGCAGCAGC GCGGCTGCCT GCGCGGGCGG TCCCTGCGCC AGCAGGTGTA CAACGCCGAC
CGCATCAAGT ACCCCATGAA GCGCAAGAGC TGGCAGCCGG GCGGCGCCGA GAACGCCCAC
GGCGAGCTGC GCGGCAAGGA CGAGTGGGAG CGCATCAGCT GGGAAGAGGC GCTCGACTAC
ATCGCGGACG AGCTGAAGCG CGTCTACGCC GAATACGGCC AGGACGCCGT CATCTGCAAC
GGCTGGCGCT GGGCGCCCGG CTCGGCCATG TTCCCCGTCA TCGGCGGCGC GGTATACGAC
ACCGAGACCG AGTCGTTCGG CTGCTGGGCA TTCCAGACCG AGGCGCTGGG CCTGTACTCC
TGGGGCGACC ACCCCGATAT CATGATGGGC CCCGACAAGT ACGACCTGCC GAACGCCGAC
ACCATCGTGT TGTACGGCTG CAATCCCGCC TGGGCTCAGC ACTCCAGCAT GTACTGGCTG
AACAACGCAA AGGAGTCCGG CACCGGGTTC GTGTACGTGG GCCCCAGCTA CAACGTGACC
GCCGCGCAGC TGGGCGCGCG CTGGATCCGC GTGCGGCCGG GCACCGACAC GGCGTTCTTG
CTGGCCGTGA TCTACGAGAT GATCCGTCTG GACGGGGAGC GCGGCGACAT CATCGATTGG
GACTTCGTGA ACGAGCGCAC GGTGGGCTTC ACGCCCGAGA CCATGCCCGA GGATGCGACG
ACCGACGAGA ACTACCGCGA CTACATCCTG GGCGCCTACG ACGGCACGCC GAAAACCCCG
GAGTGGGCGA GCGAGATCTG CGGCACGCCG GTCGAGGACA TCACGTGGTA CGCGGAGCTC
GCCGCCAAGG ACAACAAGGT GATCTTCTTC CACAGCTACG CGGCCTCCCG CTACCTGGGC
GCCGAGAACC TGCCGCAGGC GTTCATGACC GTGTCGGCGC TGGGCGGCCA TTACGGCAAG
TCGGGCCACG GCTCGGCCGC CATCTACACG TGGGACGCCG GCGACTCGGG CTACCGACTC
ATCCAGCACG CGGGCGGCGA GTACGCCTAC ATCGACAACT TGGTAGGCTC GCCGGGCGCT
ACCGGTCCGA ACCGCTGCAT CGAAGGCAAC TCGTGGTGGA GCTCGCTGGC CGAGGGGAAG
TACTTGTCCA CGTCCGAGGG CCCCTACGAC CTGGGGTCGG GCGACGATCC GACCAAGCTG
CGCGCGAACA CGCCCACCTA CCATGCGGCG CGCGAGATGC CGGTGAACCC GCGCCTCATG
TTCGCCACGT GCAGCAACTT CATGCAGACG CGCGGCAACC TGCCCACCGC CATCGAGGTG
ATGCGCGCCG CCGACACCTG CATCTCGCTG GAGATCAAGT ACTCACTGAC CGCCTCGTTC
GCCGACATCA TCCTGCCGGT GGCCACCCAC TGGGAGGGCA ACGACGACGA GAGCTGGGGC
GAGCTCTGCT GGCCGAGCCC CTTCGGCGAC GGCAACGGGC AGAAGCAGCG CAAGGACGCG
CTTCTGGCTT GGCGCCCGTT GGTGAAGCCG ATGTACGAGG CGCGCGAGGA GAAGCGCATC
TGCCGCGACA TCGTCGAGCG CATGGGCTTC GACGCGGACG ACGCCTATCC CAAGAGCAAC
TACGACCAGT GGCTGGGCTA CTTCCTGGGG ATGCGCGAGC TGTCCGAAGA CCTCTCCCGT
TGGGAGCCGG TGATCACCTG GACGGCCGAT GACAACCGGA AGCACCATGC GAACTACCCG
GAACAGCAGG GCAAGATCTC GTTCGACCAG TTCATGGCCG ACGGCTCGTA CGTGGTGCGC
CGTTCGCCCG ACGACAGGCG CAACTACATC GGCTACCGCG ACGACAAGCT GCGCATCGGT
GAGAACGGCG AGGTAGTCGT GGCCGACACC GCGTGGCCGC GCCCGTCGCG CTCGGGCAAG
CTGGAGATAT ACTGCCAGTT CAAAGCCGAC AACGTGAACC GCACGGGACT CAATCCCGAG
CCCATCAAGC CCTACGCGAA CTACTTCGTG CCCAATCGCG GCTACCAGGA CACGTTCGCC
GACTGGGACG CCAAGGTGAA GGGCGCCTAT CCTCTGCAGG CGTACACGCC GCATTACATG
CGCCGCGCGC ACACCTGCTA CGACAACATG ACGTGGACGC AGGAGGCGTT CAGGAACCCG
GTGTTCATGA ACGCGCAGGA TGCTGAGGAG CGCGGCATCG AGGCGGGCGA CACGGTGGTG
TGCTACAACG ACTTCGGCCG CATGCTGCGC ATCGCCCAGC CGCTGCAGGG GATGATGCCG
GGCACCGTCG GCATCCCGCA CGGCGTGCGC TCGCTGTTCG ACGAGAGCGA CCCCGCGGGC
ATCGTCGATC GCGGCGGATC CGAGCAGATG CTGTCCGACG GGCAGCAGTC GAACTACTTC
CCCCAGGTGG ACGGGTACAA CAGCCTGCTC ATCGAGATCG AGAAGTACGA CGGCGAGGCG
CTGGTGGAAG ACTGCGACCG CGGCCCGTTC CTGGCTGCCG GCATCGACGC CGAGGGAACG
CCCGCCTACG TCGCCGAAGG CATGTACGAA GGCAAGGAGG CTTAA
 
Protein sequence
MSLLDKSLKR RDFLKGTAAA AAATAAFGLS GCSTGSAQAA GSDGAHVVAS DASILADAGE 
WMPIHCHQNC NQMCLNMGYV VDGVVVRQKT DDAREDSFDC PQQRGCLRGR SLRQQVYNAD
RIKYPMKRKS WQPGGAENAH GELRGKDEWE RISWEEALDY IADELKRVYA EYGQDAVICN
GWRWAPGSAM FPVIGGAVYD TETESFGCWA FQTEALGLYS WGDHPDIMMG PDKYDLPNAD
TIVLYGCNPA WAQHSSMYWL NNAKESGTGF VYVGPSYNVT AAQLGARWIR VRPGTDTAFL
LAVIYEMIRL DGERGDIIDW DFVNERTVGF TPETMPEDAT TDENYRDYIL GAYDGTPKTP
EWASEICGTP VEDITWYAEL AAKDNKVIFF HSYAASRYLG AENLPQAFMT VSALGGHYGK
SGHGSAAIYT WDAGDSGYRL IQHAGGEYAY IDNLVGSPGA TGPNRCIEGN SWWSSLAEGK
YLSTSEGPYD LGSGDDPTKL RANTPTYHAA REMPVNPRLM FATCSNFMQT RGNLPTAIEV
MRAADTCISL EIKYSLTASF ADIILPVATH WEGNDDESWG ELCWPSPFGD GNGQKQRKDA
LLAWRPLVKP MYEAREEKRI CRDIVERMGF DADDAYPKSN YDQWLGYFLG MRELSEDLSR
WEPVITWTAD DNRKHHANYP EQQGKISFDQ FMADGSYVVR RSPDDRRNYI GYRDDKLRIG
ENGEVVVADT AWPRPSRSGK LEIYCQFKAD NVNRTGLNPE PIKPYANYFV PNRGYQDTFA
DWDAKVKGAY PLQAYTPHYM RRAHTCYDNM TWTQEAFRNP VFMNAQDAEE RGIEAGDTVV
CYNDFGRMLR IAQPLQGMMP GTVGIPHGVR SLFDESDPAG IVDRGGSEQM LSDGQQSNYF
PQVDGYNSLL IEIEKYDGEA LVEDCDRGPF LAAGIDAEGT PAYVAEGMYE GKEA
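
The blocked sequences above (groups of 10 characters, 6 groups per line) need their whitespace removed before use. The following is a minimal sketch, not the database's own method, of how the gene sequence could be cleaned, its GC fraction computed, and the protein sequence re-derived. It assumes the amino acid assignments of translation table 11 match the standard code (the two tables differ, as far as I know, only in which codons may act as start codons), and it does not handle selenocysteine recoding of TGA. All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments,
# assumed here to apply to translation table 11 as well).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence; uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(blocked: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(blocked)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(blocked: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    seq = clean(blocked)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First line of the gene sequence in this record.
    first_line = (
        "ATGTCCCTGC TCGACAAAAG CCTGAAGCGA "
        "CGTGATTTCC TGAAGGGGAC GGCGGCGGCC"
    )
    print(translate(first_line))  # MSLLDKSLKRRDFLKGTAAA
```

The 20 residues printed for the first 60 bases match the first two blocks of the protein sequence listed above. Passing the full gene sequence to gc_fraction gives a value to compare with the GC content field.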