Gene Elen_1237 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1237 
Symbol 
ID8415528 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1481319 
End bp1484216 
Gene Length2898 bp 
Protein Length965 aa 
Translation table11 
GC content73% 
IMG OID645024200 
ProductFAD dependent oxidoreductase 
Protein accessionYP_003181596 
Protein GI257790990 
COG category[R] General function prediction only 
COG ID[COG0579] Predicted dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.18893 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.393596 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATACCG AACCTCTGCA GAGCATCGAC GTTGCGATCG TGGGAGCCGG CGTTGCCGGC 
GCGACGACGG CGCGTGCATT GGCGCGCTGG CGTCTGAACG TCGTGGTGCT TGAGGCAGGC
AACGATGTGG CCTGCGGCGC GACGCGAGCG AACTCTGGCA TCGTGCATGC CGGCTACGAC
CCTTTGCCTG GAACGCTCAA GGCTCGCTTC AACGCGGCCG GGTCCAAGCT GTTTCCGCAA
TGGGCCGACG AGCTGGGATT CTCCTACGTC CGCAACGGCT CGCTCGTGCT CGCGTTCTCC
GATGAGGAGC TGGCCAGCAT ACGGCGCCTC GTGGCGCGCG CGGCGGAGAA CGGCGTGGAA
GGCGTGCGCG AGCTGGACGC CGCCGAAGTG CGCGCGCTCG AACCGCATGC GAGCCCGCAC
GTGCGCGGCG GCCTGCTGGC CGAGACGGGC GCCATTTGCG ACCCGTACGA GGTTGCCCTG
TTCTCGGCAG AGCAGGCGGC GCTGCACGGC ACGGCGTTCC GCTTCAACGA GCGCGTCGTG
TCCGTCGAGC GCCTGGCCGC GGGCTCGCCC TCGTCCGCGC GCTATCTGCT GTCCACCTCG
ACAGGCGCGC GGTACGCGGC GCGCGCGGTG GTGAACGCCG CCGGCGTGTT CGCCGACGAG
CTGAACAATG CCGTGAGCGC GCATCGCCTG CGCATCGCGG CGCGGCGCGG CGAGTACTGC
CTGTACGATT CCGAGTACGG CCCGCTGTTC TCGCATACCG TGTTTCAGGC GCCGTCGTCA
GCGGGCAAGG GCGTGCTCGT GACGCCCACC GTGCACGGCA ACCTGCTGGT GGGGCCGAAC
GCCGTGGAGC AGGCGAGCAA GACCGACCTG TCCACGAGCG CGGAGGGGCT GCGCTTCGTG
CTGGACGCCG CGAAGAAGAC GTGGCCCGAC GCCGGCGCGC GCGGCATGAT CGCGAACTTC
GCAGGGCTGC GCGCCTCGAA CGCCGACGGC GACGACTTCG TCATCGGCGA ACCGGACGAC
GCGCCCGGGT TCTTCAACAT CGCCTGCTTC GACTCGCCGG GGCTCACCTC GGCTCCGGCC
GTAGCCGAGC ACGTGGCGCA GGCGGTGGCG GAACAGCTGG GCGCGGAGGC GAACGGGGAA
TTCCAGGCGA GTCGCGAGCG CTGCAAGCCG TTCGCCGAGC GCGACGAGGC CGAGCGTGCG
CGCGCCATCG AGGCCGACCC GCGGTGGGGG CACATCGTGT GCCGCTGCTG CGAGGTGACC
GAGGCCGAGA TCGTGGCCGC GCTGCACGCT CCGCTGCCCG TGCTGTCGCT CGACGCGCTG
AAGTGGCGCA CGCGCGCGAT GATGGGGCGC TGCCACGGCG GGTTCTGCTC GCCGGAGATC
GCGCGCATCG TGGCGCGCGA GACGGGCGTG GCGCCCGACG TGCTGGACAA GCGCCTGCCG
GGATCGCCCG TGGTGGCCGC TTCGCGCCCC GATTACGCGG AGCTGGCGCG CAAGGGCGAG
CGGTCGGAGG CGCAGGACGC GGAGCGCGAG CGCGCGCATG TCTACGACGT GGCGGTGGCG
GGCGGCGGGG CAGCCGGCAT CGCCGCAGCC CAGGCGGCCG CGCAGCAGGG CGCGCGCGTG
CTTTTGCTCG ACCGCGAGGA GAAGCTGGGC GGCATCCTCA AGCAGTGCGT GCACAACGGG
TTCGGGCTGC ACCGCTTCGG CGTGGAGCTG ACGGGTCCCG AGTACGCGCA GCGCGAGATC
GACGCGCTTG CGGACGCGGG CTCGGTGGAC GTGCTGGCGG GTGCCAGCGT GACGTCCGTC
GATCCGGGGC GCCCGGACGA CGGAGCGCCG CTCACGGTGC ACGCGGTGGA CGCGCGCGGC
GCGCATGCCA TCGCGGCGCG CGCCGTGGTG CTGGCCACCG GCTCGCGCGA GCGCGGGCTG
GGCGCGCTCA ACCTGGCGGG CTCGCGTCCG TCGGGCGTGT TCTCGGCGGG CAGCGCGCAG
AACTTCATGA ACCTCCAAGG GTGCCTGCCC GGACGTCGCG CGGTGATCCT CGGGTCGGGA
GACATCGGGC TGATCATGGC GCGCCGTTTG GCGTCGCAGG GGGCCGAGGT GGTGGGCGTG
CACGAGCTGA TGCCGCATCC GTCCGGTCTG CGTCGCAACG TGGTGCAGTG CCTGGACGAC
TTCGGCATCC CGCTGCACCT CAGCTCCACG GTGACGCGGC TGGAAGGGGA GGGTCGCCTG
AGCGCGGTGT ACGTGTCGCA GGTGGATCCC GAGACGATCC AGGTGATCCC CGGCACCGAG
CAGCGCATCG CGTGCGACAC GCTGCTGCTG TCGGTGGGCC TCGTGCCCGA GAACGAGGTG
GCGAAGTCGG CCGGGGTGGG GCTCGATCCC GTCACCGGCG GAGCGCGGGT GGACAACCGT
TTGGCTACCG ACGTTCCCGG CGTGTTCGCC TGCGGCAACG CGCTGCACGT GCACGATCTG
GTGGACCATG CGTCGCAAGA GGGCGAGCGC GCGGGCGCCG CCGCGGCCGC TTATGCGCGG
CAGGCGGCCT CGGGCGCCTC GGCCGCGCGC GATGCGCATG TCGCCGTTCC CGTGATGGCG
GGCGAGGACG TGCGCTACGT GGTGCCGCAG AGCATCGACG CCGCCACGCC GCCCGACGAG
AAGCTCATGC TGTCGCTGCG CGTCGCGCGC ACGGTGAACG AGCCGCGCTT CGTGGTGGAG
GGGATCGACG AAGCCGGCCG GGTGCGCGAG CTGAAGCGCG CGAAGACGAT GATCGCCGTG
CCCGCCGAGA TGGTGCTCGT CGTCCTGCCC GCGGGCGCCG CGGCGGGGTG CTCGGCCGTG
CGCGTGCGCG TCGAGGGCCG CGACGAGGCC GCGCGCGTGG CCGACGAGAC CGGCATGGCC
GGAGGAGGCG CCGACTGA
 
Protein sequence
MDTEPLQSID VAIVGAGVAG ATTARALARW RLNVVVLEAG NDVACGATRA NSGIVHAGYD 
PLPGTLKARF NAAGSKLFPQ WADELGFSYV RNGSLVLAFS DEELASIRRL VARAAENGVE
GVRELDAAEV RALEPHASPH VRGGLLAETG AICDPYEVAL FSAEQAALHG TAFRFNERVV
SVERLAAGSP SSARYLLSTS TGARYAARAV VNAAGVFADE LNNAVSAHRL RIAARRGEYC
LYDSEYGPLF SHTVFQAPSS AGKGVLVTPT VHGNLLVGPN AVEQASKTDL STSAEGLRFV
LDAAKKTWPD AGARGMIANF AGLRASNADG DDFVIGEPDD APGFFNIACF DSPGLTSAPA
VAEHVAQAVA EQLGAEANGE FQASRERCKP FAERDEAERA RAIEADPRWG HIVCRCCEVT
EAEIVAALHA PLPVLSLDAL KWRTRAMMGR CHGGFCSPEI ARIVARETGV APDVLDKRLP
GSPVVAASRP DYAELARKGE RSEAQDAERE RAHVYDVAVA GGGAAGIAAA QAAAQQGARV
LLLDREEKLG GILKQCVHNG FGLHRFGVEL TGPEYAQREI DALADAGSVD VLAGASVTSV
DPGRPDDGAP LTVHAVDARG AHAIAARAVV LATGSRERGL GALNLAGSRP SGVFSAGSAQ
NFMNLQGCLP GRRAVILGSG DIGLIMARRL ASQGAEVVGV HELMPHPSGL RRNVVQCLDD
FGIPLHLSST VTRLEGEGRL SAVYVSQVDP ETIQVIPGTE QRIACDTLLL SVGLVPENEV
AKSAGVGLDP VTGGARVDNR LATDVPGVFA CGNALHVHDL VDHASQEGER AGAAAAAYAR
QAASGASAAR DAHVAVPVMA GEDVRYVVPQ SIDAATPPDE KLMLSLRVAR TVNEPRFVVE
GIDEAGRVRE LKRAKTMIAV PAEMVLVVLP AGAAAGCSAV RVRVEGRDEA ARVADETGMA
GGGAD