Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1237 |
Symbol | |
ID | 8415528 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 1481319 |
End bp | 1484216 |
Gene Length | 2898 bp |
Protein Length | 965 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 645024200 |
Product | FAD dependent oxidoreductase |
Protein accession | YP_003181596 |
Protein GI | 257790990 |
COG category | [R] General function prediction only |
COG ID | [COG0579] Predicted dehydrogenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.18893 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.393596 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATACCG AACCTCTGCA GAGCATCGAC GTTGCGATCG TGGGAGCCGG CGTTGCCGGC GCGACGACGG CGCGTGCATT GGCGCGCTGG CGTCTGAACG TCGTGGTGCT TGAGGCAGGC AACGATGTGG CCTGCGGCGC GACGCGAGCG AACTCTGGCA TCGTGCATGC CGGCTACGAC CCTTTGCCTG GAACGCTCAA GGCTCGCTTC AACGCGGCCG GGTCCAAGCT GTTTCCGCAA TGGGCCGACG AGCTGGGATT CTCCTACGTC CGCAACGGCT CGCTCGTGCT CGCGTTCTCC GATGAGGAGC TGGCCAGCAT ACGGCGCCTC GTGGCGCGCG CGGCGGAGAA CGGCGTGGAA GGCGTGCGCG AGCTGGACGC CGCCGAAGTG CGCGCGCTCG AACCGCATGC GAGCCCGCAC GTGCGCGGCG GCCTGCTGGC CGAGACGGGC GCCATTTGCG ACCCGTACGA GGTTGCCCTG TTCTCGGCAG AGCAGGCGGC GCTGCACGGC ACGGCGTTCC GCTTCAACGA GCGCGTCGTG TCCGTCGAGC GCCTGGCCGC GGGCTCGCCC TCGTCCGCGC GCTATCTGCT GTCCACCTCG ACAGGCGCGC GGTACGCGGC GCGCGCGGTG GTGAACGCCG CCGGCGTGTT CGCCGACGAG CTGAACAATG CCGTGAGCGC GCATCGCCTG CGCATCGCGG CGCGGCGCGG CGAGTACTGC CTGTACGATT CCGAGTACGG CCCGCTGTTC TCGCATACCG TGTTTCAGGC GCCGTCGTCA GCGGGCAAGG GCGTGCTCGT GACGCCCACC GTGCACGGCA ACCTGCTGGT GGGGCCGAAC GCCGTGGAGC AGGCGAGCAA GACCGACCTG TCCACGAGCG CGGAGGGGCT GCGCTTCGTG CTGGACGCCG CGAAGAAGAC GTGGCCCGAC GCCGGCGCGC GCGGCATGAT CGCGAACTTC GCAGGGCTGC GCGCCTCGAA CGCCGACGGC GACGACTTCG TCATCGGCGA ACCGGACGAC GCGCCCGGGT TCTTCAACAT CGCCTGCTTC GACTCGCCGG GGCTCACCTC GGCTCCGGCC GTAGCCGAGC ACGTGGCGCA GGCGGTGGCG GAACAGCTGG GCGCGGAGGC GAACGGGGAA TTCCAGGCGA GTCGCGAGCG CTGCAAGCCG TTCGCCGAGC GCGACGAGGC CGAGCGTGCG CGCGCCATCG AGGCCGACCC GCGGTGGGGG CACATCGTGT GCCGCTGCTG CGAGGTGACC GAGGCCGAGA TCGTGGCCGC GCTGCACGCT CCGCTGCCCG TGCTGTCGCT CGACGCGCTG AAGTGGCGCA CGCGCGCGAT GATGGGGCGC TGCCACGGCG GGTTCTGCTC GCCGGAGATC GCGCGCATCG TGGCGCGCGA GACGGGCGTG GCGCCCGACG TGCTGGACAA GCGCCTGCCG GGATCGCCCG TGGTGGCCGC TTCGCGCCCC GATTACGCGG AGCTGGCGCG CAAGGGCGAG CGGTCGGAGG CGCAGGACGC GGAGCGCGAG CGCGCGCATG TCTACGACGT GGCGGTGGCG GGCGGCGGGG CAGCCGGCAT CGCCGCAGCC CAGGCGGCCG CGCAGCAGGG CGCGCGCGTG CTTTTGCTCG ACCGCGAGGA GAAGCTGGGC GGCATCCTCA AGCAGTGCGT GCACAACGGG TTCGGGCTGC ACCGCTTCGG CGTGGAGCTG ACGGGTCCCG AGTACGCGCA GCGCGAGATC GACGCGCTTG CGGACGCGGG CTCGGTGGAC GTGCTGGCGG GTGCCAGCGT GACGTCCGTC GATCCGGGGC GCCCGGACGA CGGAGCGCCG CTCACGGTGC ACGCGGTGGA CGCGCGCGGC GCGCATGCCA TCGCGGCGCG CGCCGTGGTG CTGGCCACCG GCTCGCGCGA GCGCGGGCTG GGCGCGCTCA ACCTGGCGGG CTCGCGTCCG TCGGGCGTGT TCTCGGCGGG CAGCGCGCAG AACTTCATGA ACCTCCAAGG GTGCCTGCCC GGACGTCGCG CGGTGATCCT CGGGTCGGGA GACATCGGGC TGATCATGGC GCGCCGTTTG GCGTCGCAGG GGGCCGAGGT GGTGGGCGTG CACGAGCTGA TGCCGCATCC GTCCGGTCTG CGTCGCAACG TGGTGCAGTG CCTGGACGAC TTCGGCATCC CGCTGCACCT CAGCTCCACG GTGACGCGGC TGGAAGGGGA GGGTCGCCTG AGCGCGGTGT ACGTGTCGCA GGTGGATCCC GAGACGATCC AGGTGATCCC CGGCACCGAG CAGCGCATCG CGTGCGACAC GCTGCTGCTG TCGGTGGGCC TCGTGCCCGA GAACGAGGTG GCGAAGTCGG CCGGGGTGGG GCTCGATCCC GTCACCGGCG GAGCGCGGGT GGACAACCGT TTGGCTACCG ACGTTCCCGG CGTGTTCGCC TGCGGCAACG CGCTGCACGT GCACGATCTG GTGGACCATG CGTCGCAAGA GGGCGAGCGC GCGGGCGCCG CCGCGGCCGC TTATGCGCGG CAGGCGGCCT CGGGCGCCTC GGCCGCGCGC GATGCGCATG TCGCCGTTCC CGTGATGGCG GGCGAGGACG TGCGCTACGT GGTGCCGCAG AGCATCGACG CCGCCACGCC GCCCGACGAG AAGCTCATGC TGTCGCTGCG CGTCGCGCGC ACGGTGAACG AGCCGCGCTT CGTGGTGGAG GGGATCGACG AAGCCGGCCG GGTGCGCGAG CTGAAGCGCG CGAAGACGAT GATCGCCGTG CCCGCCGAGA TGGTGCTCGT CGTCCTGCCC GCGGGCGCCG CGGCGGGGTG CTCGGCCGTG CGCGTGCGCG TCGAGGGCCG CGACGAGGCC GCGCGCGTGG CCGACGAGAC CGGCATGGCC GGAGGAGGCG CCGACTGA
|
Protein sequence | MDTEPLQSID VAIVGAGVAG ATTARALARW RLNVVVLEAG NDVACGATRA NSGIVHAGYD PLPGTLKARF NAAGSKLFPQ WADELGFSYV RNGSLVLAFS DEELASIRRL VARAAENGVE GVRELDAAEV RALEPHASPH VRGGLLAETG AICDPYEVAL FSAEQAALHG TAFRFNERVV SVERLAAGSP SSARYLLSTS TGARYAARAV VNAAGVFADE LNNAVSAHRL RIAARRGEYC LYDSEYGPLF SHTVFQAPSS AGKGVLVTPT VHGNLLVGPN AVEQASKTDL STSAEGLRFV LDAAKKTWPD AGARGMIANF AGLRASNADG DDFVIGEPDD APGFFNIACF DSPGLTSAPA VAEHVAQAVA EQLGAEANGE FQASRERCKP FAERDEAERA RAIEADPRWG HIVCRCCEVT EAEIVAALHA PLPVLSLDAL KWRTRAMMGR CHGGFCSPEI ARIVARETGV APDVLDKRLP GSPVVAASRP DYAELARKGE RSEAQDAERE RAHVYDVAVA GGGAAGIAAA QAAAQQGARV LLLDREEKLG GILKQCVHNG FGLHRFGVEL TGPEYAQREI DALADAGSVD VLAGASVTSV DPGRPDDGAP LTVHAVDARG AHAIAARAVV LATGSRERGL GALNLAGSRP SGVFSAGSAQ NFMNLQGCLP GRRAVILGSG DIGLIMARRL ASQGAEVVGV HELMPHPSGL RRNVVQCLDD FGIPLHLSST VTRLEGEGRL SAVYVSQVDP ETIQVIPGTE QRIACDTLLL SVGLVPENEV AKSAGVGLDP VTGGARVDNR LATDVPGVFA CGNALHVHDL VDHASQEGER AGAAAAAYAR QAASGASAAR DAHVAVPVMA GEDVRYVVPQ SIDAATPPDE KLMLSLRVAR TVNEPRFVVE GIDEAGRVRE LKRAKTMIAV PAEMVLVVLP AGAAAGCSAV RVRVEGRDEA ARVADETGMA GGGAD
|
| |