Gene Information |
Locus tag | Elen_3106 |
Symbol | |
ID | 8417442 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 3612097 |
End bp | 3613515 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 645026086 |
Product | FAD dependent oxidoreductase |
Protein accession | YP_003183437 |
Protein GI | 257792831 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
|
|
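The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Neither convention is stated in the record itself.

```python
# Values copied from the Gene Information table.
start_bp = 3612097
end_bp = 3613515
gene_length_bp = 1419
protein_length_aa = 472

# Assumption: coordinates are 1-based and inclusive at both ends.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons, remainder = divmod(span, 3)
residues = codons - 1

print(span, codons, remainder, residues)  # 1419 473 0 472
```

Under these assumptions the span equals the listed gene length, it divides evenly into codons, and the codon count minus the stop codon equals the listed protein length.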
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0000790545 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCCTGGA GAGAGGCGGC CCCCCTTCGG GTGGCCGCTT TCGCCTCCGT GATGCGGGGG GGGGCAGGGC GCCCTTACCG TATTGGAAAA AAGAGAAGGA AGAGAGAGGA GAGGAACGTG GCTGAGACCG ATTTCGACGC GATCGTCGTG GGCTCCGGGT GCGCGGGCGC CGTGGCGGCC TACGAGCTCG CGAAGGCCGG CAAGTCGACC CTCGTGGTCG AGCGCGGCAA CTTCGCCGGC GCGAAGAACA TGACCGGCGG GCGCATCTAC TCGCACTCGC TGAAGAAGGT GTTCCCGGAC TTCGAGTCCG AGGCGCCGCT CGAGCGCAAG ATCACCCACG AGCGCATCGC GCTCATGGAC CCGGCCTCGC AGACGGCGGT CGACTTCACG AGCCCCGAGC TCGCCGAGGA GGGCAAGGAC TCCTACTCGG TCCTGCGCGC GCCCTTCGAC CAGTGGCTGG CCTCCAAGGC CGAGGACGCC GGCGCCGAGT ACATCTGCGG CATCGCCGTC GAGGAGCTCC TGAAGGACGG GTCCGGCAGG GTCGTCGGCG TGCGCGCCGG CGAGGACGAG ATCACCGCCG AGGCGACGAT CGTCGCCGAG GGCGTGAACA GCCTGCTCTG CGAGCGCAGC CTGGGCAACC CCCGCCCGAA GCCCTCCCAG ATGGCGGTCG GGATCAAGCA GGTGTTCGAG CTGCCGGCCT CCCAGATCGA GGACCGCTTC CTCGTGCCCG AGGGCGAGGG CGCGGCGATG CTGTTCGTCG GCGACTGCAC GCACGGAAAC GTCGGCGGAG GCTTCCTGTA CACGAACAGG GACTCCATCT CGCTCGGCCT CGTGGCCACC ATCTCGCTGG CCGCCGACGG CGCGAACGAG ACGCCGGTGT ACCAGATGCT CGAGGACTTC AAGAACCACC CGGCCGTGGC GCCCGTCATC CGCGGCGCGA AGCTCGTCGA GCACTCCGGC CACATGGTGC CCGAGGGCGG CTACGGCATG GTCCCGAAGT ACGTGTTCGA CGGCTGCCTG GTGGCCGGCG AGTCGGCGGG CCTGTGCATG AACATGGGCT ACCAGGTGCG AGGCATGGAC TTCGCCGTGG CCTCCGGCCA GATGGCGGGC CAGGCGGCCG TGCGCGCGCT CGACGCGGGC GACACGTCGG CAGCGGGCCT GTCCTCCTAC AAGGAGGCGA TGGAGGGCTC CTTCGTCATC CAGGACCTGC GCACCTTCTC CAAGTGGCCG CACGTCATGG AGGGCTGGGA CCGCATGTTC ACCGAGTACC CGGCCATGGC GCGCGACGTC TTCAACGCCA TGTTCAGCGT GGACGGGAAG CCCCAGAAGC CGCTGATGAA GCGCATGATG CCCATAGTCA AGCAGCGCGG CCTGTTCAAG CTGGCCGGCG AGGTTCGGAA GGCGGTGAAG TCGCTGTGA
|
Protein sequence | MPWREAAPLR VAAFASVMRG GAGRPYRIGK KRRKREERNV AETDFDAIVV GSGCAGAVAA YELAKAGKST LVVERGNFAG AKNMTGGRIY SHSLKKVFPD FESEAPLERK ITHERIALMD PASQTAVDFT SPELAEEGKD SYSVLRAPFD QWLASKAEDA GAEYICGIAV EELLKDGSGR VVGVRAGEDE ITAEATIVAE GVNSLLCERS LGNPRPKPSQ MAVGIKQVFE LPASQIEDRF LVPEGEGAAM LFVGDCTHGN VGGGFLYTNR DSISLGLVAT ISLAADGANE TPVYQMLEDF KNHPAVAPVI RGAKLVEHSG HMVPEGGYGM VPKYVFDGCL VAGESAGLCM NMGYQVRGMD FAVASGQMAG QAAVRALDAG DTSAAGLSSY KEAMEGSFVI QDLRTFSKWP HVMEGWDRMF TEYPAMARDV FNAMFSVDGK PQKPLMKRMM PIVKQRGLFK LAGEVRKAVK SL
|
| |
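The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method. It assumes that the amino acid assignments of translation table 11 match the standard genetic code (the tables differ in which codons may act as start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and introduced only for illustration.

```python
from itertools import product

# Codon table built in the conventional TCAG order.
# Assumption: table 11 shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between the 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks and last three codons of the gene sequence above.
first_blocks = "ATGCCCTGGA GAGAGGCGGC CCCCCTTCGG"
last_codons = "TCGCTGTGA"

print(translate(first_blocks))  # MPWREAAPLR
print(translate(last_codons))   # SL
```

The two printed fragments match the first ten and last two residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared against the GC content field of the record.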