Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1014 |
Symbol | |
ID | 8415304 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 1229446 |
End bp | 1230519 |
Gene Length | 1074 bp |
Protein Length | 357 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 645023978 |
Product | oxidoreductase molybdopterin binding |
Protein accession | YP_003181375 |
Protein GI | 257790769 |
COG category | [R] General function prediction only |
COG ID | [COG2041] Sulfite oxidase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0972537 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.212982 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAAGC CAGCTCAGAA GATCATGGCC GGCGTGGCGG GCGCGGTGCT GCTCGCATCC GGCGCCGGCG CGGCCCTTGC CGCCCAGCAG CCCGCGGCTG CGGACGGGGG CTCCCGGCCC GTCGGCGCCA CCCACACGGT GGCCGACCAC GACATCCGCG CCCAGTGGCT GGGCGAGGAA TCCGACTACG TGCGCGTCGC GGACGTGCAG GGGTCGTTCA CGTTCAATCA GGAGGGCGTC ACGCCCAACG ACGAGCTGTT CAACGTGTTC GGAACCGCCA TCCTGTCGAT GTGCTCCAAG CCCGCGCCCG AGCTTGCCGC CGGGCAGGAC GGCGTGGCCA CCTACTTCGT GAACGTGGGC GGGCACGTGA AGGAGAGCTT CACGGTGGAC CTGTCCGAGC TCGACGACGA GGAGCAGGAG GCGCTCATGG GCTGCTCGTG CGCCACGGGG TCGCCCTTCG GCCAGGCGGC CGTCATCGGC GTGCCGCTGG CGTCGGTGGT GGGCATGGCC GACCTCGAGG ACGGCGTGAA CACCGTGACG GCCTACGGCG CGGACGGCTA CGGCGAGCCG CTGCCGCTGC AGTACGCGCT CGACAAGAAC GCGCTGCTCG TGTACCAGGT GAACGGCGAG GAGCTGAAGG CGTCGGAGGG CTCGAGCCTG CAGCTGTGGA TGCCCGAGAC GGTGGCGCGC TACTTCACGC GCAACATCGC CAGCATCGAG CTCACGCGCG AGGACGCGGT GCCCGAGGTG GCCTCGGTCG ATCCCATGTA CCGCAACAAG ATCGAGATCA AGAACTCCGC CGACGGCTGC GCGTTCGAGG CGGGCGACGA GATCACGTTC GAGGGCGTGG CCGACGACTG CGGAAGCCCC ATCGCCGCCA TCGAGTTCTC CTTCGACGGC GGGCGCACCT GGACGGCGTG CGACACCGAC GGCGCCACGG CCGACAAGTG GGTGAACTGG CAGTTCACCG CCTCGTTCGA GGAGAAGGGC GACTACGAGA TGACCGTGCG CGCCCGCACG GCCGACGACG TGGTGTCGCC GCTGTCCGCC AGCCTTGCCT TCGCGGTGCG GTAG
|
Protein sequence | MNKPAQKIMA GVAGAVLLAS GAGAALAAQQ PAAADGGSRP VGATHTVADH DIRAQWLGEE SDYVRVADVQ GSFTFNQEGV TPNDELFNVF GTAILSMCSK PAPELAAGQD GVATYFVNVG GHVKESFTVD LSELDDEEQE ALMGCSCATG SPFGQAAVIG VPLASVVGMA DLEDGVNTVT AYGADGYGEP LPLQYALDKN ALLVYQVNGE ELKASEGSSL QLWMPETVAR YFTRNIASIE LTREDAVPEV ASVDPMYRNK IEIKNSADGC AFEAGDEITF EGVADDCGSP IAAIEFSFDG GRTWTACDTD GATADKWVNW QFTASFEEKG DYEMTVRART ADDVVSPLSA SLAFAVR
|
| |