Gene Information |
Locus tag | Elen_1353 |
Symbol | |
ID | 8415651 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 1619031 |
End bp | 1620665 |
Gene Length | 1635 bp |
Protein Length | 544 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645024322 |
Product | Dak phosphatase |
Protein accession | YP_003181711 |
Protein GI | 257791105 |
COG category | [R] General function prediction only |
COG ID | [COG1461] Predicted kinase related to dihydroxyacetone kinase |
TIGRFAM ID | [TIGR03599] DAK2 domain fusion protein YloV |
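
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS ending in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(1619031, 1620665)
print(length)                     # 1635
print(protein_length_aa(length))  # 544
```

Both results match the Gene Length and Protein Length values listed in the record.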
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000477806 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.2025 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAGAAC CCTATACCGC GAACGACCTG CTCAACGCCA TCGCCGTCGC GAGCAAGACC CTGAGCGAGC GCAAGGACGA GATCAACCGT CTGAACGTGT TCCCGGTGCC CGACGGCGAC ACGGGCACGA ACATGTCGCT GACGCTGGAG ACGGTCGTCG AGAACCTGGC CAACCTGCCC ATCGGGGCGG CCGGCGCGGA AATTCGCAAG GCGATCACGA CCGGCGCGCT CATGGGCGCA CGCGGAAACT CCGGTGTCAT TACCTCGCAA ATTTTGCGCG GCCTGTGCGA GGGCAGCGTA GGCCATGACG AGCTGAACGC CGACAGCATC GACGCGGCGT TCGCGAAATC GCAGGAAGTG GCGTTCCAGG CCGTCCGCAA GCCGGTCGAG GGCACCATCC TCACCGTGCT GCGCGACAGC GCCGCCGCCG CGAAGCACGC CCGTAAGAAG AAGATGGGCT GCGACGAGGC GCTGGCCTAC GTGGTGGAGG AGGCCTACGC CTCCGTGCAG CGCACGCCCG ACCTGCTGCC CGTGCTCAAG GAGAATGGCG TGGTGGACGC GGGCGGCTTC GGCCTGGCCA TCTTCTTCGA CGCGTTCGTC TCAGCGCTGC TGGGCAAGGA AGGCCCCATG GTGGACGAGC TGGCGTTCGC GCGCGGCACG GCGCCGAAGG TGGAGATCGA GCAGATCAAC GACTGGGAGG GGTCGGCGTA CCGCTACTGC ACCGAGTTCC TCGTGCATTC CGACACGGTG GACGTGGACG CGGCCAAGGA CTTCCTGCCC ACGATGGGCG ACTGCGACCT CATGGTGGGC ATGCACCCCA ACTTCAAGGT GCACGTGCAC TCGAACCGCC CCGACCAGGT GCTGGGCTGG TTCCTCACGC ACGATGCGCA GATCTCCGAG GTGCACATCC ACAACATGCA GCAGCAGAGC GCCGCGCGCA CCGACGCGCT GGCCGCCGAG CAGGGGGAGG CGCCCAAGCC GCTCGGATTC GTGGCCGTGG CCGCGGGCGA GGGCAACGCG AAGATCCTCA AGAGCCTGGG CGTGGACGTG GTGGTGTCCG GCGGGCAGAC CATGAACCCG TCCACGAAGG ACCTGCTTGA TGCGGCGGGT CAGGTGAACG CCGACGCCGT CATCATCCTG CCCAACAACA AGAACATCAT CATGGCCGCC CAGAGCGCCT GCGAGCTGGC CGAGACGCCG TGCGCCGTGG TTCCCACGAG AAGCGTGCCC GAGGCGTTCG CCGCCCTGTT CGGTTTCGAC GAGGGCGCCA GCCTCGAAGA GAACGTCGAG TCGATGACCG AGGCCTACGC CGACGTGAAG ACCGGCGAGG TGACCGTGGC AATCAAGGAT TCCAAGGACG CGCACGACAA CCCCATCAAG GAGGGCGACG TCATCGGCAT CGCCGACGGG GCCATCGAGG CCGTGGGCTC CACGACCGAG GACGTGGTCA TGGCGCTGCT CGGCACGATG GAGGCCGAAG ACGCCGACAC GCTCACCATC CTGGCGGGCG AGGATATGGG GGATGACGCC TTCGACGCGC TGATCGCGCG CATCGAGGAT GCCTACGACG ACCTCGAGAT CGACGCCCAC CGCGGCGACC AGCCCTTGTA CCCGGTGGTC ATGTCCGTTG AATAA
|
Protein sequence | MPEPYTANDL LNAIAVASKT LSERKDEINR LNVFPVPDGD TGTNMSLTLE TVVENLANLP IGAAGAEIRK AITTGALMGA RGNSGVITSQ ILRGLCEGSV GHDELNADSI DAAFAKSQEV AFQAVRKPVE GTILTVLRDS AAAAKHARKK KMGCDEALAY VVEEAYASVQ RTPDLLPVLK ENGVVDAGGF GLAIFFDAFV SALLGKEGPM VDELAFARGT APKVEIEQIN DWEGSAYRYC TEFLVHSDTV DVDAAKDFLP TMGDCDLMVG MHPNFKVHVH SNRPDQVLGW FLTHDAQISE VHIHNMQQQS AARTDALAAE QGEAPKPLGF VAVAAGEGNA KILKSLGVDV VVSGGQTMNP STKDLLDAAG QVNADAVIIL PNNKNIIMAA QSACELAETP CAVVPTRSVP EAFAALFGFD EGASLEENVE SMTEAYADVK TGEVTVAIKD SKDAHDNPIK EGDVIGIADG AIEAVGSTTE DVVMALLGTM EAEDADTLTI LAGEDMGDDA FDALIARIED AYDDLEIDAH RGDQPLYPVV MSVE
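
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch for removing that grouping and computing a GC fraction follows. The helper names are hypothetical, and the example uses only the first two blocks of the gene sequence, so its result is not expected to equal the gene-wide GC content reported in the record.

```python
def ungroup(seq: str) -> str:
    """Remove whitespace between printed blocks and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    bases = ungroup(seq)
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First two blocks of the gene sequence shown in the record.
first_blocks = "ATGCCAGAAC CCTATACCGC"
print(len(ungroup(first_blocks)))  # 20
print(gc_fraction(first_blocks))   # 0.55
```

Passing the full gene sequence string to `gc_fraction` would give the value to compare against the GC content field.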