Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1229 |
Symbol | |
ID | 8415520 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 1476235 |
End bp | 1477233 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645024192 |
Product | Glycerone kinase |
Protein accession | YP_003181588 |
Protein GI | 257790982 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2376] Dihydroxyacetone kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00208201 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.114494 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGATGA AGAAGTTCAT CAACGACCCC GACAACCTCA CCGCGGAGCT GCTCGAGGGC CTGGCCCTGG CCAACCCCGA CATTCTCGAG CTGGGCGAGG ACAACATGGT CATCAACAAG AAGCTGGCCG AGGCCGACCG CGTGACCATC GTGACGCAGG GCGGCAGCGG CCACGAGCCG GCCATCGAGG GCTTCGTGGG CGAGGGCATG GTGGACATCG ACGTGGTGGG CGACATCTTC GCCGCGCCCG GCCCGCAGGC CTGCGTCGAC GCCATCAAGC TGGCCGACAA GGGCAAGGGC GTGCTCTACA TCGTGCTCAA CCACGCCGGC GACATGCTGA CGGGCAACAT GACCATGAAG CAGTGCAAGA AGCAGGGCCT CAACGTGGTC AAGGTGGTCA CGCAGGAGGA CGTGTCGAAC GCCCCGCGCG AGAACGCCGA CGACCGCCGC GGCCTCGTGG GCTGCATCCC CACCTACAAG ATCGCCGGCG CCGCGGCCGC CGAGGGCAGA AGCCTCGAGG AGGTGGCGGC CGTCGCACAG CGCTTCGCCG ACAACATGGC GACGCTGGCC GTGGCCGTGC GCGGCGCCAC GCATCCGCAG ACGGGCACGC TGCTGGCAGA GCTCGGCGAC GACGAGATGG AAATCGGCAT GGGCCAGCAC GGCGAGGAGG GCGGCGGCCG CCAGCCCCTG AAGTCTGCCG ACGAGACGGC CGCCATTATG GTGAACGCGC TCGTGAAGGA CATCGGCATC GAGCCGGGCG AGCGGGTCAT GCTCATCATC AACGGCTCGG GCGCCACCAC GCTCATGGAG CAGCTCATCG TGTACCGCGC CGCGGTCAAG GAGCTGGCGA AGCAGGACAT CGAAGTGGTG GCGAACTTCG TGGGCGAGAT GCTGACCGTG CAGGAGCAGG CCGGGTTCCA GATGTTCATG GCGCGCATGG ACGACGAGCT GCTGCGCCTA TGGAACGCCC CCTGCACCAC GCCGTACCTG AAGAAGTAG
|
Protein sequence | MQMKKFINDP DNLTAELLEG LALANPDILE LGEDNMVINK KLAEADRVTI VTQGGSGHEP AIEGFVGEGM VDIDVVGDIF AAPGPQACVD AIKLADKGKG VLYIVLNHAG DMLTGNMTMK QCKKQGLNVV KVVTQEDVSN APRENADDRR GLVGCIPTYK IAGAAAAEGR SLEEVAAVAQ RFADNMATLA VAVRGATHPQ TGTLLAELGD DEMEIGMGQH GEEGGGRQPL KSADETAAIM VNALVKDIGI EPGERVMLII NGSGATTLME QLIVYRAAVK ELAKQDIEVV ANFVGEMLTV QEQAGFQMFM ARMDDELLRL WNAPCTTPYL KK
|
| |