Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2063 |
Symbol | |
ID | 8416379 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2428856 |
End bp | 2429692 |
Gene Length | 837 bp |
Protein Length | 278 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 645025044 |
Product | hydroxyethylthiazole kinase |
Protein accession | YP_003182415 |
Protein GI | 257791809 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG2145] Hydroxyethylthiazole kinase, sugar kinase family |
TIGRFAM ID | [TIGR00694] hydroxyethylthiazole kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0960486 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.0682861 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACCGT TCGCTTTGAA GAGGGCCTTG GACAACGTGC GGGCGACGAC GCCGCTCGTG CATAACATCA CAAACTACGT GACGGTGAAC GACTGCGCGA ACGCGCTGCT GGCCATCGGC GCAAGCCCCA TCATGAGCGA CGAGCCCGCC GACGTGTTCG ACATCACGTC CATCTGCGGC GGCCTCACGC TGAACATCGG CACGCTCAAC GAGCGCTCCA TCCAGGGCAT GTTCGCGGCC GGCGAGCGTG CGAGCGAGCT GGGCCATCCC ATCGTGCTCG ACCCGGTGGG CGCCGGGGCG TCGGCTCTGC GCACGCGCAC GGCTTCCGAC CTGCTGGACA AGCTGGCCGT GTCCGTCGTG CGCGGAAACA TGTCGGAGGC GAAGGCGCTC GCAGGCGGCG CGGCGGCCAC GCGCGGCGTG GACGTGTGCC CCGGCGATGC GGTCACTGAG GACAACCTGG CGGCCGGCGC GGCGTTCGCG CGCGACTTCG CCGCGAAGAC GGGCGCCGTC GTGGCCGTCA CGGGCGCCAT CGACATCGTG GCCGACGCCG AGCGCGCCTA CGCCGTTCGC AACGGCAGTC CCCTCATGGG CCGCATCACG GGCGCGGGCT GCATGCTGTC GTGCGTGTGC GCGGCGTTCG CGACGGCGAA CCCCGATGCG CTGCTCGACG CCACGGTGGC CGCCGTCGTC GGCATGGGGC TGGCAGGCCA GATCGCCCAA GGCCGCATGG GCGGCTACGA CGGCAACGGG TCGTTCCGCA CGTACCTGCT GGACGCCCTG TACAACCTCG ACGGCGATGC CCTGGAAGCG GGCGCGCGGG TCGAAGAGCT AGCATAA
|
Protein sequence | MEPFALKRAL DNVRATTPLV HNITNYVTVN DCANALLAIG ASPIMSDEPA DVFDITSICG GLTLNIGTLN ERSIQGMFAA GERASELGHP IVLDPVGAGA SALRTRTASD LLDKLAVSVV RGNMSEAKAL AGGAAATRGV DVCPGDAVTE DNLAAGAAFA RDFAAKTGAV VAVTGAIDIV ADAERAYAVR NGSPLMGRIT GAGCMLSCVC AAFATANPDA LLDATVAAVV GMGLAGQIAQ GRMGGYDGNG SFRTYLLDAL YNLDGDALEA GARVEELA
|
| |