Gene Elen_1229 details

Gene Information

Locus tag: Elen_1229
Symbol:
ID: 8415520
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 1476235
End bp: 1477233
Gene Length: 999 bp
Protein Length: 332 aa
Translation table: 11
GC content: 67%
IMG OID: 645024192
Product: Glycerone kinase
Protein accession: YP_003181588
Protein GI: 257790982
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2376] Dihydroxyacetone kinase
TIGRFAM ID:
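
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which is consistent with the reported 999 bp) and that the final codon of the CDS is a stop codon. The function names are illustrative and are not part of the source database.

```python
# Values copied from the Gene Information fields.
START_BP = 1476235
END_BP = 1477233


def gene_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 999
print(protein_length(length_bp))  # 332
```

Both results agree with the Gene Length and Protein Length fields reported for Elen_1229.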


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00208201
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.114494
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGATGA AGAAGTTCAT CAACGACCCC GACAACCTCA CCGCGGAGCT GCTCGAGGGC 
CTGGCCCTGG CCAACCCCGA CATTCTCGAG CTGGGCGAGG ACAACATGGT CATCAACAAG
AAGCTGGCCG AGGCCGACCG CGTGACCATC GTGACGCAGG GCGGCAGCGG CCACGAGCCG
GCCATCGAGG GCTTCGTGGG CGAGGGCATG GTGGACATCG ACGTGGTGGG CGACATCTTC
GCCGCGCCCG GCCCGCAGGC CTGCGTCGAC GCCATCAAGC TGGCCGACAA GGGCAAGGGC
GTGCTCTACA TCGTGCTCAA CCACGCCGGC GACATGCTGA CGGGCAACAT GACCATGAAG
CAGTGCAAGA AGCAGGGCCT CAACGTGGTC AAGGTGGTCA CGCAGGAGGA CGTGTCGAAC
GCCCCGCGCG AGAACGCCGA CGACCGCCGC GGCCTCGTGG GCTGCATCCC CACCTACAAG
ATCGCCGGCG CCGCGGCCGC CGAGGGCAGA AGCCTCGAGG AGGTGGCGGC CGTCGCACAG
CGCTTCGCCG ACAACATGGC GACGCTGGCC GTGGCCGTGC GCGGCGCCAC GCATCCGCAG
ACGGGCACGC TGCTGGCAGA GCTCGGCGAC GACGAGATGG AAATCGGCAT GGGCCAGCAC
GGCGAGGAGG GCGGCGGCCG CCAGCCCCTG AAGTCTGCCG ACGAGACGGC CGCCATTATG
GTGAACGCGC TCGTGAAGGA CATCGGCATC GAGCCGGGCG AGCGGGTCAT GCTCATCATC
AACGGCTCGG GCGCCACCAC GCTCATGGAG CAGCTCATCG TGTACCGCGC CGCGGTCAAG
GAGCTGGCGA AGCAGGACAT CGAAGTGGTG GCGAACTTCG TGGGCGAGAT GCTGACCGTG
CAGGAGCAGG CCGGGTTCCA GATGTTCATG GCGCGCATGG ACGACGAGCT GCTGCGCCTA
TGGAACGCCC CCTGCACCAC GCCGTACCTG AAGAAGTAG
 
Protein sequence
MQMKKFINDP DNLTAELLEG LALANPDILE LGEDNMVINK KLAEADRVTI VTQGGSGHEP 
AIEGFVGEGM VDIDVVGDIF AAPGPQACVD AIKLADKGKG VLYIVLNHAG DMLTGNMTMK
QCKKQGLNVV KVVTQEDVSN APRENADDRR GLVGCIPTYK IAGAAAAEGR SLEEVAAVAQ
RFADNMATLA VAVRGATHPQ TGTLLAELGD DEMEIGMGQH GEEGGGRQPL KSADETAAIM
VNALVKDIGI EPGERVMLII NGSGATTLME QLIVYRAAVK ELAKQDIEVV ANFVGEMLTV
QEQAGFQMFM ARMDDELLRL WNAPCTTPYL KK
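
The protein sequence can be reproduced from the gene sequence using translation table 11, as listed under Gene Information. The following is a minimal sketch in plain Python. The codon table is built from the NCBI table 11 amino acid string, and the helper names (`clean`, `translate`, `gc_fraction`) are illustrative assumptions, not part of any database tooling. The sketch does not handle alternative start codons, which table 11 reads as M at initiation; this gene starts with ATG, so that case does not arise here.

```python
from itertools import product

# NCBI translation table 11 (bacterial, archaeal and plant plastid code).
# Amino acids are listed for codons in TCAG order: TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First and last rows of the gene sequence, copied from this page.
first_row = "ATGCAGATGA AGAAGTTCAT CAACGACCCC GACAACCTCA CCGCGGAGCT GCTCGAGGGC"
last_row = "TGGAACGCCC CCTGCACCAC GCCGTACCTG AAGAAGTAG"

print(translate(first_row))  # MQMKKFINDPDNLTAELLEG
print(translate(last_row))   # WNAPCTTPYLKK
```

The last row can be translated on its own because the 960 bases before it are a multiple of 3, so it starts in frame. Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the Protein sequence and GC content fields.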