Gene Elen_1353 details

Gene Information

Locus tag: Elen_1353
Symbol:
ID: 8415651
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 1619031
End bp: 1620665
Gene Length: 1635 bp
Protein Length: 544 aa
Translation table: 11
GC content: 67%
IMG OID: 645024322
Product: Dak phosphatase
Protein accession: YP_003181711
Protein GI: 257791105
COG category: [R] General function prediction only
COG ID: [COG1461] Predicted kinase related to dihydroxyacetone kinase
TIGRFAM ID: [TIGR03599] DAK2 domain fusion protein YloV
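
The coordinate and length fields in this record agree with each other, which a few lines of arithmetic can confirm. The sketch below assumes 1-based, inclusive coordinates (a common convention that this page does not state explicitly) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1619031, 1620665)
print(length)                  # 1635
print(protein_length(length))  # 544
```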


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000477806
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.2025
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAGAAC CCTATACCGC GAACGACCTG CTCAACGCCA TCGCCGTCGC GAGCAAGACC 
CTGAGCGAGC GCAAGGACGA GATCAACCGT CTGAACGTGT TCCCGGTGCC CGACGGCGAC
ACGGGCACGA ACATGTCGCT GACGCTGGAG ACGGTCGTCG AGAACCTGGC CAACCTGCCC
ATCGGGGCGG CCGGCGCGGA AATTCGCAAG GCGATCACGA CCGGCGCGCT CATGGGCGCA
CGCGGAAACT CCGGTGTCAT TACCTCGCAA ATTTTGCGCG GCCTGTGCGA GGGCAGCGTA
GGCCATGACG AGCTGAACGC CGACAGCATC GACGCGGCGT TCGCGAAATC GCAGGAAGTG
GCGTTCCAGG CCGTCCGCAA GCCGGTCGAG GGCACCATCC TCACCGTGCT GCGCGACAGC
GCCGCCGCCG CGAAGCACGC CCGTAAGAAG AAGATGGGCT GCGACGAGGC GCTGGCCTAC
GTGGTGGAGG AGGCCTACGC CTCCGTGCAG CGCACGCCCG ACCTGCTGCC CGTGCTCAAG
GAGAATGGCG TGGTGGACGC GGGCGGCTTC GGCCTGGCCA TCTTCTTCGA CGCGTTCGTC
TCAGCGCTGC TGGGCAAGGA AGGCCCCATG GTGGACGAGC TGGCGTTCGC GCGCGGCACG
GCGCCGAAGG TGGAGATCGA GCAGATCAAC GACTGGGAGG GGTCGGCGTA CCGCTACTGC
ACCGAGTTCC TCGTGCATTC CGACACGGTG GACGTGGACG CGGCCAAGGA CTTCCTGCCC
ACGATGGGCG ACTGCGACCT CATGGTGGGC ATGCACCCCA ACTTCAAGGT GCACGTGCAC
TCGAACCGCC CCGACCAGGT GCTGGGCTGG TTCCTCACGC ACGATGCGCA GATCTCCGAG
GTGCACATCC ACAACATGCA GCAGCAGAGC GCCGCGCGCA CCGACGCGCT GGCCGCCGAG
CAGGGGGAGG CGCCCAAGCC GCTCGGATTC GTGGCCGTGG CCGCGGGCGA GGGCAACGCG
AAGATCCTCA AGAGCCTGGG CGTGGACGTG GTGGTGTCCG GCGGGCAGAC CATGAACCCG
TCCACGAAGG ACCTGCTTGA TGCGGCGGGT CAGGTGAACG CCGACGCCGT CATCATCCTG
CCCAACAACA AGAACATCAT CATGGCCGCC CAGAGCGCCT GCGAGCTGGC CGAGACGCCG
TGCGCCGTGG TTCCCACGAG AAGCGTGCCC GAGGCGTTCG CCGCCCTGTT CGGTTTCGAC
GAGGGCGCCA GCCTCGAAGA GAACGTCGAG TCGATGACCG AGGCCTACGC CGACGTGAAG
ACCGGCGAGG TGACCGTGGC AATCAAGGAT TCCAAGGACG CGCACGACAA CCCCATCAAG
GAGGGCGACG TCATCGGCAT CGCCGACGGG GCCATCGAGG CCGTGGGCTC CACGACCGAG
GACGTGGTCA TGGCGCTGCT CGGCACGATG GAGGCCGAAG ACGCCGACAC GCTCACCATC
CTGGCGGGCG AGGATATGGG GGATGACGCC TTCGACGCGC TGATCGCGCG CATCGAGGAT
GCCTACGACG ACCTCGAGAT CGACGCCCAC CGCGGCGACC AGCCCTTGTA CCCGGTGGTC
ATGTCCGTTG AATAA
 
Protein sequence
MPEPYTANDL LNAIAVASKT LSERKDEINR LNVFPVPDGD TGTNMSLTLE TVVENLANLP 
IGAAGAEIRK AITTGALMGA RGNSGVITSQ ILRGLCEGSV GHDELNADSI DAAFAKSQEV
AFQAVRKPVE GTILTVLRDS AAAAKHARKK KMGCDEALAY VVEEAYASVQ RTPDLLPVLK
ENGVVDAGGF GLAIFFDAFV SALLGKEGPM VDELAFARGT APKVEIEQIN DWEGSAYRYC
TEFLVHSDTV DVDAAKDFLP TMGDCDLMVG MHPNFKVHVH SNRPDQVLGW FLTHDAQISE
VHIHNMQQQS AARTDALAAE QGEAPKPLGF VAVAAGEGNA KILKSLGVDV VVSGGQTMNP
STKDLLDAAG QVNADAVIIL PNNKNIIMAA QSACELAETP CAVVPTRSVP EAFAALFGFD
EGASLEENVE SMTEAYADVK TGEVTVAIKD SKDAHDNPIK EGDVIGIADG AIEAVGSTTE
DVVMALLGTM EAEDADTLTI LAGEDMGDDA FDALIARIED AYDDLEIDAH RGDQPLYPVV
MSVE
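
The gene and protein sequences above can be checked against each other by translating the nucleotide sequence and by recomputing its GC fraction. The following is a minimal sketch, not the database's own method. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as start codons), and it strips the spaces and line breaks used to group the sequence into blocks of ten. The names `clean`, `gc_content`, and `translate` are hypothetical helpers.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence listed above.
print(translate("ATGCCAGAAC CCTATACCGC GAACGACCTG"))  # MPEPYTANDL
print(translate("ATGTCCGTTG AATAA"))                  # MSVE
```

Passing the full gene sequence to `translate` and `gc_content` allows the result to be compared with the listed protein sequence and the reported GC content.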