Gene Elen_0636 details

Gene Information

Locus tag: Elen_0636
Symbol:
ID: 8414926
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 812087
End bp: 813406
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 66%
IMG OID: 645023613
Product: nucleotide sugar dehydrogenase
Protein accession: YP_003181010
Protein GI: 257790404
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0677] UDP-N-acetyl-D-mannosaminuronate dehydrogenase
TIGRFAM ID: [TIGR03026] nucleotide sugar dehydrogenase
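
The Start bp, End bp, Gene Length and Protein Length fields above can be checked against each other. The sketch below is a minimal Python illustration, not part of the record: it assumes the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon, neither of which the record states explicitly. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(812087, 813406)
print(length_bp, protein_length(length_bp))  # prints: 1320 439
```

Under those assumptions the result agrees with the listed 1320 bp and 439 aa.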


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.96458
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGGGTT TTCAAGAAGG ATGGGTGAGC ATCCTGTCTC GCACCCTCGT CTGCGGCACG 
GTCGGCCTCG GCTACGTCGG GCTGCCGCTG GCCGTGGAGA AGGCCAAGGC GGGCTTCAAG
ACGATTGGCT TCGACGTGCA GGCCGACAAG GTCGACATGG TCAACCGTGG CGAGAACTAC
ATCGGCGACG TCGTGCAGGA GGACCTCGCC GCGCTCGTGG AGGCCGGCAC GCTCGAGGCG
ACCTGCGACT TCTCGCGCGT CGCGGAGTGC GGCTTCGTCG CCATCTGCGT GCCCACCCCG
CTCGACGCCC ACCAGATGCC CGACACCTCC TACATGGAGG CGTCCGCCCG CGAGATAGCG
CCCTACATCC GCGAGGGCTG CATGGTGGTG CTCGAGTCCA CCACCTACCC CGGCACCACC
GAGGAGCTGA TCCTGCCCAT CCTGGAGGAG GGGTCGGGCC TCAAGTGCGG CGAGGGCTTC
TACCTGGGCT TCTCTCCCGA GCGCGTGGAT CCTGGCAACC TCGTCTACAA GACCAAGAAC
ACGCCCAAGG TGGTGGGCGC CGTCGGCGAG GAGGCGCTCG AGCTCATCAG CGCGGTCTAC
GAGGCGGTGC TCGAGGGGGG CGTGACGCGC GTGTCGAGCC CGGCCGTGGC CGAGATGGAG
AAGATCCTCG AGAACACCTA CCGAAACGTC AACATCGGCC TCGTCAACGA GCTCTGCATG
CTCTGCGACC GCATGGGCAT CGACGTGTGG GAGGTAATAG ACGCCGCCAA GACGAAGCCC
TACGGCTTCA CGGCCTTCTA CCCCGGCCCC GGTCTCGGCG GCCACTGCAT ACCGCTCGAC
CCCTACTACC TGTCGTGGAA GGCGCGCGAG TACGGCTTCC ACACCTCCAT GATCGAGGCG
TCCATGACCG TGAACGACTC TATGCCCGAG TGGGTGGCCT CGCGCGCGGC CCGCATCCTC
AACCGGGAGG GCAAGGCGAT GAGGGGATCG AAGGCGCTCG TGCTGGGCGT CGCCTACAAG
CAGGACATCG ACGACTATCG GGAGTCGCCG GCCCTGCGCG TGATTGAGCG CCTGGAGGCG
CGGGGCGCCG AGGTCTCCTA CTACGACCCC TGGGTGCCGC GCTGCCAGCA CAAAGAAGAG
GTAAAGGAAT CCATCCCGGA CCTGTCGGCC GAGGCGATCG CCTCGGCCGA TATTGTTCTG
GTCGCCTGCG CGCACACCAA CGTCGACTAC GCCCTCGTGC AGAGGCACGC GAGGGCCGTG
CTCGACGCCA AGAACGCCAT GAAGGGCGTG TCCCCGCGAG AGAACATCGA GGTGCTGTGA
 
Protein sequence
MMGFQEGWVS ILSRTLVCGT VGLGYVGLPL AVEKAKAGFK TIGFDVQADK VDMVNRGENY 
IGDVVQEDLA ALVEAGTLEA TCDFSRVAEC GFVAICVPTP LDAHQMPDTS YMEASAREIA
PYIREGCMVV LESTTYPGTT EELILPILEE GSGLKCGEGF YLGFSPERVD PGNLVYKTKN
TPKVVGAVGE EALELISAVY EAVLEGGVTR VSSPAVAEME KILENTYRNV NIGLVNELCM
LCDRMGIDVW EVIDAAKTKP YGFTAFYPGP GLGGHCIPLD PYYLSWKARE YGFHTSMIEA
SMTVNDSMPE WVASRAARIL NREGKAMRGS KALVLGVAYK QDIDDYRESP ALRVIERLEA
RGAEVSYYDP WVPRCQHKEE VKESIPDLSA EAIASADIVL VACAHTNVDY ALVQRHARAV
LDAKNAMKGV SPRENIEVL
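
The gene and protein sequences above can be cross-checked with a short script. What follows is a sketch using only the Python standard library, not the method the database itself uses. It builds the codon table from the NCBI base ordering `TCAG`, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code. Table 11's alternative start codons are not handled, which does not matter here because this gene starts with ATG. The helper names `clean`, `gc_percent` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"  # NCBI ordering of bases within the genetic code tables
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Map each codon (TTT, TTC, TTA, ...) to its one-letter amino acid; '*' is stop.
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display spaces and line breaks, then upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons in frame 1 and drop trailing stop symbols."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore an incomplete final codon
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))
    return protein.rstrip("*")


# First row of the gene sequence as displayed in this record.
first_row = "ATGATGGGTT TTCAAGAAGG ATGGGTGAGC ATCCTGTCTC GCACCCTCGT CTGCGGCACG"
print(translate(first_row))  # prints: MMGFQEGWVSILSRTLVCGT
```

Passing the full gene sequence to `translate` and `gc_percent` allows the result to be compared with the protein sequence and the GC content field listed in this record.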