Gene Elen_2347 details


Gene Information

Locus tag: Elen_2347
Symbol:
ID: 8416671
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 2762479
End bp: 2764329
Gene Length: 1851 bp
Protein Length: 616 aa
Translation table: 11
GC content: 65%
IMG OID: 645025331
Product: polysaccharide biosynthesis protein CapD
Protein accession: YP_003182694
Protein GI: 257792088
COG category: [G] Carbohydrate transport and metabolism
              [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1086] Predicted nucleoside-diphosphate sugar epimerases
TIGRFAM ID:
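
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2762479
end_bp = 2764329

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon, which encodes no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1851
print(protein_length)  # 616
```

Under these assumptions the results match the Gene Length (1851 bp) and Protein Length (616 aa) fields.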


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.866725
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGACGCTC ATATCACAAA ACGCACAGCG ATCCTGCTGC TGCTGGATAT CGTCGCGACG 
TACGCCGCGT ACTGGCTCGC ATCGCTGCTC ACCGACGTCG AGGGCGAAGT GTTCGTCAAC
AACGAGATCT ACTTCATGCT GGGCATCCTC GCACTCATCA ACGTTGCCGT GCTGGGGCTG
TTCCATCTGT ACAACAACCT CTGGGAATAC GCCAGCGTCG ACGAAGCCAT CCAGATCGTG
CTGGCCGTGG TGCTGTCAAC CCTGGTGGGC GCCGTGTTCC TCTGGATCAT CGACGTGCGG
CTGCCCATCC GCGTGTTCTT CGTCTCGTGT TTCATGCTCA TATTCTTCAT GGGCGGTATC
CGCCTGATCT TCCGCGTCAT GCGCCAGAAA AGGCGCGCGC TCGTCTCCAC GCAGCGCGCG
TGCGACCGGC CGCGCACGCT GGTGGTGGGC GCGGGGGAGA CGGGCTCGCT GGCCATCGGG
CGCATGGCCT CGAAGGACCC GCTCATGCCG GGCATCCCCA TCGTGGCCAC CGACGACGAC
CCCACCAAGC GCGGCTCGCG CATCCACGGC GTAAGGGTGG CCGGTTCCAC GGACGACATC
GTCGACCTCG TGGACAAGCA CAACATCGAC CAGATCGTCG TGGCCATCCC GTCCTCCACG
CCCGAGGAGC GCAAGCGCAT CTACGGCGAA TGCACGAAGA CCGACTGCAA GCTGCGCACC
CTGCCGAACG TGCGCGAGCT GTCGCTCGAC GAGATCGGCG ACGTGCGCCT GCGCGACGTG
GACGTGGCCG ACCTTCTGGG CCGCGAGGAG ATCATCCTCA ACACGCGCGC GGTGTCGGGC
TACATCGCCG GCGAGACCGT GCTGGTCACG GGCGGCGGCG GCTCCATCGG CAGCGAGCTG
TGCCGCCAGC TGTGCAAGGT GGCGCCCGCC CGCATCGTCA TCTTCGACAT GTACGAGAAC
GACGCCTACA TGCTGCGCAA CGAGCTTTTG GCCGAATACG ACGACATCGA CCTCGTCATC
GAGATCGGCA ACGTCTGCGA CGAGGCGCGC CTGAACGAGG TGTTCGCGAA GTACCGCCCC
GGCGCCGTGT TCCACGCGGC CGCCCACAAG CACGTGCCCC TCATGGAGCA ATGCCCGCGC
GAGGCGCTGC ACAACAACGT GTTCGGCACG CTCAACGCCG TGCGCGCCGC CGACGCCTAC
GGCGCCGCGC GCTTCATCTT CATCTCCACC GACAAGGCCG TGAACCCCAC CAGCGTCATG
GGCGCCACGA AGCGCATGGG CGAGATGGTC ATGCAGTACT ACGCGCGCAC GTCGAAGACC
ATTTTCTCCG CCGTGCGCTT CGGCAACGTG CTGGGCTCGA ACGGCAGCGT CATCCCCGTG
TTCCAGCGCC AGATCGCCGC GGGAGGCCCC CTCACCGTCA CCCATCCCGA CATCGAGCGC
TTCTTCATGA CCATCCCCGA GGCGTCGCGC CTGGTCATCC AAGCAGGCGG CATGGCGAAG
GGCGGCGAGA TCTTCATTCT CGACATGGGC GAGCCGGTGA AGATCGTCGA CTTGGCGAAG
GGCCTCATCC AGCTGCAGGG CCTCACGCCC GACGTGGACG TCAAAATCGT GTTCACGGGC
CTGCGCGAGG GCGAGAAGAT GTACGAGGAG CTGCTCATGG ACGAAGAGAG CACGCTGCCC
ACCGACAACC ACTCCATCCT CATCTCCACC GGCCAGGAGA TCAGCTACAC CGAAGTGGCT
GAGAAACTGG ACGAGCTGGA AGCCGCACTC ACCCTCACCG ACGAGGAAGC CGTCCACGTG
CTGGAGAAAA CCGTCTGCAC CTATCGCCAC ACCCCCAACA AGGTATCCTG A
 
Protein sequence
MDAHITKRTA ILLLLDIVAT YAAYWLASLL TDVEGEVFVN NEIYFMLGIL ALINVAVLGL 
FHLYNNLWEY ASVDEAIQIV LAVVLSTLVG AVFLWIIDVR LPIRVFFVSC FMLIFFMGGI
RLIFRVMRQK RRALVSTQRA CDRPRTLVVG AGETGSLAIG RMASKDPLMP GIPIVATDDD
PTKRGSRIHG VRVAGSTDDI VDLVDKHNID QIVVAIPSST PEERKRIYGE CTKTDCKLRT
LPNVRELSLD EIGDVRLRDV DVADLLGREE IILNTRAVSG YIAGETVLVT GGGGSIGSEL
CRQLCKVAPA RIVIFDMYEN DAYMLRNELL AEYDDIDLVI EIGNVCDEAR LNEVFAKYRP
GAVFHAAAHK HVPLMEQCPR EALHNNVFGT LNAVRAADAY GAARFIFIST DKAVNPTSVM
GATKRMGEMV MQYYARTSKT IFSAVRFGNV LGSNGSVIPV FQRQIAAGGP LTVTHPDIER
FFMTIPEASR LVIQAGGMAK GGEIFILDMG EPVKIVDLAK GLIQLQGLTP DVDVKIVFTG
LREGEKMYEE LLMDEESTLP TDNHSILIST GQEISYTEVA EKLDELEAAL TLTDEEAVHV
LEKTVCTYRH TPNKVS
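
The relationship between the gene sequence and the protein sequence above can be reproduced with a short script. This is a minimal sketch, not the method used to generate the record: it hard-codes the standard codon-to-amino-acid mapping (which translation table 11 shares with the standard code), and it simplifies start-codon handling by always reporting the first codon as M, on the assumption that it is a valid initiator such as the TTG that opens this gene. The helper names `clean`, `gc_fraction`, and `translate_cds` are hypothetical.

```python
# Codon table in TCAG order; table 11 uses the same amino-acid assignments
# as the standard code and differs only in the set of permitted start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Simplification (assumption): the first codon is always reported as M.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino = CODON_TABLE[seq[pos:pos + 3]]
        if amino == "*":
            break
        protein.append("M" if pos == 0 else amino)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record above.
first_30 = "TTGGACGCTC ATATCACAAA ACGCACAGCG"
print(translate_cds(first_30))  # MDAHITKRTA
```

The printed peptide matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.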