Gene Information |
Locus tag | Elen_2347 |
Symbol | |
ID | 8416671 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2762479 |
End bp | 2764329 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 645025331 |
Product | polysaccharide biosynthesis protein CapD |
Protein accession | YP_003182694 |
Protein GI | 257792088 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1086] Predicted nucleoside-diphosphate sugar epimerases |
TIGRFAM ID | |
|
|
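The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines. This is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that Gene Length counts the stop codon while Protein Length does not. The function name `feature_lengths` is hypothetical and does not come from the record.

```python
def feature_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a CDS that ends in a stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range, so add 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the final codon is the stop codon
    return gene_len, protein_len


# Coordinates of Elen_2347 as listed in the record
gene_len, protein_len = feature_lengths(2762479, 2764329)
print(gene_len, protein_len)
```

With the listed coordinates this gives 1851 bp and 616 aa, matching the Gene Length and Protein Length fields. The strand does not affect the arithmetic, since the coordinates are reported on the replicon regardless of orientation.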
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.866725 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGACGCTC ATATCACAAA ACGCACAGCG ATCCTGCTGC TGCTGGATAT CGTCGCGACG TACGCCGCGT ACTGGCTCGC ATCGCTGCTC ACCGACGTCG AGGGCGAAGT GTTCGTCAAC AACGAGATCT ACTTCATGCT GGGCATCCTC GCACTCATCA ACGTTGCCGT GCTGGGGCTG TTCCATCTGT ACAACAACCT CTGGGAATAC GCCAGCGTCG ACGAAGCCAT CCAGATCGTG CTGGCCGTGG TGCTGTCAAC CCTGGTGGGC GCCGTGTTCC TCTGGATCAT CGACGTGCGG CTGCCCATCC GCGTGTTCTT CGTCTCGTGT TTCATGCTCA TATTCTTCAT GGGCGGTATC CGCCTGATCT TCCGCGTCAT GCGCCAGAAA AGGCGCGCGC TCGTCTCCAC GCAGCGCGCG TGCGACCGGC CGCGCACGCT GGTGGTGGGC GCGGGGGAGA CGGGCTCGCT GGCCATCGGG CGCATGGCCT CGAAGGACCC GCTCATGCCG GGCATCCCCA TCGTGGCCAC CGACGACGAC CCCACCAAGC GCGGCTCGCG CATCCACGGC GTAAGGGTGG CCGGTTCCAC GGACGACATC GTCGACCTCG TGGACAAGCA CAACATCGAC CAGATCGTCG TGGCCATCCC GTCCTCCACG CCCGAGGAGC GCAAGCGCAT CTACGGCGAA TGCACGAAGA CCGACTGCAA GCTGCGCACC CTGCCGAACG TGCGCGAGCT GTCGCTCGAC GAGATCGGCG ACGTGCGCCT GCGCGACGTG GACGTGGCCG ACCTTCTGGG CCGCGAGGAG ATCATCCTCA ACACGCGCGC GGTGTCGGGC TACATCGCCG GCGAGACCGT GCTGGTCACG GGCGGCGGCG GCTCCATCGG CAGCGAGCTG TGCCGCCAGC TGTGCAAGGT GGCGCCCGCC CGCATCGTCA TCTTCGACAT GTACGAGAAC GACGCCTACA TGCTGCGCAA CGAGCTTTTG GCCGAATACG ACGACATCGA CCTCGTCATC GAGATCGGCA ACGTCTGCGA CGAGGCGCGC CTGAACGAGG TGTTCGCGAA GTACCGCCCC GGCGCCGTGT TCCACGCGGC CGCCCACAAG CACGTGCCCC TCATGGAGCA ATGCCCGCGC GAGGCGCTGC ACAACAACGT GTTCGGCACG CTCAACGCCG TGCGCGCCGC CGACGCCTAC GGCGCCGCGC GCTTCATCTT CATCTCCACC GACAAGGCCG TGAACCCCAC CAGCGTCATG GGCGCCACGA AGCGCATGGG CGAGATGGTC ATGCAGTACT ACGCGCGCAC GTCGAAGACC ATTTTCTCCG CCGTGCGCTT CGGCAACGTG CTGGGCTCGA ACGGCAGCGT CATCCCCGTG TTCCAGCGCC AGATCGCCGC GGGAGGCCCC CTCACCGTCA CCCATCCCGA CATCGAGCGC TTCTTCATGA CCATCCCCGA GGCGTCGCGC CTGGTCATCC AAGCAGGCGG CATGGCGAAG GGCGGCGAGA TCTTCATTCT CGACATGGGC GAGCCGGTGA AGATCGTCGA CTTGGCGAAG GGCCTCATCC AGCTGCAGGG CCTCACGCCC GACGTGGACG TCAAAATCGT GTTCACGGGC CTGCGCGAGG GCGAGAAGAT GTACGAGGAG CTGCTCATGG ACGAAGAGAG CACGCTGCCC ACCGACAACC ACTCCATCCT CATCTCCACC GGCCAGGAGA TCAGCTACAC CGAAGTGGCT GAGAAACTGG ACGAGCTGGA AGCCGCACTC ACCCTCACCG ACGAGGAAGC CGTCCACGTG 
CTGGAGAAAA CCGTCTGCAC CTATCGCCAC ACCCCCAACA AGGTATCCTG A
|
Protein sequence | MDAHITKRTA ILLLLDIVAT YAAYWLASLL TDVEGEVFVN NEIYFMLGIL ALINVAVLGL FHLYNNLWEY ASVDEAIQIV LAVVLSTLVG AVFLWIIDVR LPIRVFFVSC FMLIFFMGGI RLIFRVMRQK RRALVSTQRA CDRPRTLVVG AGETGSLAIG RMASKDPLMP GIPIVATDDD PTKRGSRIHG VRVAGSTDDI VDLVDKHNID QIVVAIPSST PEERKRIYGE CTKTDCKLRT LPNVRELSLD EIGDVRLRDV DVADLLGREE IILNTRAVSG YIAGETVLVT GGGGSIGSEL CRQLCKVAPA RIVIFDMYEN DAYMLRNELL AEYDDIDLVI EIGNVCDEAR LNEVFAKYRP GAVFHAAAHK HVPLMEQCPR EALHNNVFGT LNAVRAADAY GAARFIFIST DKAVNPTSVM GATKRMGEMV MQYYARTSKT IFSAVRFGNV LGSNGSVIPV FQRQIAAGGP LTVTHPDIER FFMTIPEASR LVIQAGGMAK GGEIFILDMG EPVKIVDLAK GLIQLQGLTP DVDVKIVFTG LREGEKMYEE LLMDEESTLP TDNHSILIST GQEISYTEVA EKLDELEAAL TLTDEEAVHV LEKTVCTYRH TPNKVS
|
| |
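The gene sequence begins with TTG while the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), TTG is an accepted start codon and is read as methionine at the initiation position, although it encodes leucine elsewhere. The following is a minimal sketch of that rule. The helper `translate_cds` is hypothetical, and it assumes the input is already given in the coding orientation, as the Gene sequence field appears to be.

```python
from itertools import product

# Amino acids for codons enumerated with bases in T, C, A, G order.
# Table 11 uses the same codon assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Alternative initiation codons permitted by translation table 11
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(sequence: str) -> str:
    """Translate a CDS given 5'->3' on the coding strand.

    Whitespace is ignored, so the space-separated 10-base blocks used in the
    record can be pasted directly. Translation stops at the first stop codon.
    """
    seq = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # any valid start codon initiates with Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon: not part of the protein
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from the record
print(translate_cds("TTGGACGCTC ATATCACAAA ACGCACAGCG"))
```

For the first 30 bases of the record this yields `MDAHITKRTA`, which matches the first block of the listed protein sequence. The sketch does not handle ambiguous bases such as N.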