Gene Information |
Locus tag | Elen_2040 |
Symbol | |
ID | 8416351 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2389951 |
End bp | 2391111 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 645025017 |
Product | glycosyl transferase family 2 |
Protein accession | YP_003182393 |
Protein GI | 257791787 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
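The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2389951
end_bp = 2391111

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A complete CDS should be a whole number of codons.
codon_count, remainder = divmod(gene_length_bp, 3)

# Assumption: the last codon is a stop codon and encodes no amino acid.
protein_length_aa = codon_count - 1

print(gene_length_bp, remainder, protein_length_aa)  # prints "1161 0 386"
```

Both results agree with the Gene Length (1161 bp) and Protein Length (386 aa) fields.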
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.12763 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.970209 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCCGGAAT TCTCGATAAT CATCCCGGTG TACCATGGCG AGCGATGCGT GGAGCAGTGC ATCGCGAGCT TGACGACGCA GTCTTTCGGC GATTTCGAGA TCTTGTGCGT CGACGATGCG AGCGAGGACG GATCAGCAGC GGTGCTGTCG CGGTTGAGCG AGGAGGATCG ACGCGTCCGC GTCATCGGCC TTGAGCGAAA CGGAGGGTGC TCGCGCGCTC GTCGCCTAGG TGTTTTGAGC TCGAAGGGCG ACTACGTGCT GTTCGCGGAT CAGGACGACG CGTACGCCCC CGGAGCTCTC CAGCGCCTCC ACGATGAGCT TGTAGCCGAC CCTGTGGACA TTCTCGCGTT CGACGCCGAT GTCGAGAGCG TCGACGGCGT GGGAGAGGAG GAGGTTCGCG GCGTTCGGGA GTGGATGAAG GCTCCGCAGG TTCGACTGAG CGGCCGCCGC GTTCTCGATG CGTGCTTCCT GGACAACGAA TACGGGTATT CCCTGTGGAA CAAGGCCTAC CGTGGCGATA TGGCGCGGTG CGCGTTTGCG GCGACCGAAG ACGAGACCGT TCCCCTGGGA GAGGACAACT ACGCGTACTT CGTGCTTGCG TATTTCGCCG GGTCGCTTCG CGGGATTCCA GGTGCGCCGC TGTACCGCTA TCGCTATGGG GCCGGCTTCA CGGGCCATGG TTCGATGAGC CTTGCGGCAT GGAGGCGCAC GTCTACGCTC GCAGAGGCCG CTGACCTCAT CCGTAGGTTC TTGGAGCGTC AGGGGACGTG GATCGAGTAC GAGGATGTCC ACGCCGCCGC TCGCGCGCAT ATGGTCGAGT ACGCGTTCGA TCACTATCGA ACCGAGATCT CCGAAGAAGA TCGCTCTCAG GCTCTTGCCA TCGCCTTGGA GTACTGGACG TACGAAGAAG TGCTCGAGGG TCTCGCACGC TCTGCGCCTG GTGATTTGCC CCTGCTCGTG GATGCGCGCT TCGGTGATGA TCCTGCGCGT ATCGAACTGC GAGAATGGGT TGAGGAGCAG GCGGACACCC TTTCGAGGAT GGATGAAATC GCTCGTGCGC GACGCGAAGA GGCTGCCCGG CTGCGGGAGC GGTACGAATC GTCCCGGGCA TATCGTCTCG GGCGAAAGGC GACGGCGCCG CTGCGATTGT TGAGAAGATA G
|
Protein sequence | MPEFSIIIPV YHGERCVEQC IASLTTQSFG DFEILCVDDA SEDGSAAVLS RLSEEDRRVR VIGLERNGGC SRARRLGVLS SKGDYVLFAD QDDAYAPGAL QRLHDELVAD PVDILAFDAD VESVDGVGEE EVRGVREWMK APQVRLSGRR VLDACFLDNE YGYSLWNKAY RGDMARCAFA ATEDETVPLG EDNYAYFVLA YFAGSLRGIP GAPLYRYRYG AGFTGHGSMS LAAWRRTSTL AEAADLIRRF LERQGTWIEY EDVHAAARAH MVEYAFDHYR TEISEEDRSQ ALAIALEYWT YEEVLEGLAR SAPGDLPLLV DARFGDDPAR IELREWVEEQ ADTLSRMDEI ARARREEAAR LRERYESSRA YRLGRKATAP LRLLRR
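The gene sequence begins with TTG rather than ATG, while the protein sequence begins with M. Under NCBI translation table 11 (the table listed for this gene), TTG is one of the permitted alternative start codons and is read as methionine at the initiation position. The sketch below illustrates this on the first 30 bases only; `translate_prefix` and the partial codon dictionary are hypothetical helpers written for this example, not part of any database tooling.

```python
# Partial codon table: only the codons that occur in the 30-base excerpt below
# (standard amino acid assignments, which table 11 shares for these codons).
CODONS = {
    "TTG": "L", "CCG": "P", "GAA": "E", "TTC": "F",
    "TCG": "S", "ATA": "I", "ATC": "I", "GTG": "V",
}

# Start codons permitted by NCBI translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_prefix(dna: str) -> str:
    """Translate a CDS prefix, reading a valid start codon in first position as M."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
        else:
            protein.append(CODONS[codon])
    return "".join(protein)


# First three 10-base groups of the gene sequence as displayed in the record.
first_30 = "TTGCCGGAAT TCTCGATAAT CATCCCGGTG"
print(translate_prefix(first_30))  # prints "MPEFSIIIPV"
```

The output matches the first ten residues of the protein sequence. Because the displayed gene sequence translates directly, it appears to be given in coding orientation even though the gene lies on the minus strand of the replicon.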