Gene Elen_2040 details


Gene Information

Locus tag: Elen_2040
Symbol:
ID: 8416351
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 2389951
End bp: 2391111
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 62%
IMG OID: 645025017
Product: glycosyl transferase family 2
Protein accession: YP_003182393
Protein GI: 257791787
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0463] Glycosyltransferases involved in cell wall biogenesis
TIGRFAM ID:
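
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is only an illustration and rests on two assumptions that the record does not state: that the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the gene record.
START_BP = 2389951
END_BP = 2391111
GENE_LENGTH_BP = 1161
PROTEIN_LENGTH_AA = 386


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions the reported gene length (1161 bp) and protein length (386 aa) agree with the start and end coordinates.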


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.12763
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.970209
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCCGGAAT TCTCGATAAT CATCCCGGTG TACCATGGCG AGCGATGCGT GGAGCAGTGC 
ATCGCGAGCT TGACGACGCA GTCTTTCGGC GATTTCGAGA TCTTGTGCGT CGACGATGCG
AGCGAGGACG GATCAGCAGC GGTGCTGTCG CGGTTGAGCG AGGAGGATCG ACGCGTCCGC
GTCATCGGCC TTGAGCGAAA CGGAGGGTGC TCGCGCGCTC GTCGCCTAGG TGTTTTGAGC
TCGAAGGGCG ACTACGTGCT GTTCGCGGAT CAGGACGACG CGTACGCCCC CGGAGCTCTC
CAGCGCCTCC ACGATGAGCT TGTAGCCGAC CCTGTGGACA TTCTCGCGTT CGACGCCGAT
GTCGAGAGCG TCGACGGCGT GGGAGAGGAG GAGGTTCGCG GCGTTCGGGA GTGGATGAAG
GCTCCGCAGG TTCGACTGAG CGGCCGCCGC GTTCTCGATG CGTGCTTCCT GGACAACGAA
TACGGGTATT CCCTGTGGAA CAAGGCCTAC CGTGGCGATA TGGCGCGGTG CGCGTTTGCG
GCGACCGAAG ACGAGACCGT TCCCCTGGGA GAGGACAACT ACGCGTACTT CGTGCTTGCG
TATTTCGCCG GGTCGCTTCG CGGGATTCCA GGTGCGCCGC TGTACCGCTA TCGCTATGGG
GCCGGCTTCA CGGGCCATGG TTCGATGAGC CTTGCGGCAT GGAGGCGCAC GTCTACGCTC
GCAGAGGCCG CTGACCTCAT CCGTAGGTTC TTGGAGCGTC AGGGGACGTG GATCGAGTAC
GAGGATGTCC ACGCCGCCGC TCGCGCGCAT ATGGTCGAGT ACGCGTTCGA TCACTATCGA
ACCGAGATCT CCGAAGAAGA TCGCTCTCAG GCTCTTGCCA TCGCCTTGGA GTACTGGACG
TACGAAGAAG TGCTCGAGGG TCTCGCACGC TCTGCGCCTG GTGATTTGCC CCTGCTCGTG
GATGCGCGCT TCGGTGATGA TCCTGCGCGT ATCGAACTGC GAGAATGGGT TGAGGAGCAG
GCGGACACCC TTTCGAGGAT GGATGAAATC GCTCGTGCGC GACGCGAAGA GGCTGCCCGG
CTGCGGGAGC GGTACGAATC GTCCCGGGCA TATCGTCTCG GGCGAAAGGC GACGGCGCCG
CTGCGATTGT TGAGAAGATA G
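
A GC content figure such as the one reported for this gene is usually the fraction of G and C bases in the sequence. The sketch below shows that calculation on the first 60 bp only; the record does not say how its own value was computed or rounded, so treat this as an assumption. The name `gc_content` is hypothetical.

```python
# First line of the gene sequence, with the 10-base grouping kept as shown.
FIRST_LINE = "TTGCCGGAAT TCTCGATAAT CATCCCGGTG TACCATGGCG AGCGATGCGT GGAGCAGTGC"


def gc_content(dna: str) -> float:
    """Fraction of G and C bases, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


if __name__ == "__main__":
    print(f"{gc_content(FIRST_LINE):.1%}")
```

Running the same function on the full 1161 bp sequence gives a value that can be compared with the GC content listed in the record.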
 
Protein sequence
MPEFSIIIPV YHGERCVEQC IASLTTQSFG DFEILCVDDA SEDGSAAVLS RLSEEDRRVR 
VIGLERNGGC SRARRLGVLS SKGDYVLFAD QDDAYAPGAL QRLHDELVAD PVDILAFDAD
VESVDGVGEE EVRGVREWMK APQVRLSGRR VLDACFLDNE YGYSLWNKAY RGDMARCAFA
ATEDETVPLG EDNYAYFVLA YFAGSLRGIP GAPLYRYRYG AGFTGHGSMS LAAWRRTSTL
AEAADLIRRF LERQGTWIEY EDVHAAARAH MVEYAFDHYR TEISEEDRSQ ALAIALEYWT
YEEVLEGLAR SAPGDLPLLV DARFGDDPAR IELREWVEEQ ADTLSRMDEI ARARREEAAR
LRERYESSRA YRLGRKATAP LRLLRR
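
The protein sequence can be related to the gene sequence through the translation table listed in the record (table 11, the bacterial, archaeal and plant plastid code). The sketch below is a minimal, hand-rolled translation that assumes the usual convention of reading an alternative start codon such as TTG as methionine when it is the first codon. The function and constant names are hypothetical, and only the first 60 bp are translated here.

```python
# Codon table in the conventional TCAG order; table 11 uses the same
# amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}

# First line of the gene sequence from the record.
FIRST_60_BP = "TTGCCGGAAT TCTCGATAAT CATCCCGGTG TACCATGGCG AGCGATGCGT GGAGCAGTGC"


def translate_cds(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    print(translate_cds(FIRST_60_BP))
```

The output for this fragment matches the first 20 residues of the protein sequence (MPEFSIIIPV YHGERCVEQC), including the leading M produced from the TTG start codon.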