Gene Elen_2052 details

Gene Information

Locus tag: Elen_2052
Symbol:
ID: 8416363
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 2405761
End bp: 2408145
Gene Length: 2385 bp
Protein Length: 794 aa
Translation table: 11
GC content: 49%
IMG OID: 645025029
Product: glycosyl transferase family 2
Protein accession: YP_003182405
Protein GI: 257791799
COG category: [R] General function prediction only
COG ID: [COG1216] Predicted glycosyltransferases
TIGRFAM ID:
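
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the stated gene length) and that the CDS ends in a single stop codon. The function names are illustrative and are not part of the record.

```python
# Coordinates copied from the record above; assumed 1-based and inclusive.
start_bp = 2405761
end_bp = 2408145


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count of a complete, unspliced CDS: codons minus the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 2385
print(protein_length(length_bp))  # 794
```

Both values match the Gene Length and Protein Length fields listed in the record.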


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.039975
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTATGCGA GGTTGCTCGT GGACGAGATG GCTTCCGACA TCTCGCTAAA AGCGAAGGCT 
TCCTCGAACG ACCTGATTGT CCCCTGTGGG TTGTACGAGA CCCTTTCGAG CGAGACTGGT
AGCGAGCGGA CGTTCGTGCT CGTGTTTCCA ATCTTACGCA TCCGGAGTGT TTCGCTTTCT
ATCTTCGGGG TTGACGAGAA AGGAAGCGCG CTTGACCAAT GCTCCTTTTC GATCAATTTC
GAAAAGGCGA AGTGGCAGTC GCGTATAAAC TATCGTTTCA ACAAGGAGCT ATGCAAAGAA
ATCAGGGACT ACGATAAGAT TGGAACGTAC AGCAAAATAA GCATGGAGTT TTGGGACTGC
ATATCCGACA AGAATGTAAA CATCTTGCGT GGTCTTGTCC GCATGCCGTA CAGAAACGAC
AGTATCGTTC AAATTACCTG TACAAACGAT AGTCTCCAAG AGATTGCGAT AAGCCCGGTG
TTTCTCAGCG ATGTTAAGGT TCGATCGGAC GTTTCGGAAC GCATCTTTTT CCGAGAGATA
CAGTTCTCGA TTCGCGTTCC CAACAAGATT CAGAACCTCG TGTTCCGCTT GATCGATAAG
GCCCATCCCG AGCTCGATAG CTTTGAGGCT ATCGAGGACC ATGCCTATAG GAAAATACGA
GATGACAGCA ATGAGATAAT GCTCAGTGCG CAAAGCGATC CCTGCTACCC GAAATGGTTT
GAGGAGCATA GGGTAGATTT AGGAGCTCTC GCCAAACAGC ATGAAACGTT TTTCGATTAT
CGGCCGCTTT TCAGCATCGT GGTGCCGCTT TACAAAACCC CGAAACCCTT TTTCCTGGAT
ATGCTGAATT CCGTTGTTTC GCAGAGCTAT GGGCGCTGGG AACTTATCCT AGTCAACGCC
AGTCCGAAAG ACGAGGTGCT TGTCGGCTTG GTCGAAGAGG CATCGTCGAA CGACAAGCGC
ATTAAAAGCG TTGTTCTTGA GTCGAACGGC GGGATTTCCG AGAACACCAA TGCCGGTTTG
GCGGTTTCGT CAGGCGATTT CGTTTGCTAT TTCGATCACG ACGATCTTCT CGAACCTGAT
CTTCTCTTTG AGTACGCAAA AGCCTTGAAT GCCGACGAAA GCATTGATCT GTTGTATTGC
GACGAAGATA AGATGTTGCC AAGCGGCACG TTGGCCGAGC CCTTCTTCAA GCCGGATTTC
AACATAGATT TGCTGCGCGA CAACAATTAT ATCTGCCACT TGCTGACGAT TCGAAAGAGT
CTTCTTGACG AGTTAGAGCC CAACACGGCG CAGTTCGATG GTGCCCAGGA TCACAATATG
ACACTTCAGG CTTCCGAACG TGCGAGGAAG ATACATCATG TTGCCCGTGT GCTCTATCAT
TGGCGCATCA GCGAATCCTC TACGGCGGCG AATGCCGACA ACAAGCCTTA CGCAACGCAA
GCCGGTATAA AAGCAGTCCA AAACCACTTG GATAGACTCG GCATTCGGGC CTGCGTGAGG
CAGTCGCGAC GCCCGTTTAC GTACAGCGTC GACTATCTGC CGCCTGAGAG CGAACCGCTT
GTGTCGATTA TCATCCCAAC TAAAGACCAC AGTGATGTGC TTCGTACGTG CGTGGAATCC
GTTTTAGACA GAACAACCTA TGACAAGTAC GAGATTGTGA TCGTAGAGAA TAACAGCACG
GAGCCGAAAA CGTTTGCCTA TTACGAGGAA TTAGAAAAAG AGCATGGTGA TCGCATTCGG
ATTGAATATT GGCCGGCTGA GTTTAACTTC TCGAAGCTCA TCAATTTCGG CGTTTCGAAA
GCAAGAGGAG ATCTTCTCTT GCTTTTGAAC AACGATACCG AGGTGATCAC CCCAGAGTGG
ATGGAGCGCA TGGTCGGTAT CTGTTCTCGC GAAGACGTCG GCGTCGTTGG TGTGCGTCTG
TATTTTAGGG ATGAGACTAT TCAGCATGCA GGCGTATGCG TTTCCGGTGG GGTTGCGGGG
CATCTTGGTC GCAATCTGCC CAAAGGCAAC TGGGGCTATT TTTCTCTGAG CGATGCAACG
CAAGACATGA GTGCAGTTAC TGCTGCTTGC ATGATGACTA AACGAGGCGT TTTCGAAAGT
GTTGACGGAT TCTCTGAGGA ATTGGCAGTT GCGTTCAACG ATGTAGACTA TTGCTTGAAA
GTCAGAGATA TGGAATTGCT GGTCGTTTAC ACGCCCGAAG TGGAGCTGTT CCACTATGAG
TCCCTGTCTC GAGGATTCGA GAGTAGCGCT GAAAAGAAGA TAAGGTTCCA TCGCGAGGTT
TCGTTCATGA ATTACAGATG GGCGGAATAT TATGTCAAAG GGGATCCATA TGCGAATCCC
AATCTGTCGA CAAACGAGCC TTATAACTGC TATTACCATC TGTAA
 
Protein sequence
MYARLLVDEM ASDISLKAKA SSNDLIVPCG LYETLSSETG SERTFVLVFP ILRIRSVSLS 
IFGVDEKGSA LDQCSFSINF EKAKWQSRIN YRFNKELCKE IRDYDKIGTY SKISMEFWDC
ISDKNVNILR GLVRMPYRND SIVQITCTND SLQEIAISPV FLSDVKVRSD VSERIFFREI
QFSIRVPNKI QNLVFRLIDK AHPELDSFEA IEDHAYRKIR DDSNEIMLSA QSDPCYPKWF
EEHRVDLGAL AKQHETFFDY RPLFSIVVPL YKTPKPFFLD MLNSVVSQSY GRWELILVNA
SPKDEVLVGL VEEASSNDKR IKSVVLESNG GISENTNAGL AVSSGDFVCY FDHDDLLEPD
LLFEYAKALN ADESIDLLYC DEDKMLPSGT LAEPFFKPDF NIDLLRDNNY ICHLLTIRKS
LLDELEPNTA QFDGAQDHNM TLQASERARK IHHVARVLYH WRISESSTAA NADNKPYATQ
AGIKAVQNHL DRLGIRACVR QSRRPFTYSV DYLPPESEPL VSIIIPTKDH SDVLRTCVES
VLDRTTYDKY EIVIVENNST EPKTFAYYEE LEKEHGDRIR IEYWPAEFNF SKLINFGVSK
ARGDLLLLLN NDTEVITPEW MERMVGICSR EDVGVVGVRL YFRDETIQHA GVCVSGGVAG
HLGRNLPKGN WGYFSLSDAT QDMSAVTAAC MMTKRGVFES VDGFSEELAV AFNDVDYCLK
VRDMELLVVY TPEVELFHYE SLSRGFESSA EKKIRFHREV SFMNYRWAEY YVKGDPYANP
NLSTNEPYNC YYHL
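
The gene and protein sequences above can be checked against each other by translating the DNA codon by codon. The following is a minimal sketch rather than the method the database used: translation table 11 assigns the same amino acids to codons as the standard code (it differs in which start codons are permitted), so a standard codon table is built here. The `translate` helper is an illustrative name, and the two fragments are the first 30 and last 45 bases of the gene sequence as listed.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 uses the same
# codon-to-amino-acid assignments ('*' marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, ignoring whitespace, up to the first stop."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 45 bases of the gene sequence above.
head = "ATGTATGCGA GGTTGCTCGT GGACGAGATG"
tail = "AATCTGTCGA CAAACGAGCC TTATAACTGC TATTACCATC TGTAA"

print(translate(head))  # MYARLLVDEM
print(translate(tail))  # NLSTNEPYNCYYHL
```

The two outputs correspond to the first and last blocks of the protein sequence as listed. Passing the full gene sequence to the same function would allow the whole translation to be compared.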