Gene Elen_2042 details


Gene Information

Locus tag: Elen_2042
Symbol:
ID: 8416353
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 2391284
End bp: 2393782
Gene Length: 2499 bp
Protein Length: 832 aa
Translation table: 11
GC content: 61%
IMG OID: 645025019
Product: glycosyl transferase family 2
Protein accession: YP_003182395
Protein GI: 257791789
COG category: [R] General function prediction only
COG ID: [COG1216] Predicted glycosyltransferases
TIGRFAM ID:
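
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Neither convention is stated in the record, and the variable names are made up for this example.

```python
# Values copied from the record above.
start_bp = 2391284
end_bp = 2393782
gene_length_bp = 2499
protein_length_aa = 832

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not part of the protein.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, span_bp % 3, expected_protein_aa)  # 2499 0 832
```

Under those assumptions the span matches the listed gene length, is a whole number of codons, and implies the listed protein length.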


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.635088
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAATAG ACCTCAAAAC CATGTGCCGT GGCGACGGCA AGGGATTCGT GCTTGTCGAG 
CTGCACGATG TCGGCGCTGC TTCGACCGTG GCGCTTTGCG TTGCGGACGA AAAGGGGACG
GTCGTCCCCT CGGGTCTGTA TTCCTATTTG GAAAACGGCC CCGAAGGTGT CGGCCCCGAA
GGTGTGGCAG CTCCTCCAAA AACATGGTAC GACGAGGCGC GTTCCTGGTG GTGCTGCGCG
GTGCCGGCAT CGCGCAGCGT CCGCGCTGTC GCGGTTATCC CCCTGCTTGA AACGAATGCG
TGGACGCTCG CGTTTTCGGC TGTCGACGAC GGGGGCCGCG TGGTGGCCGA GGCCGATGCG
CGCATCGGTG CGACGGCTCT GAAGTGGCGT TCGCGTGTGA ATTATCGACT TCGGTCGCAA
ACGTGCCGAT CCATTCGGGA TATCGACACG CGCGGCGATG CCAATGCCGC TGCGTCGTTT
ACGCGCGTGA TCGAGGATGG CGAGCACGCG ATCGTCCGTG CGCACGTCAA CGTGCCGTTC
TTCGAGTCGA GCGTCCTCGA TTGCGCTCTG CTTGACGGAC GAGGATGCGT GCTGCATGCA
GAGCCTCTCG TGCTCGAGGA TACCCGTTAC CGACGAAGCT CGACGGGCGA TGAACGGAGG
GCGTTGACTT TGTCGGTTCG CACGCCGCTC GCGCAGCGTC TCGTAACGCT GCGCGTGGTG
GACGAAGAGG GGCTGTTCGC CCCCTGCTTC GCCACGCTCG ACGAATGCTC CTTCGATAGT
CTGTTGAACT CGACGCGCGA GGAGACGATG AGTGCAGAGC GCGACCCTCG TTACGACGCG
TGGTTCAAAG CTCGAGCTGC AACCGCTTCG CACTTGGTTG CCCAATCCGG CGAAACGGTG
TCTCCTGCTC CGACGTTCAG CATCGTCGTC CCGCTGTACC GCACGCCTGT CGAGTATTTC
AGGTCGATGC TCCAATCGGT GCAGCGGCAG AGCTATGGGG GGTGGGAGCT GATTCTCGTC
AACGCCTCGC CCGACGACGG CCGGCTGGTC GAAGAGCTCG AAAACGTTTC CGACGCTCGG
GTGCGCGTCG TGAATCTTGA GGAGAATCAC GGCATTGCGG AAAACACGAA TGCGGGTATT
CGTGTGGCGC AGGGCGATTT CGTCGCCTTC TTGGACCACG ACGACGTTTT GGCTCCGGAT
GCGCTGTTCG GGTACGCCCG CGCGGTATGC GACGATCCTC TCGTTGATAT CGTGTACTGC
GATGAGGATC GTATCGATTC CGTCGGCGTC CATCATGCGC CGTTTTTCAA GCCCGATTTC
TCGCCCGAGC TCCTTAACGC TCAGAACTAC ATCACGCATT TTCTGGCCGT ACGAAAGAGC
CTGATCGAAG AGATCGGCCT GCTCGATGCG ACGTTCGACG GCGCGCAGGA TTACGACCTT
GTTCTGAGGG CGACGGAACG GTCGCGTTCG GTCGCCCATA TTCCCCGCGT GCTGTATCAC
TGGCGCATGC ACGAAGCTTC TACGAGTATG AACTCCGATA GCAAGTCTTA CGCTGGCGAG
GCCGGGCGTG CCGCCCTGGA GGCTCACTGC CGTCGTTGCG GATGGAGCGC GAAGGTTGAG
CGGACCGATT TGCCGTTCGC CTATCGCGTG CGTCATGAGC TTGTCGAGCG CCCCAAGGTG
TCCATACTCA TACCCAGCAA AGACAAGACT TCGCTCTTGT CCGCTTGCGT GGAAAGCATC
GTCGAGAAGA CTTCGTACGA CAACTATGAA ATCGTGGTCA TCGAGAACAA CAGCGTGGAG
CCGGAGACGT TCGCCTACTA CGAGGAGGTG CAGCGTCTCG GCAAGGCGAG GGTCGTCGAA
TGGCCGGATA CGTTCAACTT CTCGAAAATC ATGAACTTCG GCGTGCGACA GTGCGACGGG
GACTACGTCT TGTTGCTGAA CAACGACACC GAGGTGATCA CGCCGAACTA TCTGGAAACG
ATGCTGGGAT ATTTCCAGGC CGAAGGCGTG GGGGTTGTGG GCGCGAAGCT CCTGTTCCCC
GATGACACCG TGCAGCATGG GGGAGTGGTC TTGGGCCCGT ACCGTTCGGC GGGTCATCTG
TTCGCATCGC TGCCCAAGGA CGATCTGGGC TACTTTTGTC GTGCGGTGCT TCCCCAGAAC
CTGTCTGCGG TGACCGGCGC TTGCCAGCTC GTCCCCCGCT CGGTGTTCGA GGAGGTCGGA
GGCTATACCG AGGCATTCGA AGTTGGCCTG AACGACGTCG ACTTCTGCCT GAAAGTGCGT
GAAGCCGGTT ATCGAGTCGT ATGGACGCCC GACGCGCTGC TGTACCATTA TGAATTCTCC
TCTCGCGGGC GCGACAGGGA AGGCGCGCAG GCGGAGCGGG CGGAGCGTGA AATCGCGTTG
CTGCGTACGC GCTGGCCACG GTATTTCGAG GCGGGCGATC CCTACGTGGG ACCCAATGTG
AGTCCTGATT CCCTCTATTT CGGATTGGAC TGCCGATGA
 
Protein sequence
MRIDLKTMCR GDGKGFVLVE LHDVGAASTV ALCVADEKGT VVPSGLYSYL ENGPEGVGPE 
GVAAPPKTWY DEARSWWCCA VPASRSVRAV AVIPLLETNA WTLAFSAVDD GGRVVAEADA
RIGATALKWR SRVNYRLRSQ TCRSIRDIDT RGDANAAASF TRVIEDGEHA IVRAHVNVPF
FESSVLDCAL LDGRGCVLHA EPLVLEDTRY RRSSTGDERR ALTLSVRTPL AQRLVTLRVV
DEEGLFAPCF ATLDECSFDS LLNSTREETM SAERDPRYDA WFKARAATAS HLVAQSGETV
SPAPTFSIVV PLYRTPVEYF RSMLQSVQRQ SYGGWELILV NASPDDGRLV EELENVSDAR
VRVVNLEENH GIAENTNAGI RVAQGDFVAF LDHDDVLAPD ALFGYARAVC DDPLVDIVYC
DEDRIDSVGV HHAPFFKPDF SPELLNAQNY ITHFLAVRKS LIEEIGLLDA TFDGAQDYDL
VLRATERSRS VAHIPRVLYH WRMHEASTSM NSDSKSYAGE AGRAALEAHC RRCGWSAKVE
RTDLPFAYRV RHELVERPKV SILIPSKDKT SLLSACVESI VEKTSYDNYE IVVIENNSVE
PETFAYYEEV QRLGKARVVE WPDTFNFSKI MNFGVRQCDG DYVLLLNNDT EVITPNYLET
MLGYFQAEGV GVVGAKLLFP DDTVQHGGVV LGPYRSAGHL FASLPKDDLG YFCRAVLPQN
LSAVTGACQL VPRSVFEEVG GYTEAFEVGL NDVDFCLKVR EAGYRVVWTP DALLYHYEFS
SRGRDREGAQ AERAEREIAL LRTRWPRYFE AGDPYVGPNV SPDSLYFGLD CR
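
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is a minimal illustration, not the tool that produced this record. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in which start codons are permitted). The helper `translate` and the variable names are hypothetical, and only the first 30 and last 9 bases of the gene sequence are used.

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_codons = "ATGAGAATAG ACCTCAAAAC CATGTGCCGT"  # first 30 bases of the gene sequence
last_codons = "TGCCGATGA"                          # last 9 bases of the gene sequence

print(translate(first_codons))  # MRIDLKTMCR
print(translate(last_codons))   # CR
```

The two results match the first ten and last two residues of the protein sequence listed above.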