Gene Elen_2021 details

Gene Information

Locus tag: Elen_2021
Symbol:
ID: 8416332
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 2367813
End bp: 2369048
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 70%
IMG OID: 645024998
Product: glycosyl transferase group 1
Protein accession: YP_003182374
Protein GI: 257791768
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
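
The start, end, and length values in this record can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon. The function names are illustrative and are not part of the database record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues for a CDS, assuming the final codon is a stop
    codon that is not counted in the protein length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2367813, 2369048)
print(length, "bp")                        # gene length
print(protein_length_aa(length), "aa")     # protein length
```

With the values listed here, the computed lengths agree with the reported 1236 bp and 411 aa.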


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000000000675032
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGGAGTGA AGCTGGGCGA CGAGACGCGC GGTTACACGC GCTTCCGCTT CCTGTCGGAG 
CTGCTCGCGC GCGAGGGCTT CGAAGTCGAC CTCATCACGT CGTCGTTCCA GCATTGGGAC
AAGGCGCATC GCGACACGTC GAAAGCCTGC TACCAGGGCC TTCCCTACCG CGTCGTGTTC
ATCGACGAGC CCGGCTACAC GAAGAACCTC GACCTCGCGC GCATCCGCAG CCATCGCGTC
GCGGCGAAGA ACCTGCGCGC GCACTTCGAG CGAACGGCCG GCGCGTACGA CCTCATCTAC
GCGGAGATCC CGCCGAACGA CGTCGCGCGC GTGTGCGCCG AAGCGGCCGA CGCGCAGGGC
ATCCCGTTCG TGGCGGACAT CAACGACCTG TGGCCCGAGG CCATGCGCAT GGTCGTCGAC
GTGCCCGTGG TCAGCGACGT CGCCTTCTAC CCGTTCTCGC GCGACGCGAA GCGCGTCTAC
CAGCTGCTGG CGGGCGCCGT CGGCACCTCC GACGAGTACG CGGCGCGTCC GGCGAAGGAC
CGCGCGAAGC CCTACCCCCA GGCCACGGTG TACGTGGGCA ACGACCTGGC CGCCTTCGAC
GAAGGAGCCC GCGTGCACGC GCCCGAGGTG GACAAGCCGG AAGGCGAGCT GTGGGTCGCC
TACGCCGGAA CGCTCGGCGC CAGCTACGAC GTGGCCACGC TCGTCGAGGC CGCCGCGCTG
CTCGAGCGCC GACGCCTCGC ACGGGCGGCG TCGAAGGGCG ACGACCAGGC GCCGGCCTTG
CCCCCCGTGC GCGTGAAGGT GCTCGGCGAC GGCCCCGACC GCGAGAAGCT CGAGGCGCTC
GCGGCGCAGC TCGACGCCCC GGCGGACTTC CTGGGTTACA CGGCCTACGA GCTGATGGCC
GCCTACCTGT GCGCGTCGGA CATCGTGGTG AACTCGCTCG TCACGTCGGC GGTTCAGAGC
ATCGTGACGA AGATCGGCGA CTACCTGGCC AGCGGCAACC CCATGATCAA CACGGGCTCG
AGCCCCGAGT TCCGCGCGAA GGTGACCGCC GACGGCTTCG GCGTGAACGT CGAGGCGGAA
GATGCCGAAG CGCTCGCCGA CGCCATCGCC AAGCTCGCGG GGCACGCGTC GCTGCGCAAG
ATCATGGGCT CGAAGGCACG CGCCGTCGCC GAGAGCGAGT TCGACCAGCC CCGCGCGTAT
CGCGAGATCG TGGATTTGCT GCGCACGTTG CTGTGA
 
Protein sequence
MGVKLGDETR GYTRFRFLSE LLAREGFEVD LITSSFQHWD KAHRDTSKAC YQGLPYRVVF 
IDEPGYTKNL DLARIRSHRV AAKNLRAHFE RTAGAYDLIY AEIPPNDVAR VCAEAADAQG
IPFVADINDL WPEAMRMVVD VPVVSDVAFY PFSRDAKRVY QLLAGAVGTS DEYAARPAKD
RAKPYPQATV YVGNDLAAFD EGARVHAPEV DKPEGELWVA YAGTLGASYD VATLVEAAAL
LERRRLARAA SKGDDQAPAL PPVRVKVLGD GPDREKLEAL AAQLDAPADF LGYTAYELMA
AYLCASDIVV NSLVTSAVQS IVTKIGDYLA SGNPMINTGS SPEFRAKVTA DGFGVNVEAE
DAEALADAIA KLAGHASLRK IMGSKARAVA ESEFDQPRAY REIVDLLRTL L
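
The protein sequence can be reproduced from the gene sequence, and the GC content can be recomputed from it as well. The following is a minimal sketch, assuming the standard codon assignments (NCBI translation table 11 shares these with table 1 and differs mainly in the permitted start codons) and assuming the sequence is given 5' to 3' in the coding orientation. The helper names `gc_content` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
_BASES = "TCAG"
_AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
          "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa
               for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first
    stop codon. Any incomplete trailing codon is ignored."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGGGAGTGA AGCTGGGCGA CGAGACGCGC"))  # MGVKLGDETR
```

Passing the full gene sequence to `translate` should give the protein sequence listed above, and `gc_content` can be compared against the reported GC content value.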