Gene Information |
Locus tag | Elen_2021 |
Symbol | |
ID | 8416332 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 2367813 |
End bp | 2369048 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 645024998 |
Product | glycosyl transferase group 1 |
Protein accession | YP_003182374 |
Protein GI | 257791768 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
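The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon is a stop codon not counted in the protein length. Both assumptions are consistent with the listed values but are not stated in the record itself.

```python
# Values copied from the record above.
start_bp = 2367813
end_bp = 2369048

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1236 411
```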
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00000000000675032 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGAGTGA AGCTGGGCGA CGAGACGCGC GGTTACACGC GCTTCCGCTT CCTGTCGGAG CTGCTCGCGC GCGAGGGCTT CGAAGTCGAC CTCATCACGT CGTCGTTCCA GCATTGGGAC AAGGCGCATC GCGACACGTC GAAAGCCTGC TACCAGGGCC TTCCCTACCG CGTCGTGTTC ATCGACGAGC CCGGCTACAC GAAGAACCTC GACCTCGCGC GCATCCGCAG CCATCGCGTC GCGGCGAAGA ACCTGCGCGC GCACTTCGAG CGAACGGCCG GCGCGTACGA CCTCATCTAC GCGGAGATCC CGCCGAACGA CGTCGCGCGC GTGTGCGCCG AAGCGGCCGA CGCGCAGGGC ATCCCGTTCG TGGCGGACAT CAACGACCTG TGGCCCGAGG CCATGCGCAT GGTCGTCGAC GTGCCCGTGG TCAGCGACGT CGCCTTCTAC CCGTTCTCGC GCGACGCGAA GCGCGTCTAC CAGCTGCTGG CGGGCGCCGT CGGCACCTCC GACGAGTACG CGGCGCGTCC GGCGAAGGAC CGCGCGAAGC CCTACCCCCA GGCCACGGTG TACGTGGGCA ACGACCTGGC CGCCTTCGAC GAAGGAGCCC GCGTGCACGC GCCCGAGGTG GACAAGCCGG AAGGCGAGCT GTGGGTCGCC TACGCCGGAA CGCTCGGCGC CAGCTACGAC GTGGCCACGC TCGTCGAGGC CGCCGCGCTG CTCGAGCGCC GACGCCTCGC ACGGGCGGCG TCGAAGGGCG ACGACCAGGC GCCGGCCTTG CCCCCCGTGC GCGTGAAGGT GCTCGGCGAC GGCCCCGACC GCGAGAAGCT CGAGGCGCTC GCGGCGCAGC TCGACGCCCC GGCGGACTTC CTGGGTTACA CGGCCTACGA GCTGATGGCC GCCTACCTGT GCGCGTCGGA CATCGTGGTG AACTCGCTCG TCACGTCGGC GGTTCAGAGC ATCGTGACGA AGATCGGCGA CTACCTGGCC AGCGGCAACC CCATGATCAA CACGGGCTCG AGCCCCGAGT TCCGCGCGAA GGTGACCGCC GACGGCTTCG GCGTGAACGT CGAGGCGGAA GATGCCGAAG CGCTCGCCGA CGCCATCGCC AAGCTCGCGG GGCACGCGTC GCTGCGCAAG ATCATGGGCT CGAAGGCACG CGCCGTCGCC GAGAGCGAGT TCGACCAGCC CCGCGCGTAT CGCGAGATCG TGGATTTGCT GCGCACGTTG CTGTGA
|
Protein sequence | MGVKLGDETR GYTRFRFLSE LLAREGFEVD LITSSFQHWD KAHRDTSKAC YQGLPYRVVF IDEPGYTKNL DLARIRSHRV AAKNLRAHFE RTAGAYDLIY AEIPPNDVAR VCAEAADAQG IPFVADINDL WPEAMRMVVD VPVVSDVAFY PFSRDAKRVY QLLAGAVGTS DEYAARPAKD RAKPYPQATV YVGNDLAAFD EGARVHAPEV DKPEGELWVA YAGTLGASYD VATLVEAAAL LERRRLARAA SKGDDQAPAL PPVRVKVLGD GPDREKLEAL AAQLDAPADF LGYTAYELMA AYLCASDIVV NSLVTSAVQS IVTKIGDYLA SGNPMINTGS SPEFRAKVTA DGFGVNVEAE DAEALADAIA KLAGHASLRK IMGSKARAVA ESEFDQPRAY REIVDLLRTL L
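The sequences above are grouped into space-separated blocks of ten characters, so they need light cleaning before any computation. The following is a minimal sketch of how the GC content field and the basic shape of the coding sequence could be checked. The helper names (`clean`, `gc_percent`, `looks_like_cds`) are hypothetical and not part of any database tool. The start-codon check accepts only ATG, which is a simplification, since translation table 11 also permits alternative start codons.

```python
def clean(seq: str) -> str:
    """Remove the whitespace that groups the sequence into blocks and uppercase it."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


# Stop codons under translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def looks_like_cds(seq: str) -> bool:
    """Check length divisible by 3, an ATG start (simplified), and a terminal stop codon."""
    s = clean(seq)
    return len(s) % 3 == 0 and s.startswith("ATG") and s[-3:] in STOP_CODONS


# Toy input for illustration only; paste the full gene sequence to check the record.
toy = "ATGGGC TGA"
print(looks_like_cds(toy))
print(gc_percent("ATGC GGCC"))
```

Applied to the full gene sequence, `gc_percent` would be expected to land near the listed GC content, and `looks_like_cds` checks that the sequence begins with ATG and ends with a stop codon, as the record's first and last codons suggest.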