Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2042 |
Symbol | |
ID | 8416353 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2391284 |
End bp | 2393782 |
Gene Length | 2499 bp |
Protein Length | 832 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 645025019 |
Product | glycosyl transferase family 2 |
Protein accession | YP_003182395 |
Protein GI | 257791789 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.635088 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAATAG ACCTCAAAAC CATGTGCCGT GGCGACGGCA AGGGATTCGT GCTTGTCGAG CTGCACGATG TCGGCGCTGC TTCGACCGTG GCGCTTTGCG TTGCGGACGA AAAGGGGACG GTCGTCCCCT CGGGTCTGTA TTCCTATTTG GAAAACGGCC CCGAAGGTGT CGGCCCCGAA GGTGTGGCAG CTCCTCCAAA AACATGGTAC GACGAGGCGC GTTCCTGGTG GTGCTGCGCG GTGCCGGCAT CGCGCAGCGT CCGCGCTGTC GCGGTTATCC CCCTGCTTGA AACGAATGCG TGGACGCTCG CGTTTTCGGC TGTCGACGAC GGGGGCCGCG TGGTGGCCGA GGCCGATGCG CGCATCGGTG CGACGGCTCT GAAGTGGCGT TCGCGTGTGA ATTATCGACT TCGGTCGCAA ACGTGCCGAT CCATTCGGGA TATCGACACG CGCGGCGATG CCAATGCCGC TGCGTCGTTT ACGCGCGTGA TCGAGGATGG CGAGCACGCG ATCGTCCGTG CGCACGTCAA CGTGCCGTTC TTCGAGTCGA GCGTCCTCGA TTGCGCTCTG CTTGACGGAC GAGGATGCGT GCTGCATGCA GAGCCTCTCG TGCTCGAGGA TACCCGTTAC CGACGAAGCT CGACGGGCGA TGAACGGAGG GCGTTGACTT TGTCGGTTCG CACGCCGCTC GCGCAGCGTC TCGTAACGCT GCGCGTGGTG GACGAAGAGG GGCTGTTCGC CCCCTGCTTC GCCACGCTCG ACGAATGCTC CTTCGATAGT CTGTTGAACT CGACGCGCGA GGAGACGATG AGTGCAGAGC GCGACCCTCG TTACGACGCG TGGTTCAAAG CTCGAGCTGC AACCGCTTCG CACTTGGTTG CCCAATCCGG CGAAACGGTG TCTCCTGCTC CGACGTTCAG CATCGTCGTC CCGCTGTACC GCACGCCTGT CGAGTATTTC AGGTCGATGC TCCAATCGGT GCAGCGGCAG AGCTATGGGG GGTGGGAGCT GATTCTCGTC AACGCCTCGC CCGACGACGG CCGGCTGGTC GAAGAGCTCG AAAACGTTTC CGACGCTCGG GTGCGCGTCG TGAATCTTGA GGAGAATCAC GGCATTGCGG AAAACACGAA TGCGGGTATT CGTGTGGCGC AGGGCGATTT CGTCGCCTTC TTGGACCACG ACGACGTTTT GGCTCCGGAT GCGCTGTTCG GGTACGCCCG CGCGGTATGC GACGATCCTC TCGTTGATAT CGTGTACTGC GATGAGGATC GTATCGATTC CGTCGGCGTC CATCATGCGC CGTTTTTCAA GCCCGATTTC TCGCCCGAGC TCCTTAACGC TCAGAACTAC ATCACGCATT TTCTGGCCGT ACGAAAGAGC CTGATCGAAG AGATCGGCCT GCTCGATGCG ACGTTCGACG GCGCGCAGGA TTACGACCTT GTTCTGAGGG CGACGGAACG GTCGCGTTCG GTCGCCCATA TTCCCCGCGT GCTGTATCAC TGGCGCATGC ACGAAGCTTC TACGAGTATG AACTCCGATA GCAAGTCTTA CGCTGGCGAG GCCGGGCGTG CCGCCCTGGA GGCTCACTGC CGTCGTTGCG GATGGAGCGC GAAGGTTGAG CGGACCGATT TGCCGTTCGC CTATCGCGTG CGTCATGAGC TTGTCGAGCG CCCCAAGGTG TCCATACTCA TACCCAGCAA AGACAAGACT TCGCTCTTGT CCGCTTGCGT GGAAAGCATC GTCGAGAAGA CTTCGTACGA CAACTATGAA ATCGTGGTCA TCGAGAACAA CAGCGTGGAG CCGGAGACGT TCGCCTACTA CGAGGAGGTG CAGCGTCTCG GCAAGGCGAG GGTCGTCGAA TGGCCGGATA CGTTCAACTT CTCGAAAATC ATGAACTTCG GCGTGCGACA GTGCGACGGG GACTACGTCT TGTTGCTGAA CAACGACACC GAGGTGATCA CGCCGAACTA TCTGGAAACG ATGCTGGGAT ATTTCCAGGC CGAAGGCGTG GGGGTTGTGG GCGCGAAGCT CCTGTTCCCC GATGACACCG TGCAGCATGG GGGAGTGGTC TTGGGCCCGT ACCGTTCGGC GGGTCATCTG TTCGCATCGC TGCCCAAGGA CGATCTGGGC TACTTTTGTC GTGCGGTGCT TCCCCAGAAC CTGTCTGCGG TGACCGGCGC TTGCCAGCTC GTCCCCCGCT CGGTGTTCGA GGAGGTCGGA GGCTATACCG AGGCATTCGA AGTTGGCCTG AACGACGTCG ACTTCTGCCT GAAAGTGCGT GAAGCCGGTT ATCGAGTCGT ATGGACGCCC GACGCGCTGC TGTACCATTA TGAATTCTCC TCTCGCGGGC GCGACAGGGA AGGCGCGCAG GCGGAGCGGG CGGAGCGTGA AATCGCGTTG CTGCGTACGC GCTGGCCACG GTATTTCGAG GCGGGCGATC CCTACGTGGG ACCCAATGTG AGTCCTGATT CCCTCTATTT CGGATTGGAC TGCCGATGA
|
Protein sequence | MRIDLKTMCR GDGKGFVLVE LHDVGAASTV ALCVADEKGT VVPSGLYSYL ENGPEGVGPE GVAAPPKTWY DEARSWWCCA VPASRSVRAV AVIPLLETNA WTLAFSAVDD GGRVVAEADA RIGATALKWR SRVNYRLRSQ TCRSIRDIDT RGDANAAASF TRVIEDGEHA IVRAHVNVPF FESSVLDCAL LDGRGCVLHA EPLVLEDTRY RRSSTGDERR ALTLSVRTPL AQRLVTLRVV DEEGLFAPCF ATLDECSFDS LLNSTREETM SAERDPRYDA WFKARAATAS HLVAQSGETV SPAPTFSIVV PLYRTPVEYF RSMLQSVQRQ SYGGWELILV NASPDDGRLV EELENVSDAR VRVVNLEENH GIAENTNAGI RVAQGDFVAF LDHDDVLAPD ALFGYARAVC DDPLVDIVYC DEDRIDSVGV HHAPFFKPDF SPELLNAQNY ITHFLAVRKS LIEEIGLLDA TFDGAQDYDL VLRATERSRS VAHIPRVLYH WRMHEASTSM NSDSKSYAGE AGRAALEAHC RRCGWSAKVE RTDLPFAYRV RHELVERPKV SILIPSKDKT SLLSACVESI VEKTSYDNYE IVVIENNSVE PETFAYYEEV QRLGKARVVE WPDTFNFSKI MNFGVRQCDG DYVLLLNNDT EVITPNYLET MLGYFQAEGV GVVGAKLLFP DDTVQHGGVV LGPYRSAGHL FASLPKDDLG YFCRAVLPQN LSAVTGACQL VPRSVFEEVG GYTEAFEVGL NDVDFCLKVR EAGYRVVWTP DALLYHYEFS SRGRDREGAQ AERAEREIAL LRTRWPRYFE AGDPYVGPNV SPDSLYFGLD CR
|
| |