Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2052 |
Symbol | |
ID | 8416363 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2405761 |
End bp | 2408145 |
Gene Length | 2385 bp |
Protein Length | 794 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 645025029 |
Product | glycosyl transferase family 2 |
Protein accession | YP_003182405 |
Protein GI | 257791799 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.039975 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATGCGA GGTTGCTCGT GGACGAGATG GCTTCCGACA TCTCGCTAAA AGCGAAGGCT TCCTCGAACG ACCTGATTGT CCCCTGTGGG TTGTACGAGA CCCTTTCGAG CGAGACTGGT AGCGAGCGGA CGTTCGTGCT CGTGTTTCCA ATCTTACGCA TCCGGAGTGT TTCGCTTTCT ATCTTCGGGG TTGACGAGAA AGGAAGCGCG CTTGACCAAT GCTCCTTTTC GATCAATTTC GAAAAGGCGA AGTGGCAGTC GCGTATAAAC TATCGTTTCA ACAAGGAGCT ATGCAAAGAA ATCAGGGACT ACGATAAGAT TGGAACGTAC AGCAAAATAA GCATGGAGTT TTGGGACTGC ATATCCGACA AGAATGTAAA CATCTTGCGT GGTCTTGTCC GCATGCCGTA CAGAAACGAC AGTATCGTTC AAATTACCTG TACAAACGAT AGTCTCCAAG AGATTGCGAT AAGCCCGGTG TTTCTCAGCG ATGTTAAGGT TCGATCGGAC GTTTCGGAAC GCATCTTTTT CCGAGAGATA CAGTTCTCGA TTCGCGTTCC CAACAAGATT CAGAACCTCG TGTTCCGCTT GATCGATAAG GCCCATCCCG AGCTCGATAG CTTTGAGGCT ATCGAGGACC ATGCCTATAG GAAAATACGA GATGACAGCA ATGAGATAAT GCTCAGTGCG CAAAGCGATC CCTGCTACCC GAAATGGTTT GAGGAGCATA GGGTAGATTT AGGAGCTCTC GCCAAACAGC ATGAAACGTT TTTCGATTAT CGGCCGCTTT TCAGCATCGT GGTGCCGCTT TACAAAACCC CGAAACCCTT TTTCCTGGAT ATGCTGAATT CCGTTGTTTC GCAGAGCTAT GGGCGCTGGG AACTTATCCT AGTCAACGCC AGTCCGAAAG ACGAGGTGCT TGTCGGCTTG GTCGAAGAGG CATCGTCGAA CGACAAGCGC ATTAAAAGCG TTGTTCTTGA GTCGAACGGC GGGATTTCCG AGAACACCAA TGCCGGTTTG GCGGTTTCGT CAGGCGATTT CGTTTGCTAT TTCGATCACG ACGATCTTCT CGAACCTGAT CTTCTCTTTG AGTACGCAAA AGCCTTGAAT GCCGACGAAA GCATTGATCT GTTGTATTGC GACGAAGATA AGATGTTGCC AAGCGGCACG TTGGCCGAGC CCTTCTTCAA GCCGGATTTC AACATAGATT TGCTGCGCGA CAACAATTAT ATCTGCCACT TGCTGACGAT TCGAAAGAGT CTTCTTGACG AGTTAGAGCC CAACACGGCG CAGTTCGATG GTGCCCAGGA TCACAATATG ACACTTCAGG CTTCCGAACG TGCGAGGAAG ATACATCATG TTGCCCGTGT GCTCTATCAT TGGCGCATCA GCGAATCCTC TACGGCGGCG AATGCCGACA ACAAGCCTTA CGCAACGCAA GCCGGTATAA AAGCAGTCCA AAACCACTTG GATAGACTCG GCATTCGGGC CTGCGTGAGG CAGTCGCGAC GCCCGTTTAC GTACAGCGTC GACTATCTGC CGCCTGAGAG CGAACCGCTT GTGTCGATTA TCATCCCAAC TAAAGACCAC AGTGATGTGC TTCGTACGTG CGTGGAATCC GTTTTAGACA GAACAACCTA TGACAAGTAC GAGATTGTGA TCGTAGAGAA TAACAGCACG GAGCCGAAAA CGTTTGCCTA TTACGAGGAA TTAGAAAAAG AGCATGGTGA TCGCATTCGG ATTGAATATT GGCCGGCTGA GTTTAACTTC TCGAAGCTCA TCAATTTCGG CGTTTCGAAA GCAAGAGGAG ATCTTCTCTT GCTTTTGAAC AACGATACCG AGGTGATCAC CCCAGAGTGG ATGGAGCGCA TGGTCGGTAT CTGTTCTCGC GAAGACGTCG GCGTCGTTGG TGTGCGTCTG TATTTTAGGG ATGAGACTAT TCAGCATGCA GGCGTATGCG TTTCCGGTGG GGTTGCGGGG CATCTTGGTC GCAATCTGCC CAAAGGCAAC TGGGGCTATT TTTCTCTGAG CGATGCAACG CAAGACATGA GTGCAGTTAC TGCTGCTTGC ATGATGACTA AACGAGGCGT TTTCGAAAGT GTTGACGGAT TCTCTGAGGA ATTGGCAGTT GCGTTCAACG ATGTAGACTA TTGCTTGAAA GTCAGAGATA TGGAATTGCT GGTCGTTTAC ACGCCCGAAG TGGAGCTGTT CCACTATGAG TCCCTGTCTC GAGGATTCGA GAGTAGCGCT GAAAAGAAGA TAAGGTTCCA TCGCGAGGTT TCGTTCATGA ATTACAGATG GGCGGAATAT TATGTCAAAG GGGATCCATA TGCGAATCCC AATCTGTCGA CAAACGAGCC TTATAACTGC TATTACCATC TGTAA
|
Protein sequence | MYARLLVDEM ASDISLKAKA SSNDLIVPCG LYETLSSETG SERTFVLVFP ILRIRSVSLS IFGVDEKGSA LDQCSFSINF EKAKWQSRIN YRFNKELCKE IRDYDKIGTY SKISMEFWDC ISDKNVNILR GLVRMPYRND SIVQITCTND SLQEIAISPV FLSDVKVRSD VSERIFFREI QFSIRVPNKI QNLVFRLIDK AHPELDSFEA IEDHAYRKIR DDSNEIMLSA QSDPCYPKWF EEHRVDLGAL AKQHETFFDY RPLFSIVVPL YKTPKPFFLD MLNSVVSQSY GRWELILVNA SPKDEVLVGL VEEASSNDKR IKSVVLESNG GISENTNAGL AVSSGDFVCY FDHDDLLEPD LLFEYAKALN ADESIDLLYC DEDKMLPSGT LAEPFFKPDF NIDLLRDNNY ICHLLTIRKS LLDELEPNTA QFDGAQDHNM TLQASERARK IHHVARVLYH WRISESSTAA NADNKPYATQ AGIKAVQNHL DRLGIRACVR QSRRPFTYSV DYLPPESEPL VSIIIPTKDH SDVLRTCVES VLDRTTYDKY EIVIVENNST EPKTFAYYEE LEKEHGDRIR IEYWPAEFNF SKLINFGVSK ARGDLLLLLN NDTEVITPEW MERMVGICSR EDVGVVGVRL YFRDETIQHA GVCVSGGVAG HLGRNLPKGN WGYFSLSDAT QDMSAVTAAC MMTKRGVFES VDGFSEELAV AFNDVDYCLK VRDMELLVVY TPEVELFHYE SLSRGFESSA EKKIRFHREV SFMNYRWAEY YVKGDPYANP NLSTNEPYNC YYHL
|
| |