Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1805 |
Symbol | |
ID | 8416109 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 2117972 |
End bp | 2120035 |
Gene Length | 2064 bp |
Protein Length | 687 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 645024776 |
Product | sulfatase |
Protein accession | YP_003182159 |
Protein GI | 257791553 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00015883 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.251832 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGTAC GTCGGCGCCT CGAAACGGCG CTCGCATCGG CCGCTGCGCT TCTTTCCAAC CTGCGCGCTC GCCTGTCCCC CGCGCCCTGC GACCTCGTCG GTGCGGTTCT GTGCGCCTTA GCGGCGCTGT TCCTGATTCT GTACGCTGCC AGCGGCTTCT CCGACGCGCC CTGGTTCGCC GTGGCGGGCG CTGCGGCCGT GCTGCTGGGC TGCGGCTGCG CGCTGCTGCT GCGCGGCCGG CGCTCCCAGC GGGCCTTGGA GCGCTGGCGC CTTGCCCGCC CCATCGTGCT GCTGCTAGCG GTTCCCATAG GATTCTACCT TCTGGAACGT CCCTGGAACG ATCAGCTGCT CGCCATGGAT CCGTTCTACG CCGCGGTGAA CCTGTGCGTT CTGGGCGCGC TGTTCGCCAT CGTGTACGCG GCCGGGCAGC GCACCCGCGG CGCCGTCGTC GCATTCTTGG CCGCCTGCCT CCTCGCCGGT ACGGCGAACC ACTTCGTCAT CCTGTTCAAG GGCCAGCCCA TCGTGCCCGC CGACGTGTTC GCCCTGTCGA CGGCGGCCTC GGTGGGCGCG GGCTACACGT TCGCGCTCGA CGCGCGCTTG CTGGCGGCCG TCTCCGTGTT CGCGATCGCC TCGACGGCCA TCGCCTTCCT GCCGAAGGTG GCGGCCACTC CGCGTCGCGC GGTGGCGAAC GGCGCGTGCG CTTTAGCGCT GGCAAGCTGC TTCGGCTGGT GGATAGGAAC GTACGACATC GAGGAAGCCT ATGCCTGCAC GGTGGATGTA TGGGGCGTGA AGGAGTCGTA CGCCGAGCAG GGATCGACGT TGTGCTTCCT CAAACGCGTG CAGGATCTGA GCCCTGCACG CCCCGAAGGA TACGATGCCG ATGCCGTTTC GGACCTGCTG CGCTCGCTCT CAGACGATAG CGCGAGCATG CCGGACGGCG CCGAAGCCCC TACCGTCATC GCGATCATGA ACGAGACGTT CTCCGACCTC TCGCGCTACC CGGGCTTGGC CGGCACCGAC GCGCGTCCCG CCCATTACTA CGATATCGCG GCCGAGGCGC TCGAAGCGGG CGACGCATAC GCGTCGGCGC TCGGCGGGGG CACGTGCAAC AGCGAGTTCG AGTTCCTCAC GGGATCGAGC ATGGGGCACC TCGGCGGCGG CGTGTACCCC TACGTGCTGT ACGACCTTGA CGGAACGGAG AGCCTCGTGT CGTACTTCTC GTCGCTCGGA TACGCCACCC ATGCCGTGCA CCCCGCCGAG AGCACGAACT GGCGTCGCGA CCGCGTGTAC GAGCAGCTGG GCTTCGACGA GTTCGCAGAC CAGCGGGCGT TCGCGAACGC CGACACGCTG CGCGGGCTCA CCACCGACCG CGCCACCTAC GACTACGTGC TCGACCTGCT GGAAGCCGAC GAGGGCCCGC AGTTCGTGTT CGACGTGACG CTGCAGAACC ACGGCGGCTA CGACGTAGGA GGCCTGTCCG ACGAGCTTGC GGTGAGCGTG CCCTTAGGCG ACGGCAGCAG GTCGTCGGAG CTGGACGAGT ACGCCAGCGT CATACGCCAG GCGGATCGCG ACCTCGCCTA CCTCGTGGAT CGGCTGAACG CTCTCGACCG CCCCGTGGTG CTATGCTTTT TCGGCGACCA CCAGCCGGGC TTCAGCGATT GGCTGTTCGA GGCGACCCAC GACGGAGCTG CAGCAGACGA CCTGGGGCTC GAGGCCGTGC AGGAGCGCTA CACGGTGCCA TACCTCATCT GGGCGAACGA CGCAGCCCGC GCGCAAGGCG CACACGAGCC TCAGGGCGTC GCGCACGAGC GCACGAGCCT TAACTACCTG GGGTCGAGGC TCGTCGAGGC CGCAGGCCTG CCCACGACGA GCTATCAGCG CTTCCTGCTG GCCATGCGCG AAGCCGTCCC CGCCATCAAT CTGAACGGAT TCCTCACGGC CGACGGCATT TGGCACGGAT TCGGCAACGA GGAGGCGGCG GGCGTGCTCG ACGCGCTGCA AGCGTACGCA ACCGTCCAGT ACGACAACCT CTTCAACAAG GACTCGGCCT GGGCGGTGAA GTAA
|
Protein sequence | MTVRRRLETA LASAAALLSN LRARLSPAPC DLVGAVLCAL AALFLILYAA SGFSDAPWFA VAGAAAVLLG CGCALLLRGR RSQRALERWR LARPIVLLLA VPIGFYLLER PWNDQLLAMD PFYAAVNLCV LGALFAIVYA AGQRTRGAVV AFLAACLLAG TANHFVILFK GQPIVPADVF ALSTAASVGA GYTFALDARL LAAVSVFAIA STAIAFLPKV AATPRRAVAN GACALALASC FGWWIGTYDI EEAYACTVDV WGVKESYAEQ GSTLCFLKRV QDLSPARPEG YDADAVSDLL RSLSDDSASM PDGAEAPTVI AIMNETFSDL SRYPGLAGTD ARPAHYYDIA AEALEAGDAY ASALGGGTCN SEFEFLTGSS MGHLGGGVYP YVLYDLDGTE SLVSYFSSLG YATHAVHPAE STNWRRDRVY EQLGFDEFAD QRAFANADTL RGLTTDRATY DYVLDLLEAD EGPQFVFDVT LQNHGGYDVG GLSDELAVSV PLGDGSRSSE LDEYASVIRQ ADRDLAYLVD RLNALDRPVV LCFFGDHQPG FSDWLFEATH DGAAADDLGL EAVQERYTVP YLIWANDAAR AQGAHEPQGV AHERTSLNYL GSRLVEAAGL PTTSYQRFLL AMREAVPAIN LNGFLTADGI WHGFGNEEAA GVLDALQAYA TVQYDNLFNK DSAWAVK
|
| |