Gene Elen_1805 details


Gene Information

Locus tag: Elen_1805
Symbol:
ID: 8416109
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 2117972
End bp: 2120035
Gene Length: 2064 bp
Protein Length: 687 aa
Translation table: 11
GC content: 68%
IMG OID: 645024776
Product: sulfatase
Protein accession: YP_003182159
Protein GI: 257791553
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily
TIGRFAM ID:
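
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length_aa(length_bp: int) -> int:
    """Number of residues for an unspliced CDS whose last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2117972, 2120035)
print(length)                              # 2064
print(expected_protein_length_aa(length))  # 687
```

Both results agree with the Gene Length (2064 bp) and Protein Length (687 aa) fields, which supports the assumed coordinate convention.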


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00015883
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.251832
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGTAC GTCGGCGCCT CGAAACGGCG CTCGCATCGG CCGCTGCGCT TCTTTCCAAC 
CTGCGCGCTC GCCTGTCCCC CGCGCCCTGC GACCTCGTCG GTGCGGTTCT GTGCGCCTTA
GCGGCGCTGT TCCTGATTCT GTACGCTGCC AGCGGCTTCT CCGACGCGCC CTGGTTCGCC
GTGGCGGGCG CTGCGGCCGT GCTGCTGGGC TGCGGCTGCG CGCTGCTGCT GCGCGGCCGG
CGCTCCCAGC GGGCCTTGGA GCGCTGGCGC CTTGCCCGCC CCATCGTGCT GCTGCTAGCG
GTTCCCATAG GATTCTACCT TCTGGAACGT CCCTGGAACG ATCAGCTGCT CGCCATGGAT
CCGTTCTACG CCGCGGTGAA CCTGTGCGTT CTGGGCGCGC TGTTCGCCAT CGTGTACGCG
GCCGGGCAGC GCACCCGCGG CGCCGTCGTC GCATTCTTGG CCGCCTGCCT CCTCGCCGGT
ACGGCGAACC ACTTCGTCAT CCTGTTCAAG GGCCAGCCCA TCGTGCCCGC CGACGTGTTC
GCCCTGTCGA CGGCGGCCTC GGTGGGCGCG GGCTACACGT TCGCGCTCGA CGCGCGCTTG
CTGGCGGCCG TCTCCGTGTT CGCGATCGCC TCGACGGCCA TCGCCTTCCT GCCGAAGGTG
GCGGCCACTC CGCGTCGCGC GGTGGCGAAC GGCGCGTGCG CTTTAGCGCT GGCAAGCTGC
TTCGGCTGGT GGATAGGAAC GTACGACATC GAGGAAGCCT ATGCCTGCAC GGTGGATGTA
TGGGGCGTGA AGGAGTCGTA CGCCGAGCAG GGATCGACGT TGTGCTTCCT CAAACGCGTG
CAGGATCTGA GCCCTGCACG CCCCGAAGGA TACGATGCCG ATGCCGTTTC GGACCTGCTG
CGCTCGCTCT CAGACGATAG CGCGAGCATG CCGGACGGCG CCGAAGCCCC TACCGTCATC
GCGATCATGA ACGAGACGTT CTCCGACCTC TCGCGCTACC CGGGCTTGGC CGGCACCGAC
GCGCGTCCCG CCCATTACTA CGATATCGCG GCCGAGGCGC TCGAAGCGGG CGACGCATAC
GCGTCGGCGC TCGGCGGGGG CACGTGCAAC AGCGAGTTCG AGTTCCTCAC GGGATCGAGC
ATGGGGCACC TCGGCGGCGG CGTGTACCCC TACGTGCTGT ACGACCTTGA CGGAACGGAG
AGCCTCGTGT CGTACTTCTC GTCGCTCGGA TACGCCACCC ATGCCGTGCA CCCCGCCGAG
AGCACGAACT GGCGTCGCGA CCGCGTGTAC GAGCAGCTGG GCTTCGACGA GTTCGCAGAC
CAGCGGGCGT TCGCGAACGC CGACACGCTG CGCGGGCTCA CCACCGACCG CGCCACCTAC
GACTACGTGC TCGACCTGCT GGAAGCCGAC GAGGGCCCGC AGTTCGTGTT CGACGTGACG
CTGCAGAACC ACGGCGGCTA CGACGTAGGA GGCCTGTCCG ACGAGCTTGC GGTGAGCGTG
CCCTTAGGCG ACGGCAGCAG GTCGTCGGAG CTGGACGAGT ACGCCAGCGT CATACGCCAG
GCGGATCGCG ACCTCGCCTA CCTCGTGGAT CGGCTGAACG CTCTCGACCG CCCCGTGGTG
CTATGCTTTT TCGGCGACCA CCAGCCGGGC TTCAGCGATT GGCTGTTCGA GGCGACCCAC
GACGGAGCTG CAGCAGACGA CCTGGGGCTC GAGGCCGTGC AGGAGCGCTA CACGGTGCCA
TACCTCATCT GGGCGAACGA CGCAGCCCGC GCGCAAGGCG CACACGAGCC TCAGGGCGTC
GCGCACGAGC GCACGAGCCT TAACTACCTG GGGTCGAGGC TCGTCGAGGC CGCAGGCCTG
CCCACGACGA GCTATCAGCG CTTCCTGCTG GCCATGCGCG AAGCCGTCCC CGCCATCAAT
CTGAACGGAT TCCTCACGGC CGACGGCATT TGGCACGGAT TCGGCAACGA GGAGGCGGCG
GGCGTGCTCG ACGCGCTGCA AGCGTACGCA ACCGTCCAGT ACGACAACCT CTTCAACAAG
GACTCGGCCT GGGCGGTGAA GTAA
 
Protein sequence
MTVRRRLETA LASAAALLSN LRARLSPAPC DLVGAVLCAL AALFLILYAA SGFSDAPWFA 
VAGAAAVLLG CGCALLLRGR RSQRALERWR LARPIVLLLA VPIGFYLLER PWNDQLLAMD
PFYAAVNLCV LGALFAIVYA AGQRTRGAVV AFLAACLLAG TANHFVILFK GQPIVPADVF
ALSTAASVGA GYTFALDARL LAAVSVFAIA STAIAFLPKV AATPRRAVAN GACALALASC
FGWWIGTYDI EEAYACTVDV WGVKESYAEQ GSTLCFLKRV QDLSPARPEG YDADAVSDLL
RSLSDDSASM PDGAEAPTVI AIMNETFSDL SRYPGLAGTD ARPAHYYDIA AEALEAGDAY
ASALGGGTCN SEFEFLTGSS MGHLGGGVYP YVLYDLDGTE SLVSYFSSLG YATHAVHPAE
STNWRRDRVY EQLGFDEFAD QRAFANADTL RGLTTDRATY DYVLDLLEAD EGPQFVFDVT
LQNHGGYDVG GLSDELAVSV PLGDGSRSSE LDEYASVIRQ ADRDLAYLVD RLNALDRPVV
LCFFGDHQPG FSDWLFEATH DGAAADDLGL EAVQERYTVP YLIWANDAAR AQGAHEPQGV
AHERTSLNYL GSRLVEAAGL PTTSYQRFLL AMREAVPAIN LNGFLTADGI WHGFGNEEAA
GVLDALQAYA TVQYDNLFNK DSAWAVK
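
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of joining the blocks and computing a GC percentage. The helper names are hypothetical, and the exact method the source database used to derive its GC content field is not stated, so this is only one plausible way to reproduce it.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "ATGACCGTAC GTCGGCGCCT CGAAACGGCG CTCGCATCGG CCGCTGCGCT TCTTTCCAAC"
seq = clean_sequence(first_line)
print(len(seq))                   # 60 (six blocks of ten bases)
print(round(gc_percent(seq), 1))  # GC percentage of this first line only
```

Passing the complete gene sequence through the same two functions would give a value to compare against the GC content field in the Gene Information section.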