Gene Elen_1202 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1202 
Symbol 
ID8415493 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1441978 
End bp1443837 
Gene Length1860 bp 
Protein Length619 aa 
Translation table11 
GC content69% 
IMG OID645024165 
Productglycoside hydrolase family 2 sugar binding 
Protein accessionYP_003181561 
Protein GI257790955 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.302396 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.234387 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGGACA TCAAGCGCGT CATCGCGAGC GCTCCGCGCA AGCCGGAGCA AGCGGAGCTG 
CGGCCGCTGA CCACGCCGTG GAGCGAGCAG GCGGCCGCAG GCAAGGGCCG TGCGGGGCTG
CATCCGCATC CGCAGTTCGC GCGGGCGGGG TTCGAGCTGC TCGACGGCTG GTGGGACTAC
GCCATCGTCG ACGCCGACAG CGCCGCGCAG GCCTGGCGCG ATGCCGCCCC TCCGAACGCC
TGGGACGGGT GCATCCTCGT GCCGTTCTCG CCGGAAGCGC CCCTGTCGGG CGCGGAGCGG
CAGTTGCAGC CGGACGAGCT TCTGTGGTAC CGACGGCCGT TCGCCGTGCC GGACGGCATG
GACGTGGAGG GCGGTCGGCG CCTCGTTGTG CACTTCGAAG CCGTGGACTA CGCATGCGCA
TGCTATGTGA ACAGGGTGCG CGTAGGCGAG CACGTAGGCG GCTACCTGCC GTTCGCGTTC
GACGTCACGG ATGCGCTGGT CTCCGGCGAG AACGAGCTGT CCATATGCGT GTGGGACCCG
AGCGACGCGG GCGTGCAGCT GCGCGGCAAG CAACGGCTGA AGCGCGGGGG CATCTGGTAC
ACCGCGCAAA GCGGCATCTG GCAGAGCGTG TGGCTGGAAG CGGTTCCGGA AGCGCGCATC
GAACGGTTGG CCGTCGACGC GAGCGTCGAA GGGCGGCTCA CGTTGCGAGC GGTGGTGCGC
GGCGCGGCGG TGGACGGCGG CGAGCTGACC GTGCGGGTGT TCGACGAAGG CGCGGAAGTT
GCGCGCGCAT CCGCGGCGCC CGCAGCCGAC GGGACGTGCG AGCCGATCCT CGACGTGGCG
CGTCCGCACC GGTGGAGCAC CGACGACCCA CATCTGTACG ATCTGGAGCT GACGTACGGA
AGCGACCGCG TGACCAGCTA TTGCGCGTTT CGCACGGTGA GCGTGGAAGC GGACGAGCAC
GGCGCGAGGC GGCTGTTCCT CAATCACGAA CCCCTCTTCC TGCGCGGCGT GCTGGACCAG
GGGTATTGGC CCGACGGCCT CATGACCGCG CCGTCGGACG AGGCGCTGGC CTTCGACGTC
CGGTCGATGA AGGACCTCGG GTTCAACCTG CTGCGCAAGC ACATCAAGGT GGAGAGCGAT
CGCTGGTACT ACCACTGCGA CAGGCTGGGC ATGCTCGTGT GGCAGGACAT GGTGAGCGGC
GGCGCGGCGC CCAGCCCCTG GCATTCCAGC TACAAGCCCA CCTTCTTCCG CGGCTCGTGG
GGCCGCTACG CCGACGACGA CCCGCGCCAC TTCCCCGGGC TCGCCTCCGA CAGCGCGGCG
TTTCGCGCCG AGTGGACCGA AGCATGCGAG GACACCGTGC GCTACCTGGG GAACCATCCG
TCCATCGTGA CCTGGGTGCT GTTCAACGAG GCCTGGGGGC AGTTCGACGC GCGCAGGGCC
GTGGAGCGGG TGCGCGCGAT CGACCCCTCG CGGCCCGTCG ACGCGGTCAG CGGCTGGTAC
GACCAGGCCT GCGGGGACTT CCTGAGCGTG CATAACTACT TCCGGCCGCT CGAGGTGTAC
CGGGACGAGG CGCGCCCGGC GCGAGCGTTC GTCATATCGG AGTTCGGCGG ATCGTCGTGC
CATCTGGCCG ACCACAGCTC GCTGGCCACA TCGTACGGAT ACGCCGCCTG CCCCGACCCG
GCATCGTTTC GGGATGCCGT GCACAAGACG CTCGCGCAGG CGGACGCGCT GGAGGCGGAA
GGGCTTGCGG GTTACGTGTA CACGCAGCTG TCCGACGTGG AAGAGGAGAC GAACGGGCTG
CTCACCTACG ACCGGCGCGT CAACAAGCTG GCCGATCCGG CGGAGGAGGA TGCGCGATGA
 
Protein sequence
MLDIKRVIAS APRKPEQAEL RPLTTPWSEQ AAAGKGRAGL HPHPQFARAG FELLDGWWDY 
AIVDADSAAQ AWRDAAPPNA WDGCILVPFS PEAPLSGAER QLQPDELLWY RRPFAVPDGM
DVEGGRRLVV HFEAVDYACA CYVNRVRVGE HVGGYLPFAF DVTDALVSGE NELSICVWDP
SDAGVQLRGK QRLKRGGIWY TAQSGIWQSV WLEAVPEARI ERLAVDASVE GRLTLRAVVR
GAAVDGGELT VRVFDEGAEV ARASAAPAAD GTCEPILDVA RPHRWSTDDP HLYDLELTYG
SDRVTSYCAF RTVSVEADEH GARRLFLNHE PLFLRGVLDQ GYWPDGLMTA PSDEALAFDV
RSMKDLGFNL LRKHIKVESD RWYYHCDRLG MLVWQDMVSG GAAPSPWHSS YKPTFFRGSW
GRYADDDPRH FPGLASDSAA FRAEWTEACE DTVRYLGNHP SIVTWVLFNE AWGQFDARRA
VERVRAIDPS RPVDAVSGWY DQACGDFLSV HNYFRPLEVY RDEARPARAF VISEFGGSSC
HLADHSSLAT SYGYAACPDP ASFRDAVHKT LAQADALEAE GLAGYVYTQL SDVEEETNGL
LTYDRRVNKL ADPAEEDAR