Gene Information |
Locus tag | Elen_1202 |
Symbol | |
ID | 8415493 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 1441978 |
End bp | 1443837 |
Gene Length | 1860 bp |
Protein Length | 619 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 645024165 |
Product | glycoside hydrolase family 2 sugar binding |
Protein accession | YP_003181561 |
Protein GI | 257790955 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
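The coordinate and length fields above can be cross-checked against each other. The sketch below is only an illustration: it assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1441978
end_bp = 1443837
gene_length_bp = 1860
protein_length_aa = 619


def span_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)                  # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)     # True
```

Under those assumptions the record is internally consistent: the interval spans 1860 bp, which is 620 codons, giving 619 amino acids once the stop codon is excluded.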
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.302396 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.234387 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGGACA TCAAGCGCGT CATCGCGAGC GCTCCGCGCA AGCCGGAGCA AGCGGAGCTG CGGCCGCTGA CCACGCCGTG GAGCGAGCAG GCGGCCGCAG GCAAGGGCCG TGCGGGGCTG CATCCGCATC CGCAGTTCGC GCGGGCGGGG TTCGAGCTGC TCGACGGCTG GTGGGACTAC GCCATCGTCG ACGCCGACAG CGCCGCGCAG GCCTGGCGCG ATGCCGCCCC TCCGAACGCC TGGGACGGGT GCATCCTCGT GCCGTTCTCG CCGGAAGCGC CCCTGTCGGG CGCGGAGCGG CAGTTGCAGC CGGACGAGCT TCTGTGGTAC CGACGGCCGT TCGCCGTGCC GGACGGCATG GACGTGGAGG GCGGTCGGCG CCTCGTTGTG CACTTCGAAG CCGTGGACTA CGCATGCGCA TGCTATGTGA ACAGGGTGCG CGTAGGCGAG CACGTAGGCG GCTACCTGCC GTTCGCGTTC GACGTCACGG ATGCGCTGGT CTCCGGCGAG AACGAGCTGT CCATATGCGT GTGGGACCCG AGCGACGCGG GCGTGCAGCT GCGCGGCAAG CAACGGCTGA AGCGCGGGGG CATCTGGTAC ACCGCGCAAA GCGGCATCTG GCAGAGCGTG TGGCTGGAAG CGGTTCCGGA AGCGCGCATC GAACGGTTGG CCGTCGACGC GAGCGTCGAA GGGCGGCTCA CGTTGCGAGC GGTGGTGCGC GGCGCGGCGG TGGACGGCGG CGAGCTGACC GTGCGGGTGT TCGACGAAGG CGCGGAAGTT GCGCGCGCAT CCGCGGCGCC CGCAGCCGAC GGGACGTGCG AGCCGATCCT CGACGTGGCG CGTCCGCACC GGTGGAGCAC CGACGACCCA CATCTGTACG ATCTGGAGCT GACGTACGGA AGCGACCGCG TGACCAGCTA TTGCGCGTTT CGCACGGTGA GCGTGGAAGC GGACGAGCAC GGCGCGAGGC GGCTGTTCCT CAATCACGAA CCCCTCTTCC TGCGCGGCGT GCTGGACCAG GGGTATTGGC CCGACGGCCT CATGACCGCG CCGTCGGACG AGGCGCTGGC CTTCGACGTC CGGTCGATGA AGGACCTCGG GTTCAACCTG CTGCGCAAGC ACATCAAGGT GGAGAGCGAT CGCTGGTACT ACCACTGCGA CAGGCTGGGC ATGCTCGTGT GGCAGGACAT GGTGAGCGGC GGCGCGGCGC CCAGCCCCTG GCATTCCAGC TACAAGCCCA CCTTCTTCCG CGGCTCGTGG GGCCGCTACG CCGACGACGA CCCGCGCCAC TTCCCCGGGC TCGCCTCCGA CAGCGCGGCG TTTCGCGCCG AGTGGACCGA AGCATGCGAG GACACCGTGC GCTACCTGGG GAACCATCCG TCCATCGTGA CCTGGGTGCT GTTCAACGAG GCCTGGGGGC AGTTCGACGC GCGCAGGGCC GTGGAGCGGG TGCGCGCGAT CGACCCCTCG CGGCCCGTCG ACGCGGTCAG CGGCTGGTAC GACCAGGCCT GCGGGGACTT CCTGAGCGTG CATAACTACT TCCGGCCGCT CGAGGTGTAC CGGGACGAGG CGCGCCCGGC GCGAGCGTTC GTCATATCGG AGTTCGGCGG ATCGTCGTGC CATCTGGCCG ACCACAGCTC GCTGGCCACA TCGTACGGAT ACGCCGCCTG CCCCGACCCG GCATCGTTTC GGGATGCCGT GCACAAGACG CTCGCGCAGG CGGACGCGCT GGAGGCGGAA GGGCTTGCGG GTTACGTGTA CACGCAGCTG TCCGACGTGG AAGAGGAGAC GAACGGGCTG 
CTCACCTACG ACCGGCGCGT CAACAAGCTG GCCGATCCGG CGGAGGAGGA TGCGCGATGA
|
Protein sequence | MLDIKRVIAS APRKPEQAEL RPLTTPWSEQ AAAGKGRAGL HPHPQFARAG FELLDGWWDY AIVDADSAAQ AWRDAAPPNA WDGCILVPFS PEAPLSGAER QLQPDELLWY RRPFAVPDGM DVEGGRRLVV HFEAVDYACA CYVNRVRVGE HVGGYLPFAF DVTDALVSGE NELSICVWDP SDAGVQLRGK QRLKRGGIWY TAQSGIWQSV WLEAVPEARI ERLAVDASVE GRLTLRAVVR GAAVDGGELT VRVFDEGAEV ARASAAPAAD GTCEPILDVA RPHRWSTDDP HLYDLELTYG SDRVTSYCAF RTVSVEADEH GARRLFLNHE PLFLRGVLDQ GYWPDGLMTA PSDEALAFDV RSMKDLGFNL LRKHIKVESD RWYYHCDRLG MLVWQDMVSG GAAPSPWHSS YKPTFFRGSW GRYADDDPRH FPGLASDSAA FRAEWTEACE DTVRYLGNHP SIVTWVLFNE AWGQFDARRA VERVRAIDPS RPVDAVSGWY DQACGDFLSV HNYFRPLEVY RDEARPARAF VISEFGGSSC HLADHSSLAT SYGYAACPDP ASFRDAVHKT LAQADALEAE GLAGYVYTQL SDVEEETNGL LTYDRRVNKL ADPAEEDAR
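The relationship between the gene sequence and the protein sequence can be spot-checked with a short translation sketch. It assumes the gene sequence is listed 5' to 3' on the coding strand (it begins with ATG and ends with TGA, even though the gene lies on the minus strand of the replicon). Translation table 11 assigns the same amino acids to codons as the standard table and differs only in the permitted start codons, so a standard codon map is used here. The helper name `translate` is hypothetical.

```python
from itertools import product

# Codon map in NCBI ordering (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence in this record.
first_groups = "ATGCTGGACA TCAAGCGCGT CATCGCGAGC"
print(translate(first_groups))  # MLDIKRVIAS
```

The output matches the first 10-residue group of the protein sequence. Passing the full gene sequence to the same function should reproduce the whole protein, provided the assumptions above hold.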