Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2576 |
Symbol | |
ID | 8416901 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 3013127 |
End bp | 3014971 |
Gene Length | 1845 bp |
Protein Length | 614 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 645025556 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003182918 |
Protein GI | 257792312 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAAGCGT TGAAGAGAGC CGGTAAGGCG GCGAAGACGT TCGCGTCGAT CGTGCTAGCG GGGTCGCTGG TCGGCATCGG ACTGGTCGGC TGTTCCGGCG AGGCTATGGG CAATGCGTCC GGACGCGACG ATGAGATAGT GCTGACGGTA TCGAAATGCC AGACCAAGGA GCTGCCGCAG GAGCTGTTGG ACGAGATGAC CAATCGCCAC CCCAACCTGC GCTTCGAGTT CGATACGTAT TCGAACAGCA ACTATTCCGC GCAGATCGTC ACCGAGCTCG AGCAGCGCGA TATCCCCGAT ATCCTGATCA ACACGCGCAA TCAGGATCTG ACCGAGGATT TGGAGCATAA TCTGGTGGAT CTCGCGGCGT ACGATTTTTC CAGCGAGTAC CTTCCCAGCG TTCTCGATCG CATGACCATC GACGGCAGCC TCTACTATCT GCCGGGTTAC CTGACGCTTG CAGGGTTCTT CTACAACAAG GACCTTTTCG CCGAGCACGG CTGGGAGGCG CCTCAGTCGC TCGAGGAACT CATTGCCCTC AACGAGCAGG CGAAGGCGGA GGGCATCCGG CTTATGGCGT ACTCGATGGA GCTGACGGGT CAACGCTTCC TGCAGCTGAC CAATATCGCT TCGGCGCAGT TTCTGCACAC GCCGCAGGGG TCGTCTTGGG AACAGGATTA TCTTGCGGGC GAAGCGAGCA TGGTGGGCAC GTTCGGGCCT TTCATGGACG AGTATCGGCT TTGGCTCGAC AGCGGATTGA TTTCGGCTGA CGACCTTTCC CTGTCGAATT CGGACGCCGC GGAGATGTTC GCGAACGGTG ACGTGGCCAT GATCTACGGC GTTGCCAACA ACGTGAAGAC CACTGATTTC GACTTCGATT TGGGCCAAGC TCCGTTCCTT GCGAGAGGCG AGGGCGAAGA TAACGGCTGG TACCTGTATG CGGTCAGCTC GTACTACGGT ATTAACAAGA AGCTGGAAGA GCCGGGCAAC GAGGAGAAGC TGGCCATAGC GCTGGAGATG TTCGACCTGA TGAGCACCCC CGAGGGCCAG AGCATGTTCA CGGATGGCGC GGAAGGGCGA TATCCGGCCA CGAGAAAGGC GGACGGCGAG CTCCACGCGC CGCTTTTGAG CGATTACCGC AACGTGGTCG ACCGCAATAA CCTTGTGGAG TTGGCGGCTT ACACTGCGCC GCTCTTGCTG GGAGGAGAGG CGCTGGGCGG GTACATCGCT GGCACCGTCA GCGCCGAGGA AGCGTTGCAG GCCTGCGACG AGGCTATGAA GTCCAACAAG TCGGAAACCC AGATCGGCGA CGTAGTGGCA CATATCGAAC GCGACCTGAG CCGGGAGGAG ACCGTCCGAT ACTTCGCCGA CGCGTTCAGG GAGTACGGGG GCACCGACCT GTGCCTTATG CTGCCTGGCG GCATGGCAGA CGGCCAAATG CATCCTTATG GGATTTCCGG CAAGCTGTAC GAAGGGGAGC TGCACGCCAA TGAGCTGACC GTGCTCTTGC CCAATGCGGG GAAGCCGGTG CCTACGCTGG CCACTGCGCG CATTTCGGGC GAAGACCTGC GCGCGGTGCT AGAGAGCGGG CGCACGTTCG AGCGGAAGGA CGCGTCGAAG GAGGCCCTTG CTCCGTTCCG CTACGAGGTG TCGGGAGCCG AGGTGGACTA TGATGCAGAC CGCAAAGTGC GATCGCTCAA AGTGAACGGC GTTGAAGTCG CCGACGAAGA CGTATTCACG GTGACGTATT TCGACGGGGC GGTCGAAACG TCCCGCTTGA CCGATGCGGC GGTGTCGGAC GTGAAGCCCG TCCCCGCGTT CACTGCTTGC AAGGCCGCGC GTTGA
|
Protein sequence | MKALKRAGKA AKTFASIVLA GSLVGIGLVG CSGEAMGNAS GRDDEIVLTV SKCQTKELPQ ELLDEMTNRH PNLRFEFDTY SNSNYSAQIV TELEQRDIPD ILINTRNQDL TEDLEHNLVD LAAYDFSSEY LPSVLDRMTI DGSLYYLPGY LTLAGFFYNK DLFAEHGWEA PQSLEELIAL NEQAKAEGIR LMAYSMELTG QRFLQLTNIA SAQFLHTPQG SSWEQDYLAG EASMVGTFGP FMDEYRLWLD SGLISADDLS LSNSDAAEMF ANGDVAMIYG VANNVKTTDF DFDLGQAPFL ARGEGEDNGW YLYAVSSYYG INKKLEEPGN EEKLAIALEM FDLMSTPEGQ SMFTDGAEGR YPATRKADGE LHAPLLSDYR NVVDRNNLVE LAAYTAPLLL GGEALGGYIA GTVSAEEALQ ACDEAMKSNK SETQIGDVVA HIERDLSREE TVRYFADAFR EYGGTDLCLM LPGGMADGQM HPYGISGKLY EGELHANELT VLLPNAGKPV PTLATARISG EDLRAVLESG RTFERKDASK EALAPFRYEV SGAEVDYDAD RKVRSLKVNG VEVADEDVFT VTYFDGAVET SRLTDAAVSD VKPVPAFTAC KAAR
|
| |