Gene Information |
Locus tag | Elen_2145 |
Symbol | |
ID | 8416467 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 2526464 |
End bp | 2528077 |
Gene Length | 1614 bp |
Protein Length | 537 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 645025132 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003182497 |
Protein GI | 257791891 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
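The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the record does not state either convention explicitly, and the function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 2526464
END_BP = 2528077
GENE_LENGTH_BP = 1614
PROTEIN_LENGTH_AA = 537


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one stop codon, assuming an unspliced CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1614
print(expected_protein_length(GENE_LENGTH_BP))  # 537
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.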
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACAGC CTTTGAAAGC AACTTTGTCT CGCCGCACGT TCTTGGCGGG CAGCGCCGTC GCGGCGGCTG CAGCAGGCCT GACTCTGGCC GGCTGCGGTG GCGGCGGCGA AACGACGGAC ACCCCGTCGA CCGATGCCGG CACCGACGCC GGCGCAGCGG CCCAGGGCGG CACGCTGACC GGCGCCATGG CCTACACGAG CACGAACGTC AACCCGATCG GCAACAGCTC CGCGCTGATG CTGGCCGCCA CGTGGCATGT GTTCGAGGGC CTGTACGACC TCGATCTGCA CACCTACAAG ACCTATAACG CGCTTGCCGC CGGCGAGCCC ACGAAGGTCT CCGACACCGA GTACGAGGTT GCCCTGCGCG ACGGTGCCAA GTTCTCCGAC GGCACGGACG TCACCACCGC CGACGTGGTG AACGCGTTCG AGAAGAACAT GGCCGACGCC ACCTACGGCG CCTTCCTCGA ATTCATCGAC ACGGTGTCTG CGAAGGACGA CAAGACCGTC TCCTTCACGC TGAAGTACCC CTTCGACAGC CTGCTGAAGG GCCGTCTGAG CGTGGTCAAG GTGTTCCCCG CCTCGCTGAC CGAGGATGAT CTGAAGACGA AGCCGATCGG TTCCGGCCCG TGGGTGTACG ACACCATCAA CGGCGACGAC GGCGGCTCCA TCGAGTTCGT GCCGAACACG AACTACAACG GCAAGTACGC CGCTACGGCC GACAAGATGC ACTGGGACAT CCTGCTCGAC GACACCTCCC GCACCACCGC GCTGCAGGAG GCCACGGTCC AGGTCATGGA GAACGTGCCC GACGCCAACG CCGAACAGCT CATGGCCGCC GGCGCGTCCG TCGACTACAT CCAGGGCTTC AACCAGCCGT TCTTCATGTT CAACACGCTC AAGAAGCCGT TCGACGATAA GCGCGTCCGC CAGGCGTTCT ACTATGCCGT GGACGTGGAC AAGCTGATCT CCAACGCCAT GGCCGGCCAT GCCGCGAAGG TGACGAGCTT CCTGCCCGAG AGCCACGAGA ACTACCACAA GGCTTCCACG GTGTACACCT ACGACCCCGA GAAGGCCAAG AGCCTGCTTT CCGAGGCCGG CGTCACCGAC CTGAGCTTCG AGCTGATGAC GAACAACAAC TGGGTGAAGA ACCTGGCCGC CGGCATCAAG AACGACCTCG ATGCCATCGG CGTGAACTGC ACCATCAACG AGACGAAGAT CGACTGGGCG TCTCTGGCCG AGTCGGCCGA CGTGCTGCCC TACGACGTCA TGCTGACCCC GGGCGACCCG ACCTGCTTCG GCAACGACCC CGACCTGCTG ATGTCCTGGT GGTACGGCGA CAACGTGTGG ACCCAGGGCC GCAGCTGCTG GAAGAAGGCC GGCGACGGCA AGTTCGACGA GCTGCAGACC CTCATGCAGC AGGCTCGCGA GGCCACCGGC AACGAGCAGC AGGAGCTGTG GAACAAGTGC TTCGACCTTC TGGCCGAGGA AGTTCCGCTG TACCCGCTGT TCCACCGCGA GCTGGCCACG GGCTACCAGG AGACGCAGAT CACCGGCTTC GAGCCCATCG CCACGACGGG CCTCGTGTTC CTCGGTGCGA GCGTGAAGGC GTAA
|
Protein sequence | MEQPLKATLS RRTFLAGSAV AAAAAGLTLA GCGGGGETTD TPSTDAGTDA GAAAQGGTLT GAMAYTSTNV NPIGNSSALM LAATWHVFEG LYDLDLHTYK TYNALAAGEP TKVSDTEYEV ALRDGAKFSD GTDVTTADVV NAFEKNMADA TYGAFLEFID TVSAKDDKTV SFTLKYPFDS LLKGRLSVVK VFPASLTEDD LKTKPIGSGP WVYDTINGDD GGSIEFVPNT NYNGKYAATA DKMHWDILLD DTSRTTALQE ATVQVMENVP DANAEQLMAA GASVDYIQGF NQPFFMFNTL KKPFDDKRVR QAFYYAVDVD KLISNAMAGH AAKVTSFLPE SHENYHKAST VYTYDPEKAK SLLSEAGVTD LSFELMTNNN WVKNLAAGIK NDLDAIGVNC TINETKIDWA SLAESADVLP YDVMLTPGDP TCFGNDPDLL MSWWYGDNVW TQGRSCWKKA GDGKFDELQT LMQQAREATG NEQQELWNKC FDLLAEEVPL YPLFHRELAT GYQETQITGF EPIATTGLVF LGASVKA
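The relationship between the gene sequence and the protein sequence above can be reproduced with a small translator. This is a minimal sketch, not the database's own tooling: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and it uses the standard amino acid assignments, which translation table 11 shares with table 1. Table 11 differs in which codons may act as start codons, which does not matter here because this gene begins with ATG.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the 10-base block spacing used in the record and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGGAACAGC CTTTGAAAGC AACTTTGTCT"))  # MEQPLKATLS
```

Passing the full gene sequence to `translate` should yield the protein sequence listed above, and `gc_fraction` can be used in the same way to compare against the GC content field.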