Gene Information |
Locus tag | Elen_1725 |
Symbol | |
ID | 8416024 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 2031565 |
End bp | 2033292 |
Gene Length | 1728 bp |
Protein Length | 575 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 645024691 |
Product | oligopeptide transporter, OPT superfamily |
Protein accession | YP_003182079 |
Protein GI | 257791473 |
COG category | [S] Function unknown |
COG ID | [COG1297] Predicted membrane protein |
TIGRFAM ID | [TIGR00728] oligopeptide transporters, OPT superfamily; [TIGR00733] putative oligopeptide transporter, OPT family |
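
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is only illustrative: it assumes the start and end positions are 1-based and inclusive (which is what the listed gene length implies) and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(2031565, 2033292)
print(length_bp)                  # 1728
print(protein_length(length_bp))  # 575
```

Both results agree with the Gene Length (1728 bp) and Protein Length (575 aa) fields.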
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.116386 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.268809 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTCTG TCAAAGGGCA GCTGACGCTG CGCGGCATCG TCATAGGCTG CGTCGGTTGC GCCATCATCA CCGCGGCATC GGTGTACACG GCCCTCAAGA TGGGGGCGCT TCCTTGGCCC ATCGTGTTCG CGGCCATCAT CTCGCTGTTC TTCCTCAAGG CCCTCGGGCA CGGCAAGGCC AGCCTCAACG AGGCGAACGT CACGCACACC GTGATGAGCG CCGGCGCCAT GGTGGCGGGC GGCCTGGCCT TCACCATCCC CGGCATCTGG ATGCTGGGCT ACGCCGACGA GGTGGGCTGG TTCGAGATGC TCCTCGTAGC CGTCTCGGGC GTCATCATGG GCCTCGTCTG CACCGCGCTT CTGCGCCGCC ACTTCATCGA GGACTCCGAG CTGGAGTACC CCATCGGCGA AGCGGCCGCC CAGACGCTCA TCGCGGGCGA TTCCGGCGGC AAGACCGGCT GGAAGCTGTT CGGCTCCATG GGCTTCGCGG GCGCGTTCAC GGCGTTGCGC GATTTCTTCG GCGTGATCCC CGCGATGCTC TTCGGCAACG CCGCGGTCCC CGGCGTTGCG TTCGGCATCT ACCTGTCGCC GATGCTTCTG GCCGTGGGCT TCCTCGTGGG CACGGGCGCC GTCGTCGTGT GGTTCGTCGG AGCGCTGCTG GCGAACTTCG GCATCATCGT GGGCGGCTCG GCCGCGGGAC TCTGGGACGT GGCGTCCGCG CAGGGCATCG TGTCCAGCCT CGGTATGGGC GTCATGATGG GCGCGGGCCT GGGCGTCATC TTCAAGAACA TCCTGCCCAA GGCCTGGCGC ATGCTGCGCG ACGCCCGCAG CTCGAACGCG TTCGGCCTCA CCGCCGCAAG CACCATGGAT GCGCCCGCAA GCGGCGCTAA GGACGGACGG CTGCGCATCG GCAGCTTCCG CCTCACCGCC GGGCTGGCCG CGCTCGCCGT GGCCGCCGTC GCGCTCATCG TCTGCTTCGG CCTGCAGCTG GGCCCCGTCC CGGCCGTCAT CGTGGTGCTG CTCGCCTTCG TCGCCACGGC CATGAGCGCG CAGAGCGTCG GCCAGACGGG CATCGACCCC ATGGAGATAT TCGGCCTCAT CGTGCTTTTG GCCGTGGCGG CCGTATCCAG CGTGCCGCAG GTGCAGCTTT TCTTCGTCGC GGGCATCGTG GCCGTGGCGT GCGGGCTGGC CGGCGACGTG ATGAACGACT TCCACGCCGG CCACGTGCTG GGCACCAGCC CCAAGGCGCA GTGGATCGGC CAGGCCATCG GCGCCGTGCT GGGCGCCTTG GTGGCCGTCG CCGTCATGGC GATCCTCGTG AGCGCGTACG GCCCCGAGTC GTTCGGCCCC ACGGCGTCGT TCGTGTCCGC GCAGGCGTCC GTCGTGGCCA CCATGGTGTC GGGCATCCCC TCGGTGCCGG CGTTCGCCAT CGGGCTTGCG GCGGGATTCG TCCTCTACCT GCTGAACTTC CCCGCCATGA TGCTGGGGCT GGGCATCTAC CTGCCCTTCT ACATGTCGCT GACCGCGTTT CTGGGAGCCA TGGCCAAAGT CGCCTACGAC GCCGTGTGCA AGCTGCGCCG CAAGGGGCTC TCGCCCGAGG CGGCCGCGGA GAAGGAGAAG GCCCAGGGCG AAACGGGCCT CGTGGTGTCG TCCGGCCTGC TGGGCGGCGA ATCCATCGTA GGCGTCCTGG TGGCGCTCGC CGCCGTGGCG ACGGGCCTGG GAGCGTAA
|
Protein sequence | MDSVKGQLTL RGIVIGCVGC AIITAASVYT ALKMGALPWP IVFAAIISLF FLKALGHGKA SLNEANVTHT VMSAGAMVAG GLAFTIPGIW MLGYADEVGW FEMLLVAVSG VIMGLVCTAL LRRHFIEDSE LEYPIGEAAA QTLIAGDSGG KTGWKLFGSM GFAGAFTALR DFFGVIPAML FGNAAVPGVA FGIYLSPMLL AVGFLVGTGA VVVWFVGALL ANFGIIVGGS AAGLWDVASA QGIVSSLGMG VMMGAGLGVI FKNILPKAWR MLRDARSSNA FGLTAASTMD APASGAKDGR LRIGSFRLTA GLAALAVAAV ALIVCFGLQL GPVPAVIVVL LAFVATAMSA QSVGQTGIDP MEIFGLIVLL AVAAVSSVPQ VQLFFVAGIV AVACGLAGDV MNDFHAGHVL GTSPKAQWIG QAIGAVLGAL VAVAVMAILV SAYGPESFGP TASFVSAQAS VVATMVSGIP SVPAFAIGLA AGFVLYLLNF PAMMLGLGIY LPFYMSLTAF LGAMAKVAYD AVCKLRRKGL SPEAAAEKEK AQGETGLVVS SGLLGGESIV GVLVALAAVA TGLGA
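
The sequences above are shown in space-separated blocks of ten characters, so they need the whitespace stripped before any analysis. The following is a minimal sketch, not the database's own pipeline, of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the listed protein. It assumes the sequence contains only A, C, G and T. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the permitted start codons, which does not matter here because the gene begins with ATG. All names in the code are illustrative.

```python
BASES = "TCAG"
# Amino acids in NCBI order (codons enumerated with bases T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
first_blocks = "ATGGATTCTG TCAAAGGGCA GCTGACGCTG"
print(translate(first_blocks))  # MDSVKGQLTL
```

Passing the full Gene sequence field to `translate` and `gc_fraction` in the same way allows the result to be compared against the Protein sequence and GC content fields.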