Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2815 |
Symbol | |
ID | 8417143 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 3262181 |
End bp | 3263446 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 645025792 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003183151 |
Protein GI | 257792545 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0033693 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.00275134 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCTGCAA TTGCACGAGA AAAGCTCTGG ACGAGAGATT TCGTATTCGG AACCGCCGTG AACTTCCTGA TCATGGTGAA CTACTACGGG CTCATGGTGG TCGTAGCCGA CTACGCCATG AAAACCTACG ATGCGCCCGC AGCCACCGCC GGCCTCGCGG CCAGTATCTT CGTCATCGGC GCGCTGATAG CCCGCCTCTT CAGCGGGCGC ATCATGGATC GCGTCGGTCG CAAGCGCTTG CTTATCATCG GTGCCGTGCT CGAAGTGGCG TTCTCGGCGC TTTACCTTAC CGGCCTGGGA TTATGGCTGC TGTTCGCGCT GCGCCTGCTG CACGGCATCG CGTTCGGCAC GTGTTCCACC GCCATCGGCA CCATCGTCAC GGCTCTTGTG CCGGACAACC GCAAAGGCGA AGGCGTGGGC TACTATATGC TGTCCGTCAC CCTCGGCGCG GCAATCGGGC CGTTTCTGGG CATGTTTCTC ACGCAGAACG CCGGATTCCA AACGTTGTTC CTCGTAGCCG CCGCCGTAGC TCTGGCTTGC TTGCTAGCCG CCACGCAGCT GCGCGTGCCG AAAAACCCTG TATCGGCCGA AACCGTGGCC CGGAAGGCGA GCGACATCGC CCGCGACGAA CGCATCGAGC AGGCGGGCGG GTTCCGCGTG CCTCGTCCGA GCCTGACGAA CTACCTGGAA TCCAGCGTGA TCCCCATCGG CGCCGTATGC GCGCTGCTGT TCTTCTGCTA TTCCAGCCTG CTTGCGTTCC TCACGCCGTT CGCGGCCGAA AACGGGCTCG AAACGCCCGC GAGCTTCTTC TTCGTCGTGT ACGCCATCGC CACGTTCGTC ACACGGCCGT TCACCGGCAA GCTGTTCGAC CGCAAAGGCG ACCGCGTGGT CATGGTGCCC GCGTTCATCG CCTTCATCGT CGGCATGGGA CTGCTGGCCA CCGTCTACCA GCCGACGGCC ATGCTGATCG CGGCAGCGTT GCTGGGCTTC GGCGTAGGGA CGGTTCAGGC AAGCGGCCTG GCTCTGGCGG TGCGCCTCGC CCCCGACGAT CGACTAAGTC TGGCGAACTC CACATTCTAC ATCCTGCTGG ACATCGGCGT TGGCGTAGGC CCGCTGCTTT TGGGCATCGT ACAGCCACTG TGGGGCTATC GCGGCCTGTT CGAGGCCATG TCGCTAGTCG CCATCGTGGC GCTGGCAGCC TACCTGCTAG TGAGCCGCAA AAAGGGCGCC ATGCGCCACA AGCTTGAGGA AGCGGAGAAA CGGTAA
|
Protein sequence | MPAIAREKLW TRDFVFGTAV NFLIMVNYYG LMVVVADYAM KTYDAPAATA GLAASIFVIG ALIARLFSGR IMDRVGRKRL LIIGAVLEVA FSALYLTGLG LWLLFALRLL HGIAFGTCST AIGTIVTALV PDNRKGEGVG YYMLSVTLGA AIGPFLGMFL TQNAGFQTLF LVAAAVALAC LLAATQLRVP KNPVSAETVA RKASDIARDE RIEQAGGFRV PRPSLTNYLE SSVIPIGAVC ALLFFCYSSL LAFLTPFAAE NGLETPASFF FVVYAIATFV TRPFTGKLFD RKGDRVVMVP AFIAFIVGMG LLATVYQPTA MLIAAALLGF GVGTVQASGL ALAVRLAPDD RLSLANSTFY ILLDIGVGVG PLLLGIVQPL WGYRGLFEAM SLVAIVALAA YLLVSRKKGA MRHKLEEAEK R
|
| |