Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0210 |
Symbol | |
ID | 8414494 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 292669 |
End bp | 294447 |
Gene Length | 1779 bp |
Protein Length | 592 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 645023190 |
Product | drug resistance transporter, EmrB/QacA subfamily |
Protein accession | YP_003180593 |
Protein GI | 257789987 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00711] drug resistance transporter, EmrB/QacA subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.852197 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAGGCA TAGACCGGAC GGCGCTCGCG CCCGCGGACG CGGCGCGCGT GCACGAAAGC GACGAACCCG AAACCACCCC GCTGACCTCG ACCGTCGACG ATGCCGAGCG CGCGGCGGCC TTTGATGCGC CTTCGACCGC AGCACGCTCC GGCCATGTCT CGGCCTCGGC CACGTCCGAC GCGGCCCGCG ACGCGGCGGC GCGCACCCGC GCGCACGAGC TTGCCGCAGC GTCGGCGCGG CAGGGCGGCG TGCCCGCGAG CGAGCCCAAA CCCGAGCATT CGCGCGGCGT GGCCGCCGTG TTCGCTGGCT TGCTGCTGGC CATGTTCGCC AGCACGCTGT CCGAGACGGT GACGGCCACC GCGCTTCCCA CCATCGTGGG CGATCTGGGC GGCGTCGACC ATATGCAGTG GGTGACCACG GCGTACATCC TGGCGTCCAC CGTCATGATG CCAATCTACG GCAAGCTGGG CGACCTGTTC GGCCGCAAAT ACCTGTTCAT CATCGCGCTG TCCATCTTCA TCGTGGGGTC GGCCACGTGC GGGCTCGCGC CCAGCATGGA CGGCCTCATC GCGGGCCGCG CCGTGGAGGG GCTCGGCGGA GGCGGCCTTA TCATCCTGGC GCAAGCTACC ATCGCCGACA TCATCCCGCC GCGTCAGCGA GGGAAGTACA TGGGCCTCAT GGGCTCGGTG TTCGCGGTGT CCACGGTGGT GGGGCCGCTG TTGGGCGGCT GGTTCGTGCA GGTGACGGGG TGGCGCTGGC TGTTCGCGTT CAACATCCCG CTGGCGCTGC TGGCCATCGC GGCCGTGGCG TTCTTCCTGA CGAAGCCCGT GCGGCGCGAC GACCGCCCGC CTATCGACAT CGGCGGCATG ATGGCCATGG CCGTGTCGGT GTCGTCGCTG GTGCTGGCCA CGGCGTGGGG CGGCACGCTG TTTCCGTGGA TCTCGCCGCA GATATTCGCG CTGTTCGCGC TGTTCTTCGT GGCGGCGGTT GCGTTCGTAC TGGTGGAACG CAAGGCGAAG GAGCCCATCA TCCCCATGCT GCTGTTCAAG AACCGCAACT TCGTGGTGTG CACGGTCACC GGAATGTTCA TCATGCTGGG CATGATGGGA ACCATATCGT ACCTGCCCAC GTACTTCCAG ATCGTCGAGG GGCTGGCGCC CGAGCAGGCG GGTCTCATGA CGGTGCCGAT GATGGCGGGC GTGCTGATCA CGGCGGTGGG CACGGGCTTC CTCGCCACGA AAACGGGCCG CTACAAGTGG ATGCCCATCG CGTCGTGCGC CGTGACGGCC GTGGGGTTCG TGCTGCTGTC GCAGCTGACC GTGGGCACGC CGCTGGTGGT GACGGGCGTG TTCCTGTTCG TGCTGGGCTT CGGCATCGGT CTGGGCCAGC AGATCCTCGT GCTCATCGTG CAGAACGAGT TCCCGCACGC CATCGTGGGC ACGGCGACGG CGGCGAACAA CTTCTTCCGG CAGATCGGCT CCACGCTGGG CGCGTCGCTG GTGGGCGCGC TGTTCACGTC GCGCCTGACT GCCGACTTGG CCGCCAAGCT GCCGCACGTG GACAACATCA ACATGAACCG CATCACGCCC GATTTCGTGC AGCACCTCGA CAGCGGCACG CGCGCCATCA TCACCTCGGC GTACAGCGAC GCGCTCGTGC CTATCTTCCT GTACGTGGTG CCGTTGCTAG TGGTGGGCTT CGTGCTGATG CTCACGCTCA AGGAGCACCC GTTGGCCACG AAGGTGAACC ACACCGGCCA CCCTGGCGAC ACGGTGTAG
|
Protein sequence | MEGIDRTALA PADAARVHES DEPETTPLTS TVDDAERAAA FDAPSTAARS GHVSASATSD AARDAAARTR AHELAAASAR QGGVPASEPK PEHSRGVAAV FAGLLLAMFA STLSETVTAT ALPTIVGDLG GVDHMQWVTT AYILASTVMM PIYGKLGDLF GRKYLFIIAL SIFIVGSATC GLAPSMDGLI AGRAVEGLGG GGLIILAQAT IADIIPPRQR GKYMGLMGSV FAVSTVVGPL LGGWFVQVTG WRWLFAFNIP LALLAIAAVA FFLTKPVRRD DRPPIDIGGM MAMAVSVSSL VLATAWGGTL FPWISPQIFA LFALFFVAAV AFVLVERKAK EPIIPMLLFK NRNFVVCTVT GMFIMLGMMG TISYLPTYFQ IVEGLAPEQA GLMTVPMMAG VLITAVGTGF LATKTGRYKW MPIASCAVTA VGFVLLSQLT VGTPLVVTGV FLFVLGFGIG LGQQILVLIV QNEFPHAIVG TATAANNFFR QIGSTLGASL VGALFTSRLT ADLAAKLPHV DNINMNRITP DFVQHLDSGT RAIITSAYSD ALVPIFLYVV PLLVVGFVLM LTLKEHPLAT KVNHTGHPGD TV
|
| |