Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1101 |
Symbol | |
ID | 8415391 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 1331904 |
End bp | 1333313 |
Gene Length | 1410 bp |
Protein Length | 469 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 645024064 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003181461 |
Protein GI | 257790855 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00711] drug resistance transporter, EmrB/QacA subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000859946 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.0000000000000597141 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGCAACG CGAAGAAGCG AAGCGGAAGC GTGGCGAGCA TGCTGCCCGT GCTGCTGGCG TGCTCGTTCA CGGCCTCGTT CGGCCAAAGC ATGATGAACG TCGCCCTGCC CGAGCTGGCT GAGCGCTTCG GCGTCACGCT CTCCATCGCG AACTGGGTGA TCGTGGGGTA CATGGTGGTC GCGGCGACGG CCATCATGCT GTCGGCGTTC ATGCTGAGGC GCCTCGGGCT GAGGCGCGTG TTCTTCGTCG GCGCGGGGGC GCTCGCGCTC GGCAGCGCGT GCGCGCTGCT CTCGCAGGAC TTCCCGATGC TGTTCGCCAG CCGCCTCGTG CAGGCGGTGG GCACGGGCCT GTTCTTCCCG TCGGTGACGA GCGTCATCAT GACGAACTCG CCGGCCGCGG TGCGCGGCAC GCGCCTCGCG CTGAACAGCG GCGTCATCGC CGTGGGCCTT GCCATCAGCC CGACGGCCTC GGGATTCGCG CTCACGCAGT TCGGCTGGCG CGCCATGTTC GTCGTATCGC TCGCCATGTC CGTCGCGCTG CTCGCGGTCG GCTTCTTCCG CATCCACGGC GGCCCCTCGA CGAAGCGCGT TCCCATCGAC GCGCTCAGCG TGATGCTCGG GCCGCTCGGG TTGGCCGCGT TCCTGTACGG CTTGGGCGAG GTCACGCGCG ATCTCGCCCC CTCCCTCGCG GCGCTCGCGG TCGGCGCGGT GCTGCTCGCC CTGTTCGCCT GGCGGCAGTT CGCGTTGGAG AGCCCGCTGC TCGACCTGCA CCCCCTCGTC CACCCGCGGT TCGCCGTGGG CATCCTGCTC GTCATGGTGG GCATGCTCAC GTCGTTCTCC ATGAGCATCC TGCTGCCGCT GTGCTACGAG GGGGCGCTGG GGTACACGGC GTTCTTCGCG GGCCTGCTGC TGCTAGGCCC CGTGCTGGTC AACGCGGCGT TCACGTTTTT GGGCGGCCGG GTGTTCGACA GGCACGGCGC GTGGCCGCTC ATACCGGCGG GCCTCGTGCT CGTGCTCGTC GGGCAGGCGA CGGCGTTCTT CTCGGCCGAG AGCATGATCG CCATCCTGAT CGTCCTGTCG TCGGCGGCCG TGTACGCGGG CGCCGGGTTC GTGGTGGCGC CGTCCAAGAC CGCGGCGCTC GGCACGCTGC CGCCCGCGAC GTACTCCGCC GGCGCGTCCA TCAACTCCAC GGCCGTGCAG ATCGCCTCGG CCATCGGCTC GTCGCTGTTC GTCGGCGTGC TGTCGGCCGA CGTGCTCAGG GACACGGCGG CGGGCGCGGC GAAGGCGTCG GCGTACGCCG CGGCGTTCGA GCACACCCTC TCGATAGCCG TCGTCATCGC GGCGGCGGGG CTGCTCGTCG CGTTCTTCTA CGCCCGCGCC ATGCGCAAAC CGGCCGGTAA GCAGCGGTGA
|
Protein sequence | MGNAKKRSGS VASMLPVLLA CSFTASFGQS MMNVALPELA ERFGVTLSIA NWVIVGYMVV AATAIMLSAF MLRRLGLRRV FFVGAGALAL GSACALLSQD FPMLFASRLV QAVGTGLFFP SVTSVIMTNS PAAVRGTRLA LNSGVIAVGL AISPTASGFA LTQFGWRAMF VVSLAMSVAL LAVGFFRIHG GPSTKRVPID ALSVMLGPLG LAAFLYGLGE VTRDLAPSLA ALAVGAVLLA LFAWRQFALE SPLLDLHPLV HPRFAVGILL VMVGMLTSFS MSILLPLCYE GALGYTAFFA GLLLLGPVLV NAAFTFLGGR VFDRHGAWPL IPAGLVLVLV GQATAFFSAE SMIAILIVLS SAAVYAGAGF VVAPSKTAAL GTLPPATYSA GASINSTAVQ IASAIGSSLF VGVLSADVLR DTAAGAAKAS AYAAAFEHTL SIAVVIAAAG LLVAFFYARA MRKPAGKQR
|
| |