Gene Elen_1101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1101 
Symbol 
ID8415391 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1331904 
End bp1333313 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content71% 
IMG OID645024064 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003181461 
Protein GI257790855 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000859946 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000000000597141 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGGCAACG CGAAGAAGCG AAGCGGAAGC GTGGCGAGCA TGCTGCCCGT GCTGCTGGCG 
TGCTCGTTCA CGGCCTCGTT CGGCCAAAGC ATGATGAACG TCGCCCTGCC CGAGCTGGCT
GAGCGCTTCG GCGTCACGCT CTCCATCGCG AACTGGGTGA TCGTGGGGTA CATGGTGGTC
GCGGCGACGG CCATCATGCT GTCGGCGTTC ATGCTGAGGC GCCTCGGGCT GAGGCGCGTG
TTCTTCGTCG GCGCGGGGGC GCTCGCGCTC GGCAGCGCGT GCGCGCTGCT CTCGCAGGAC
TTCCCGATGC TGTTCGCCAG CCGCCTCGTG CAGGCGGTGG GCACGGGCCT GTTCTTCCCG
TCGGTGACGA GCGTCATCAT GACGAACTCG CCGGCCGCGG TGCGCGGCAC GCGCCTCGCG
CTGAACAGCG GCGTCATCGC CGTGGGCCTT GCCATCAGCC CGACGGCCTC GGGATTCGCG
CTCACGCAGT TCGGCTGGCG CGCCATGTTC GTCGTATCGC TCGCCATGTC CGTCGCGCTG
CTCGCGGTCG GCTTCTTCCG CATCCACGGC GGCCCCTCGA CGAAGCGCGT TCCCATCGAC
GCGCTCAGCG TGATGCTCGG GCCGCTCGGG TTGGCCGCGT TCCTGTACGG CTTGGGCGAG
GTCACGCGCG ATCTCGCCCC CTCCCTCGCG GCGCTCGCGG TCGGCGCGGT GCTGCTCGCC
CTGTTCGCCT GGCGGCAGTT CGCGTTGGAG AGCCCGCTGC TCGACCTGCA CCCCCTCGTC
CACCCGCGGT TCGCCGTGGG CATCCTGCTC GTCATGGTGG GCATGCTCAC GTCGTTCTCC
ATGAGCATCC TGCTGCCGCT GTGCTACGAG GGGGCGCTGG GGTACACGGC GTTCTTCGCG
GGCCTGCTGC TGCTAGGCCC CGTGCTGGTC AACGCGGCGT TCACGTTTTT GGGCGGCCGG
GTGTTCGACA GGCACGGCGC GTGGCCGCTC ATACCGGCGG GCCTCGTGCT CGTGCTCGTC
GGGCAGGCGA CGGCGTTCTT CTCGGCCGAG AGCATGATCG CCATCCTGAT CGTCCTGTCG
TCGGCGGCCG TGTACGCGGG CGCCGGGTTC GTGGTGGCGC CGTCCAAGAC CGCGGCGCTC
GGCACGCTGC CGCCCGCGAC GTACTCCGCC GGCGCGTCCA TCAACTCCAC GGCCGTGCAG
ATCGCCTCGG CCATCGGCTC GTCGCTGTTC GTCGGCGTGC TGTCGGCCGA CGTGCTCAGG
GACACGGCGG CGGGCGCGGC GAAGGCGTCG GCGTACGCCG CGGCGTTCGA GCACACCCTC
TCGATAGCCG TCGTCATCGC GGCGGCGGGG CTGCTCGTCG CGTTCTTCTA CGCCCGCGCC
ATGCGCAAAC CGGCCGGTAA GCAGCGGTGA
 
Protein sequence
MGNAKKRSGS VASMLPVLLA CSFTASFGQS MMNVALPELA ERFGVTLSIA NWVIVGYMVV 
AATAIMLSAF MLRRLGLRRV FFVGAGALAL GSACALLSQD FPMLFASRLV QAVGTGLFFP
SVTSVIMTNS PAAVRGTRLA LNSGVIAVGL AISPTASGFA LTQFGWRAMF VVSLAMSVAL
LAVGFFRIHG GPSTKRVPID ALSVMLGPLG LAAFLYGLGE VTRDLAPSLA ALAVGAVLLA
LFAWRQFALE SPLLDLHPLV HPRFAVGILL VMVGMLTSFS MSILLPLCYE GALGYTAFFA
GLLLLGPVLV NAAFTFLGGR VFDRHGAWPL IPAGLVLVLV GQATAFFSAE SMIAILIVLS
SAAVYAGAGF VVAPSKTAAL GTLPPATYSA GASINSTAVQ IASAIGSSLF VGVLSADVLR
DTAAGAAKAS AYAAAFEHTL SIAVVIAAAG LLVAFFYARA MRKPAGKQR