Gene Elen_1186 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1186 
Symbol 
ID8415477 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1421984 
End bp1423237 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content70% 
IMG OID645024149 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003181545 
Protein GI257790939 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.790878 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACACCG AAAACCGCAC GAGCCGCAAG CGCAACGCGT TCGCAACCGC GTTGCTGACG 
ACCCTCGCGT TCGCACTCGG GTTCGCCGAG TTCGTGCTGA TAGGCATCGT GCCCGACGTA
GCCGAAGGGC TGGGCGAGCC GCTCACGCTC ATCGGCGATC TCGTGGGCTA CTACGCGCTG
GCCTGCGCCG TCGCCACGCC CGTCATAGCG CTCGCCACGG CGCGCGCCGC CCGTTTCAAG
GTGATGGCGG CGCTGCTGGT GGTGTTCAAC GCCGGCAACC TGCTCACCCT GTTCGCCGAC
GGCTACGCGC TGCTGCTCGT CTCGCGCGTG CTGCCCGCCG TCACGTCCGG CACGCTTCTG
GCCCTCGCGC TCACCTACGT GCCCGACATC GTCGAGCCCA AGCGCGTTGC CGCGGTGCTG
GGGCTCGTGC TTGCCGGTTT CTCGGTGTCG AGCGTCGTCG GCGTGCCCAT CGGCACGGCG
CTGGCCGGCC TGTTCGACTG GAAGGCCGCC TACGCCTGCG TGTTCGCGCT CGGCCTCGCG
GCGAGCGTCG TCCTGCTGCC GACGCTGCCC CGCACTCCCG CGCATACGGG AGACGCCGCC
CCCACTCTCC GCTCGCAGCT GCGCCTGCTC GCCGACAGCC GCGTGCTGAC GAACATCGCC
ATGATTCTCG CCGGCGTAGC GTCCACCTAC GTGTTCTACA CCTACCTCGC CCCTATCCTT
GCCGACATCG CCGGCCTCGA CGCCGCAGGA TCGAGCTTCG TGCTGTTGCT GTTCGGCGCG
GCATGCGTGG GATCGAACCT GCTGTCGGGC TGGATTGCCG GACGCTTCGG GCTGCGCGCG
CTGCCCGTCG CCTTCGCCGC CCACGCGGCG CTTCTGGCCC TGCTGGCCGT GAGTCTGCCC
GCAGGCGCTG TCGGTATCGC GAACATCCTC GCGGTGGGAT TGCTCATGTA CGTGATGAAC
TCCACCGTGC AGATGCTGTT CCAGAGCGTC GCGCGCACCG ACTACCCCAG CGCGCTCACG
TTCTCGGCGT CGCTGCATCC CATGTCGTTC AACACGGGCA TCGCGCTGGG CTCGTTCGCG
GGCGGCCTCG TGATGAACGC GGGCGGGCTT CTGGCCACCG GCCCCGCAGG CGCGCTGTTC
GCCCTGACCG CCGCCGCGTT GGCGCTGGCG CTCGTGCGCA TGACCGCGCG TCGCAGCGTC
GAAATCGCCG CCGAGGCCGC CATCACTGCC GCGGCTGTCC GCGAGTCGAG GTAG
 
Protein sequence
MDTENRTSRK RNAFATALLT TLAFALGFAE FVLIGIVPDV AEGLGEPLTL IGDLVGYYAL 
ACAVATPVIA LATARAARFK VMAALLVVFN AGNLLTLFAD GYALLLVSRV LPAVTSGTLL
ALALTYVPDI VEPKRVAAVL GLVLAGFSVS SVVGVPIGTA LAGLFDWKAA YACVFALGLA
ASVVLLPTLP RTPAHTGDAA PTLRSQLRLL ADSRVLTNIA MILAGVASTY VFYTYLAPIL
ADIAGLDAAG SSFVLLLFGA ACVGSNLLSG WIAGRFGLRA LPVAFAAHAA LLALLAVSLP
AGAVGIANIL AVGLLMYVMN STVQMLFQSV ARTDYPSALT FSASLHPMSF NTGIALGSFA
GGLVMNAGGL LATGPAGALF ALTAAALALA LVRMTARRSV EIAAEAAITA AAVRESR