Gene Elen_2800 details

Gene Information

Locus tag: Elen_2800
Symbol:
ID: 8417126
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 3250177
End bp: 3251937
Gene Length: 1761 bp
Protein Length: 586 aa
Translation table: 11
GC content: 65%
IMG OID: 645025775
Product: major facilitator superfamily MFS_1
Protein accession: YP_003183136
Protein GI: 257792530
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
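
The coordinates and lengths in this table can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 3250177
end_bp = 3251937


def gene_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(length_bp):
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp, protein_length(length_bp))  # prints: 1761 586
```

Both results agree with the Gene Length and Protein Length fields listed in the table.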


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0579957
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.00000851422
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCTCCGT CTTCTTCCTC ATCTGCTTCG TCCGCCTCGC GCGGACGCGG CAAGGGGTTC 
GCGCTTGTTG CGGCCGTGTA TCTGCTGGGC CTGTTCATCG GGGCGCTCGA TACCGGCATC
GTCACGCCTG CCCGCACGGT CATCCAGAGC GATCTGGGCA TCGGCGAGCA GATGGGCGTG
TGGATCATCA CCATCTATAC GCTTGCCTAC GCGGCCGCCA TTCCGGTGAT GGGCAAGCTG
GCGGACCGTT CGGGACGCAA GTACGTGTAC CTTGCGAGCA TCCTGCTGTT CGGTGTCGGG
TCGCTTCTAT GCGGGTTGGC GCAGGACGTG GGGAGCTTTT GGATGCTGTT AGCCGCGCGC
GCCGTGCAGG CGGTGGGCGG AGGCGGCATC GTGCCTGTTG CCACGGCCGA GTTCGGCACG
ACGTTTCCTC CCGAGAAGCG CGGGCTGGCG TTGGGTCTGG TAGGCGGCGT GTACGGCATT
GCCAACATCT TCGGAGCGTC GGCCGGCAGC CTGATCCTAT CGGTGTTCGG GCAGGCCAAC
TGGCAGTTCA TCTTCTACGT GAACGTTCCC ATCTGCGCCT TCATCGTGGT GGCGGGGCTG
TTCGTGCTGC CGAACACGCG AGCCGAGCAG GTGAAGCCCA TCGACGGGTG GGGCATTGCA
GTGCTGGTGG CGATGGTGTT GTCGCTGCTG TACGGACTGA AGAACCTCGA TTTCTTCGAT
CTGGGAGCAT CTGCGACCTC GTCTGACGTG TGGCCGTTCT TGCTCGCGTT CGTCGTGCTG
CTTCCGGTGT TCGTGCTGAT CGAGCGCCGC GCGGCCGACC CGGTGCTCAA CCTGTCGTAC
TTCCGCGACC GCGACATCGT GATCACGTTG GTGCTGTCGG TGATCACCGG CGTCATCCTG
ATGGGCATCA TCTTCATCCC GCAGTTCGCC GAGAACGTGC TGAAACTGCC CTCGGGCAGC
GGCGGCTACG TAGTCATCGT GCTGGCGGCG TTCGCCGGGG TGGGAGCGCC GGTGTCGGGC
AAGCTCATCG ATCGCTTCGG CGTGAAGGCG GTGCTCGCGT TCGGGCTGGC GGCTTCGGCT
GCGGGAGCGA TGTTCCTGGC GCTGGTGGCC ACGCAGTTCG CGAATATGGC GACGCTCATC
ATCAGCCTCG TGGCCATCGG CATCGGGATG GGCTTCACGA TTGGAACGCC GCTCAACTAC
ATGATGCTGG CGAAGACGAA GGAGCGCGAG GCGAACTCGG CGCTGGCGAC GCTGTCGCTG
GTGCGCTCGG TGGGCACGGC GGTCGCGCCT GCCGTGCTGG TTGCGTTCAT CGCGCATGCG
GGCATGGCCA TTCCCGATCG CATCATGGGC GTTCTGCCCG ATGCGCCGGG CGGGCAATCG
ATTGCGCAGC TAGCGAGCGG GCAAGCGCAG GGAGGCGATG GCGCTGGGTT GCCCGACGAT
CTTCAACAGC TGATGAAGGG GTCGGATGTG ACCACGATCG TCGCGAACGT GAAGACGCTG
GCAAAAACCG AGATCGAACA GGAAGCTGCT TCGGCGGGCA TGCCGGCCGA AGCCGTCGAC
GCGGCGGAGC AACAGTACCT CGCCGCCATC GACGATCGGG CAGGCGACAT AGAAAGCACG
TTCCAGAGCA CGGTCGACGA AGGCTTTCGC GGTGCGTTTC TGCTGGTGGG GATTTGCTCG
CTCGTGGGGT TGGCGCTGCT TGCGCTCTAC CGGGAAGACA GGCCGCGGCC CGGGCAGGCG
AAGGGGGAGC CGACGCGCTG A
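
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any analysis. The following is a minimal sketch, not the method used by the database, of stripping that layout and computing a GC percentage like the one reported in the Gene Information section. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the last line of the sequence is used as a short demonstration.

```python
def clean_sequence(text):
    """Remove the spaces and line breaks used to group bases into blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Last line of the gene sequence, used here only as a small example.
last_line = "AAGGGGGAGC CGACGCGCTG A"
seq = clean_sequence(last_line)

# The final three bases should form a stop codon (TGA here).
print(len(seq), seq[-3:], round(gc_percent(seq), 1))
```

To check the reported GC content for the whole gene, pass the full gene sequence text to `clean_sequence` instead of a single line.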
 
Protein sequence
MAPSSSSSAS SASRGRGKGF ALVAAVYLLG LFIGALDTGI VTPARTVIQS DLGIGEQMGV 
WIITIYTLAY AAAIPVMGKL ADRSGRKYVY LASILLFGVG SLLCGLAQDV GSFWMLLAAR
AVQAVGGGGI VPVATAEFGT TFPPEKRGLA LGLVGGVYGI ANIFGASAGS LILSVFGQAN
WQFIFYVNVP ICAFIVVAGL FVLPNTRAEQ VKPIDGWGIA VLVAMVLSLL YGLKNLDFFD
LGASATSSDV WPFLLAFVVL LPVFVLIERR AADPVLNLSY FRDRDIVITL VLSVITGVIL
MGIIFIPQFA ENVLKLPSGS GGYVVIVLAA FAGVGAPVSG KLIDRFGVKA VLAFGLAASA
AGAMFLALVA TQFANMATLI ISLVAIGIGM GFTIGTPLNY MMLAKTKERE ANSALATLSL
VRSVGTAVAP AVLVAFIAHA GMAIPDRIMG VLPDAPGGQS IAQLASGQAQ GGDGAGLPDD
LQQLMKGSDV TTIVANVKTL AKTEIEQEAA SAGMPAEAVD AAEQQYLAAI DDRAGDIEST
FQSTVDEGFR GAFLLVGICS LVGLALLALY REDRPRPGQA KGEPTR