Gene Elen_2815 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_2815 
Symbol 
ID8417143 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp3262181 
End bp3263446 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content63% 
IMG OID645025792 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003183151 
Protein GI257792545 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0033693 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.00275134 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCTGCAA TTGCACGAGA AAAGCTCTGG ACGAGAGATT TCGTATTCGG AACCGCCGTG 
AACTTCCTGA TCATGGTGAA CTACTACGGG CTCATGGTGG TCGTAGCCGA CTACGCCATG
AAAACCTACG ATGCGCCCGC AGCCACCGCC GGCCTCGCGG CCAGTATCTT CGTCATCGGC
GCGCTGATAG CCCGCCTCTT CAGCGGGCGC ATCATGGATC GCGTCGGTCG CAAGCGCTTG
CTTATCATCG GTGCCGTGCT CGAAGTGGCG TTCTCGGCGC TTTACCTTAC CGGCCTGGGA
TTATGGCTGC TGTTCGCGCT GCGCCTGCTG CACGGCATCG CGTTCGGCAC GTGTTCCACC
GCCATCGGCA CCATCGTCAC GGCTCTTGTG CCGGACAACC GCAAAGGCGA AGGCGTGGGC
TACTATATGC TGTCCGTCAC CCTCGGCGCG GCAATCGGGC CGTTTCTGGG CATGTTTCTC
ACGCAGAACG CCGGATTCCA AACGTTGTTC CTCGTAGCCG CCGCCGTAGC TCTGGCTTGC
TTGCTAGCCG CCACGCAGCT GCGCGTGCCG AAAAACCCTG TATCGGCCGA AACCGTGGCC
CGGAAGGCGA GCGACATCGC CCGCGACGAA CGCATCGAGC AGGCGGGCGG GTTCCGCGTG
CCTCGTCCGA GCCTGACGAA CTACCTGGAA TCCAGCGTGA TCCCCATCGG CGCCGTATGC
GCGCTGCTGT TCTTCTGCTA TTCCAGCCTG CTTGCGTTCC TCACGCCGTT CGCGGCCGAA
AACGGGCTCG AAACGCCCGC GAGCTTCTTC TTCGTCGTGT ACGCCATCGC CACGTTCGTC
ACACGGCCGT TCACCGGCAA GCTGTTCGAC CGCAAAGGCG ACCGCGTGGT CATGGTGCCC
GCGTTCATCG CCTTCATCGT CGGCATGGGA CTGCTGGCCA CCGTCTACCA GCCGACGGCC
ATGCTGATCG CGGCAGCGTT GCTGGGCTTC GGCGTAGGGA CGGTTCAGGC AAGCGGCCTG
GCTCTGGCGG TGCGCCTCGC CCCCGACGAT CGACTAAGTC TGGCGAACTC CACATTCTAC
ATCCTGCTGG ACATCGGCGT TGGCGTAGGC CCGCTGCTTT TGGGCATCGT ACAGCCACTG
TGGGGCTATC GCGGCCTGTT CGAGGCCATG TCGCTAGTCG CCATCGTGGC GCTGGCAGCC
TACCTGCTAG TGAGCCGCAA AAAGGGCGCC ATGCGCCACA AGCTTGAGGA AGCGGAGAAA
CGGTAA
 
Protein sequence
MPAIAREKLW TRDFVFGTAV NFLIMVNYYG LMVVVADYAM KTYDAPAATA GLAASIFVIG 
ALIARLFSGR IMDRVGRKRL LIIGAVLEVA FSALYLTGLG LWLLFALRLL HGIAFGTCST
AIGTIVTALV PDNRKGEGVG YYMLSVTLGA AIGPFLGMFL TQNAGFQTLF LVAAAVALAC
LLAATQLRVP KNPVSAETVA RKASDIARDE RIEQAGGFRV PRPSLTNYLE SSVIPIGAVC
ALLFFCYSSL LAFLTPFAAE NGLETPASFF FVVYAIATFV TRPFTGKLFD RKGDRVVMVP
AFIAFIVGMG LLATVYQPTA MLIAAALLGF GVGTVQASGL ALAVRLAPDD RLSLANSTFY
ILLDIGVGVG PLLLGIVQPL WGYRGLFEAM SLVAIVALAA YLLVSRKKGA MRHKLEEAEK
R