Gene Elen_2514 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_2514 
Symbol 
ID8416838 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2941314 
End bp2942633 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content67% 
IMG OID645025495 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003182858 
Protein GI257792252 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGCAT CGACCGCGCC CCTGAGCGCA GCCGGTCGCG AGCTGCCGAA GCGCTGGCTT 
GCGATCATCG CCACCATCTG GGGCGGACAA GCCGCTTCTA TGATCACCAG CTACGCCGCG
GGATACGCCG TGGTGTGGTA CATCACCGAG ACTACGGGCA GCGCCATCAT GCTGGCCGCG
GCCGCCATCT GCGCGTATCT GCCGCAAGGA CTGCTGTCAC CCTTCGGCGG CGTCATCGCC
GACAAGCACA ACCGCAAGAC GGTCATGATC GTGGCCGACC TCTCGGTGGG CATCGTGTCG
CTGGGGCTGG GCATCGTCAT CCTGTTCGGG CAGGTATCGT TTCCGCTGCT CATGATCCTC
GTCATCGTGC GCAGCATCGG ACAGGCCTTC CACGGCCCCG CCATGATGGC CGCCATGCCG
CTGCTCGTGC CCGAGAAGCA CCTGCTGCGC ATCAACACGC TCGACCAGCT GCTCATGTCC
GTCGCGTCCA TCGGCGCCCC CGCGTTCGGC ATCTTCCTGT ACACCACCAT CGGGTTCCAC
TCCGTGATGT TCCTCGACTT CGCCGGAGCA CTCGTGGCCG TGGCGGGGCT TGCGCTGGCG
AAGATCCCCA CCGTCGCCGA TGAGACCGCC GAGAACCAGC ACGTGCTGGC GAACCTTCGC
GACGGCTGGA AAGCGCTGTC GGCAACCCGC GGGCTCGTCA TCCTCATCGC CGGCATCACC
ATCGGCATGA TGGCGTTCGC CCCGCTGGGC GCCATCTTCC CGCTTATGAC GTACGACCAC
TTCGGCGGTG ACGGCTACAT GGCGTCCGTC GTGGAAGCCG CGTTCGGCGT AGGGATGATC
GCGGGGTCCA TCGTGCTCAT GGCCTGGGGC GGCGGCAAGC GCCTGGCCGG ACTCATCGCG
GTGGCCTCCC TCATCGTGGG TGTCACCACC ACGGCGTGCG GATTCCTCGC CCCCACCATG
TTCTGGGCGT TCGTTGCGTT GTGCGCCGTC ATGGCGCTGG CATGCTCCTG GTTCAACGGA
CCGCTGATCA CGCTCATCCA GCGCAACGTG CCCGAGGAGA AGACGGGCCG CGCGCTGGGC
CTGGCCATGG CCGCCATGGG ACTGGCCTCC CCCGTGGGCA TCGCCATCGG CGGCGTGGCC
GCCGAAGCTA TGGGCGTAGC AGCGTTTTTC GTGGCGGACG GCCTCGTATG CATTGCGCTG
GGACTGACCG TGTACCTGTT CAAGAGCGTG CGCGCCCTCG ATCATGACGA GCCGCACGTC
CTGCGCGACA GCGCGGCCGA GGCGAACGCG GAGATCGGAA CAGTCTCGGA AGAGGGCTAG
 
Protein sequence
MTASTAPLSA AGRELPKRWL AIIATIWGGQ AASMITSYAA GYAVVWYITE TTGSAIMLAA 
AAICAYLPQG LLSPFGGVIA DKHNRKTVMI VADLSVGIVS LGLGIVILFG QVSFPLLMIL
VIVRSIGQAF HGPAMMAAMP LLVPEKHLLR INTLDQLLMS VASIGAPAFG IFLYTTIGFH
SVMFLDFAGA LVAVAGLALA KIPTVADETA ENQHVLANLR DGWKALSATR GLVILIAGIT
IGMMAFAPLG AIFPLMTYDH FGGDGYMASV VEAAFGVGMI AGSIVLMAWG GGKRLAGLIA
VASLIVGVTT TACGFLAPTM FWAFVALCAV MALACSWFNG PLITLIQRNV PEEKTGRALG
LAMAAMGLAS PVGIAIGGVA AEAMGVAAFF VADGLVCIAL GLTVYLFKSV RALDHDEPHV
LRDSAAEANA EIGTVSEEG