Gene Elen_1836 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1836 
Symbol 
ID8416140 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2158119 
End bp2159435 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content63% 
IMG OID645024806 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003182189 
Protein GI257791583 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCATTGC TTGCCAAGAG CAAGGGAGAG GTTACCTACA AGGAGCTCTC CACCCTGCAC 
AAGTGGGCGC TCGTCGTGCT CATCTCCATG GGCTCGTCGA TCATCTACGC GCCGATGTAT
CTGAAGAACG TCTTCTACGA TCCGCTGATG CAAGCGCTCG GCGCCACCAA CGCCGACCTC
GGCCTCATGG TGTCGGCCTA CGGCATCGCC GCCATGATCT GCTACCTGCC CTCCGGCATC
GTGGCCGACA AGTTCCGCAT GCGCACGCTG GCATGGGTCG GCTTCATCGC CACCGCCGTG
CTCGTGTTCG TGTACGCCAT GCTGCCTTCC GTGCAGATCT GCCTGATCCT GTTCGTGCTC
ATGGGCGTCA CCTCCATCCT CGTGTGGTGG GGCACGCGCT TCAAGGTCAT CCGCCTGTGC
TGCGAGGAGA ACGAGTACGC CTCCAAGATC GGCATCAGCT ACTCCATCTA CGGCGTCACC
GGCCTCGTCA TCGGCCTCAT CAACGCCGGC ATCATCGCGG CCATCTCCGG CTCCGCGGGC
GTGCAGGCCA TGCTCATCTT CCTGGGCGTC GTCATCGCCG TCCTGGGCGT CGTCTCCTTC
TTCATCATCC CCGACTTCAA GGGCGAGATC AATAAGGACG CCAAGCTGTT CAGCGTCAAG
GAGGCCATCC AGGCCATCAA GCACCCCGGC GTCATCTGGG CCTGCGTCGC GTACTTCGCC
TGCTACGCCG TGTACCAGGG CGCTACCTAC ACCACGCCGT ACCTCACGCA GTGCTTCAAC
GCCGACGGCA ACCTCGTGAA CATCGTCGGC CTCATCCGCA CCTACGGCAT CGGCCTCATC
GCCGGCCCCA TTGTCGGCTT CATCGCCACG AAGATCAAGA GCCCCTCGAA GACCATCCTG
GGCGGCTTCA TCCTGTCCAT CGCGGTACTC GTCGGCTTCA TCCTGTTCCC GCAGGATCCC
TCCGGCGCCA TGGTCGCCTC CATCCTCGTG GTCGTGTTCG GCTTCACCAC CTACGGCGCC
TTCTCCATCG GCTCCTCGCC GCTGTCCGAG GTCAAGATCC CCATGGCCAT CTTCGGCACC
GCCTCCGGCC TGCTGTCCGT CATCGGCTTC CTGCCTGACG TGTTCATCCA CACCTGGTAC
GGCGGCATGA TCGACGCCCA GGGTACGGCA GCGTTCTCCA GCATCTTCGG CTTCGAGATC
ATGTTCGGCG TCATCGGCTG CATCGCGCTG GTCATGCTGC TCCGCTCCAT CAAGAAGCAC
TTCGGCGCCT CCGACGCGGT CGCGGCCGCA GAGGACGGCG AGTCCGCCAA GGCGTAA
 
Protein sequence
MSLLAKSKGE VTYKELSTLH KWALVVLISM GSSIIYAPMY LKNVFYDPLM QALGATNADL 
GLMVSAYGIA AMICYLPSGI VADKFRMRTL AWVGFIATAV LVFVYAMLPS VQICLILFVL
MGVTSILVWW GTRFKVIRLC CEENEYASKI GISYSIYGVT GLVIGLINAG IIAAISGSAG
VQAMLIFLGV VIAVLGVVSF FIIPDFKGEI NKDAKLFSVK EAIQAIKHPG VIWACVAYFA
CYAVYQGATY TTPYLTQCFN ADGNLVNIVG LIRTYGIGLI AGPIVGFIAT KIKSPSKTIL
GGFILSIAVL VGFILFPQDP SGAMVASILV VVFGFTTYGA FSIGSSPLSE VKIPMAIFGT
ASGLLSVIGF LPDVFIHTWY GGMIDAQGTA AFSSIFGFEI MFGVIGCIAL VMLLRSIKKH
FGASDAVAAA EDGESAKA