Gene Elen_2885 details

Gene Information

Locus tag: Elen_2885
Symbol:
ID: 8417216
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 3350028
End bp: 3351302
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 65%
IMG OID: 645025863
Product: major facilitator superfamily MFS_1
Protein accession: YP_003183219
Protein GI: 257792613
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID:
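
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the reported gene length includes one stop codon; both assumptions are consistent with the numbers in this record but are not stated by it. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3350028
END_BP = 3351302
GENE_LENGTH_BP = 1275
PROTEIN_LENGTH_AA = 424


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Amino-acid codons in a CDS, assuming the length includes one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# 3351302 - 3350028 + 1 = 1275 bp
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
# 1275 / 3 = 425 codons, minus the stop codon = 424 aa
assert coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under those assumptions the reported gene length (1275 bp) and protein length (424 aa) agree with the start and end coordinates.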


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGAACTG AAACAATGAA GGGCAGCAAC TACGCATGGG CGATCGCCGT CGCCTGCGTA 
GCCTTCTACG CCATCCCCTT GGGCGTCGTG GCGAACCAGG CCGGTTTGTT CGCATCGCCC
GTCATGGAAG AGTTCGGCTG GTCGCGCACC GACGCGACGC TGTACATGTC CATCCAGCCG
TGGGTGGCGG CTATCTGCAC GCCGTTCGCC GGCAAGCTCA TCTCCAGGTT CAACCCTCGC
TGGGTGATGA CCGCCGCGGC CGCCGTCTTC GGCTTGGCTT CGCTGGCCTG TGCTTGGTTC
ACCGAGCCGT GGCAGTGGAA CGTGTACGGC GTGCTGTACG GCGCGTCCGC CGCGTTCTGG
ATGTACATCG CCACGCCGAC GTTCATCAAC CGTTGGTTCG CTAAGAGCAA CGGCACCGTC
ATCGGCGTCA TCGGCGTGTG CGCGTCGCTG CTGGGCGCGT TCATGAGCCC GGTCATCCAG
GGCTGGATCA GCGGCTACGG CTGGCACACC GCCCGTATCA TCATCAGCGT GATCGCGCTC
GTCGCGTCCG TCGTGCTGAC CGCCGCGCTG CTGCGCGAGT CGCCCGAGAA GATGGGCGTG
CTTCCCTGGG GCTACGGCGC CGCCGAAGTT GCGTCCGCGA AGTCCGAGGC CAAGTCCGTC
ATCGACGTCG CCGCTGACGA AGGCGCCACG GCCGCGCAGG CTCGCAAGAA CCCGGCGCTG
TGGCTGCTCA TCATCATGGC AGGCTTCTTC GTCATCGCCG CCGGCATGAT GCAGCAGTTC
TCGTCCTATG CATCCACCGG CGCGCTGGGC GCGGCCGTGG GCGCCATGGG CGTGACCGTG
TGCATGATCG GCCAGCTGTT CGGCAAGTTC GGTCTGGGTT GGCTGTGCGA CCACACGGGC
GCCCGCGTCT CCGGCGTGGT CGCCAGCATC TTCGGCGCCG CCGGCATCGC CATCGTGCTG
TTCAGCGTCG ATAACGCCAT GATGTTCTAC GTGGGCGTGT TCCTGTTCGG TATCGGCTTC
GCCGCGCTCA ACATCGTGCC GCCTATGGCC TGCCGCCAGG CGTTCGGCCA GAAGGACTAC
GCCAACATCT TCTCGATGGT GGCCACCGGC CTCAACGTGT TCTCCGGTTT CTCGGCGCTC
ATCTACGCGC AGATCTTCGA CATCACCGGA TCGTTCGCCG GCTGCTTCTA CCTCATCATC
GGCTTCTACG TGGTGACGCT CATCTGCTCG CTCGTGATCG TTCCCATGGG CCGTCGCTCC
TGGGCGAAGA AGTAA
 
Protein sequence
MGTETMKGSN YAWAIAVACV AFYAIPLGVV ANQAGLFASP VMEEFGWSRT DATLYMSIQP 
WVAAICTPFA GKLISRFNPR WVMTAAAAVF GLASLACAWF TEPWQWNVYG VLYGASAAFW
MYIATPTFIN RWFAKSNGTV IGVIGVCASL LGAFMSPVIQ GWISGYGWHT ARIIISVIAL
VASVVLTAAL LRESPEKMGV LPWGYGAAEV ASAKSEAKSV IDVAADEGAT AAQARKNPAL
WLLIIMAGFF VIAAGMMQQF SSYASTGALG AAVGAMGVTV CMIGQLFGKF GLGWLCDHTG
ARVSGVVASI FGAAGIAIVL FSVDNAMMFY VGVFLFGIGF AALNIVPPMA CRQAFGQKDY
ANIFSMVATG LNVFSGFSAL IYAQIFDITG SFAGCFYLII GFYVVTLICS LVIVPMGRRS
WAKK
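
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch, not the database's own method, of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the protein sequence. It assumes the codon assignments of translation table 11 match the standard genetic code (the table differs in its permitted start codons, which does not matter here because this gene begins with ATG). The names `clean_sequence`, `gc_fraction` and `translate` are illustrative.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(dna), 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and the final line of the gene sequence above.
print(translate("ATGGGAACTG AAACAATGAA GGGCAGCAAC"))  # MGTETMKGSN
print(translate("TGGGCGAAGA AGTAA"))                  # WAKK
```

The two printed fragments match the first ten and last four residues of the protein sequence in this record. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the reported GC content.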