Gene Elen_2302 details


Gene Information

Locus tag: Elen_2302
Symbol:
ID: 8416626
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 2705684
End bp: 2706919
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 72%
IMG OID: 645025287
Product: major facilitator superfamily MFS_1
Protein accession: YP_003182650
Protein GI: 257792044
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
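
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2705684
end_bp = 2706919
gene_length_bp = 1236
protein_length_aa = 411

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: an unspliced CDS has one codon per residue plus one stop codon.
codons = span // 3
expected_protein_length = codons - 1

print(span, span % 3, expected_protein_length)  # prints: 1236 0 411
```

Under these assumptions the reported gene length (1236 bp) and protein length (411 aa) agree with the coordinates.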


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAAAA GCCTGCTCGC CCTCGCCACC GGATCGTTCG CGCTCGGATT CGCCGAGTTC 
GTGATGATGG GCATCCTCCC CGTCACGGCC TCCGGCCTGT ACGTCAGCGT GCCGGCGGCC
GGCACCTTCA TCTCCGCATA CGCCCTCGGC GTGTGCGTGG GCACCCTGTT CCTCGTGTTC
GGCCGACGCG TGCCGCCCAA GCGGCTGCTG CTCGGGTTCG TGGCGCTCGT GGCGCTCGGC
AACGCCGCGG CCGCCCTCGC ACCGAACGCC GAGGTGCTCG TGGCGGCGCG CTTCGTGTCC
GGCCTGCCAC ACGGCGCGTT CTTCGGCACC GCCACCATCG TGGCGCGCGA GCTAGCCGAA
CCCGGCCGCG AGGGTCAGGC CGTGTCCATC ATGGTGCTGG GACAGACGGT GGCGAACATG
GTGGGCGTGC CTGGCGGCAC GCTGCTGGCC GGCCTCGTGT CGTGGCGCGC GGCGTTCGTG
TTCGTCGCCG TGTGGGCGCT CGGCTCGTTT GCGCTCGTCG CGCGCCTCGT GCCCGCCGTG
CGCCCCATCC CCGACGCGGG CTTGGCGGGC CAGTTCCGCT TCCTCAAGAA GCCGGGCCCC
TGGCTGGTCA TCGGCGCGGT GCTGCTGGGC AACACCGGCG TGTTCTGCTG GTGGAGCTAC
GTGTCGCCGT GGCTGACGGA CATCGGCGGC TTCCCGTCCG ACGCGCTGCC GGCGCTGCTC
GCGCTGGCGG GCTTCGGCAT GGTGGTCGGC TCGCTCGTGG GCGGCCGGCT CACCGACCGC
ACGTCGCCCG GCAAGATGGC GGCGGCCGGC CAGGCCATCG GCTGCATCAC GCTCGCGCTC
ATCTTCGCGT TCTCGGGCGC GCCCGCCACG GCGGCCGGGC TCATGTTCCT GTGCGCCTTC
GGCATGTTCT TCGTGTCGAG CCCCCAGCAG CTGCTCATGG TGAAGGTGGG GCGCGGCGGC
GGCGAGATGA TCGGGTCGGC GTGCGTGCAG GTGGCGTTCA ACCTGGGCAA CGCGTTCGGC
GCCACCATCG GCCAGGCCGT GCTCAACGCC GGAGCGTCCT ACGCCTCGCC GAGCCTGGCG
GGCGTGCCCT TCTCGCTTGC GGCCGTCGCG CTGCTCGCGG TGTTCGCCGC CCGCTACGAG
CGCCGCTACC GCGCAGCCGG CGCGCCGGAC GGCATCGACG TGCACGACGC CCCGGAAACG
CCGTCGTGCA CGCAGCCCGC TTTTCGCCTA GAATGA
 
Protein sequence
MKKSLLALAT GSFALGFAEF VMMGILPVTA SGLYVSVPAA GTFISAYALG VCVGTLFLVF 
GRRVPPKRLL LGFVALVALG NAAAALAPNA EVLVAARFVS GLPHGAFFGT ATIVARELAE
PGREGQAVSI MVLGQTVANM VGVPGGTLLA GLVSWRAAFV FVAVWALGSF ALVARLVPAV
RPIPDAGLAG QFRFLKKPGP WLVIGAVLLG NTGVFCWWSY VSPWLTDIGG FPSDALPALL
ALAGFGMVVG SLVGGRLTDR TSPGKMAAAG QAIGCITLAL IFAFSGAPAT AAGLMFLCAF
GMFFVSSPQQ LLMVKVGRGG GEMIGSACVQ VAFNLGNAFG ATIGQAVLNA GASYASPSLA
GVPFSLAAVA LLAVFAARYE RRYRAAGAPD GIDVHDAPET PSCTQPAFRL E
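
The relationship between the gene sequence and the protein sequence can be checked by translating the DNA. The following is a minimal sketch, not the database's own method: the helper names (`clean`, `translate`, `gc_percent`, `CODON_TABLE`) are hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ only in permitted start codons). Only the first and last lines of the gene sequence are used here.

```python
from itertools import product

# Hypothetical helper names; not part of the source record.
# Codon order follows the conventional T, C, A, G layout of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove the spaces and line breaks used in the 10-character display blocks."""
    return "".join(seq.split()).upper()

def translate(dna):
    """Translate in frame 1; a trailing partial codon is ignored, '*' marks a stop."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

def gc_percent(dna):
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)

# First and last lines of the gene sequence shown above.
first_line = "ATGAAGAAAA GCCTGCTCGC CCTCGCCACC GGATCGTTCG CGCTCGGATT CGCCGAGTTC"
last_line = "CCGTCGTGCA CGCAGCCCGC TTTTCGCCTA GAATGA"

print(translate(first_line))  # prints: MKKSLLALATGSFALGFAEF
print(translate(last_line))   # prints: PSCTQPAFRLE*
```

The first result matches the opening 20 residues of the protein sequence, and the second matches its final 11 residues followed by the stop codon. The last line stays in frame because the 20 preceding lines total 1200 bases, a multiple of three. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.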