Gene Elen_0371 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_0371 
Symbol 
ID8414655 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp479020 
End bp480297 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content67% 
IMG OID645023348 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003180751 
Protein GI257790145 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00880] Multidrug resistance protein 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACTAC ACTCGTGGCT CGCACGGCCT CAACGGCGGT TGGGGCCGCT GGGGCTCATC 
GTGCTGCTGG TCATCACGTC GCTGGTCACG CCGCTTTCGC TGGACATGTA CACGCCGGCC
GTCCCGCACA TGACCGAGCA TTTCAACACG TCGGAGAGCA TGGTGAACCT CACGCTGGTG
GGCTACTTCC TGTTCTTCGC CGTCGGGCTG CTGGCGTTCG GGCCCGCAAG CGACCGCTAC
GGACGCAAAC CCGTGCTGCT GGCAGGCATT CTCACGTATG CGCTGGCCAG CGCTTTATGC
GCGCTGTCAG TGGACATCGT CATGCTCATC GCCACGCGCA TCCTGCAAGC CTTGGGCGCC
GGGGCGGTGA GCGCGGTGTC CACGGCGGTG GTGAAGGACG CCGTCGTCCC CGAACGGCGC
GAGGCTCTGC TGTCCGTCGT GCAGGTGATG TTCGTGGTAG GGCCCGTGCT GGCGCCGGTG
GCGGGCGCGC TCATCCTGCA GGTCGCCGAC TGGCGCATGA CGTTCTGGGT ACTGGCGGGC
ATCGGCCTTC TGTGCGCCGG GCTGGCGCTG CTGTTCGACG AGACGCTTCC CGTCAGCGAA
CGCTACGAGG GCACCGTGCT GGGAAGCGTG AAGCAGCTGG GCGCGGTGGC GCGCAACAAG
GGGTTCTCGG CGTTCCTGGG CATCGTCGGG CTGTACAACC TGCCGTTCAT GGCGTACATC
GCCGTCGGTT CGTACGTGTA CATCACGTTC TTCGGGCTGA CCGAGCTGGA GTACAGCATG
TACTTCGCGT TCGCCGCGCT GCTGACGGCT GCGGGGCCGT TCATCTGGCT TGCGGCCTCG
CGGTTCATGT CCGCGCGGCG GTTCACCAGC ATCCTGCTGG GCATCGCGCT GGCGTCCGGC
GCGGCCATGC TGGCCGTGGG CCAAGCGAGC CCGCAGCTGT TCTGCATCAC GTTCCTTGCG
TTCGCGCTGA CGGAGGCTGC CGTTCGGCCG TACAGCACCA ACGTCCTGCT GTCGCAGCAG
GAAGGCGATA CCGGAGCCGC ATCGTCGCTG ATCAACTTCG CGCACACCGC CATCGGCTGC
GTCGGCATGC TTGCCGCCGT ACTGCCGTGG CCGAACTACG TGGTAGGCGT GGGCGTCATC
ATCGTCGGTT CGATGGGCGT CGCCATCGCC GGCTGGGTGG CTCTGCTGCG CTCGAACGTA
CCGCTCCGAG GCATCAAAGA TGCAGGAGAC GAACCAACAG CGGCACCCGA CAGCGACCTT
GCCCCGCAAG AACGCTAG
 
Protein sequence
MALHSWLARP QRRLGPLGLI VLLVITSLVT PLSLDMYTPA VPHMTEHFNT SESMVNLTLV 
GYFLFFAVGL LAFGPASDRY GRKPVLLAGI LTYALASALC ALSVDIVMLI ATRILQALGA
GAVSAVSTAV VKDAVVPERR EALLSVVQVM FVVGPVLAPV AGALILQVAD WRMTFWVLAG
IGLLCAGLAL LFDETLPVSE RYEGTVLGSV KQLGAVARNK GFSAFLGIVG LYNLPFMAYI
AVGSYVYITF FGLTELEYSM YFAFAALLTA AGPFIWLAAS RFMSARRFTS ILLGIALASG
AAMLAVGQAS PQLFCITFLA FALTEAAVRP YSTNVLLSQQ EGDTGAASSL INFAHTAIGC
VGMLAAVLPW PNYVVGVGVI IVGSMGVAIA GWVALLRSNV PLRGIKDAGD EPTAAPDSDL
APQER