Gene Elen_0339 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_0339 
Symbol 
ID8414623 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp440705 
End bp442369 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content72% 
IMG OID645023316 
Productdrug resistance transporter, EmrB/QacA subfamily 
Protein accessionYP_003180719 
Protein GI257790113 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAACG ACGGGCAGGC GGAGGAGGGG AGCGTCTCCG GCGGCGGCCG GCAACCCGCC 
GCGGGCGGCG GCTGTCCGAC GGCTCTCCCG GCTTCCGATC CGCCGCTGTC GCGGCGCTCG
GTCGCCGCCG TGTTCGCGGG GCTTCTGGTG GCGATGACCG TGGGGACCCT GAACCAGACC
ATCGTGGCCA CCGTCCTGCC CACCATCGTG GGCGAGCTGG GCGGCGTGAA CCGCATGCTG
TGGGTGACCA CGTCCTACGT GCTGGCCGCC ACCGTCACCA TGCCGTTGTA CGGCAAGATG
GGCGACCTCA TCGGCCGCAA AGGGCTGTTC ATCGGCGCGC TGGCCCTGTT CGTGGCGGGC
TCGGCCGCGT GCGCGCTGGC CCCGTCGATG GAAGGGCTCG TGATCGGCCG CGCCGTGCAG
GGCTTGGGCG GCGGCGGGCT CATGGTGCTC TCGCAGGCCA TCGTGGCCGA CGTGGTGCCG
CCGCGCCGCC GCGCGCTCTA CCTCAGCATC ATGGGCGTGG CCTACGCCGT GCCCATGCTG
GCCGGCCCGT TGCTGGGCGG ATTCTTCGCC GACGCGGTGG GCTGGCGCTG GGCGTTCTGG
TTCGACGTGC CGCTCGCGCT GGCCGCCATC GTCATCGCCG CCGTCTTCCT GCCGAAACCG
CGCCGCACCG CGGAGAGGGC GCCGTTCGAC GTCGGCGGCG CCGTGACGCT CGTGGCCGCC
GTGACCGCGC TCACGCTGGC AACCTTGTGG GGCGGCAACG AGCACGCGTG GACCTCGCCG
ACCATCATCG GCCTGCTGGT TGCCACCGCG GTTGCGAGCG CGCTGTTCGT GCTGGCGGAG
CGCCGCGCAC GGGAACCGCT CATGGCGTTG TCGCTGTTCA AGAACAGGAA CTTCGACATC
TCCATCGTCG CAAGCTCCAT CACGATGTTC GTCATGATCG GCGTGCTCAC GTACCTGCCC
ACGTACTTCC AGATCGTCGA CGACCTGAAC GCCACCGCCG CCGGCTACCT GGTGGCGCCT
ATGAACGCGG CCTGGTTCGC CGCCTCGCTG CTGTCGGGCT ACCTCGTGAA CAAGCTGGGC
ACGTACAAGA AGCTCATGGT GGTCAGCTTC GCCGTGCTCG TGGCGGGCAT GGTCGGGTTC
ATTGCCGTGG ACCAGGACCC GTCGGCGGTC GTCGTCGGCG GCCTGCTGGC CGTCATGGGC
TTCGGCGTGG GCCTGAACTT CGAGATCCTC GTGCTGGTGG TGCAGAACGA GTTCCCGGCC
TCCGCCGTGG GCATGGCCAC GGCGGCGACG GGCTTCTTCC GCAAGGTGGG GTCGGTGCTG
GGAACCTCCG TCGTGGGCGC GCTGTTCACG AGCGGCCTCG CGCGCGCCCT GGCCGAGCGC
CTCGCGCCGG TCGGCGGCGT CGGGGCGCTC GGCACGGACG CGAACTTGCT CACGCCGGCC
ATCGTGCACG CGCTGCCCCC GGACGTGCGC CATGCGGTGG GCGCCGCCTA CAGCGACGCG
CTCGCGCCCG TGCTCTGGCT CGCGTTGCCG CTGGCCGTGG CGGGCCTCGT GCTCATGCTG
TTCCTGCGCG AGACGCGGCT GGCCACCACC GTGGACGGAT CGGGGCATGG GGCCGACGAA
GGCGCGGACG ACCCCCGGCG GCGGCAGGGC GCGCGACGCC GCTAG
 
Protein sequence
MANDGQAEEG SVSGGGRQPA AGGGCPTALP ASDPPLSRRS VAAVFAGLLV AMTVGTLNQT 
IVATVLPTIV GELGGVNRML WVTTSYVLAA TVTMPLYGKM GDLIGRKGLF IGALALFVAG
SAACALAPSM EGLVIGRAVQ GLGGGGLMVL SQAIVADVVP PRRRALYLSI MGVAYAVPML
AGPLLGGFFA DAVGWRWAFW FDVPLALAAI VIAAVFLPKP RRTAERAPFD VGGAVTLVAA
VTALTLATLW GGNEHAWTSP TIIGLLVATA VASALFVLAE RRAREPLMAL SLFKNRNFDI
SIVASSITMF VMIGVLTYLP TYFQIVDDLN ATAAGYLVAP MNAAWFAASL LSGYLVNKLG
TYKKLMVVSF AVLVAGMVGF IAVDQDPSAV VVGGLLAVMG FGVGLNFEIL VLVVQNEFPA
SAVGMATAAT GFFRKVGSVL GTSVVGALFT SGLARALAER LAPVGGVGAL GTDANLLTPA
IVHALPPDVR HAVGAAYSDA LAPVLWLALP LAVAGLVLML FLRETRLATT VDGSGHGADE
GADDPRRRQG ARRR