Gene Elen_1158 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1158 
Symbol 
ID8415448 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1390809 
End bp1392119 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content63% 
IMG OID645024120 
ProductGeneral substrate transporter 
Protein accessionYP_003181517 
Protein GI257790911 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCGG CAGCAGTAAC GGAGGCACAA CCGAAAGTGC CGTTCAAGGT GGCGATCTCT 
TCGTTTTTGG GCAACTTCAT CGAATGGTTC GACTACGCCA CCTATACGTA TTTCGCCATC
ACCATCGGCA TCGTGTTCTT CCCCGAGTCG GCGGTGAACT CCACGCTGCT CGCGTTCGCG
GTGTTCGCGT TGTCGTTCGT GTTCCGGCCG TTAGGGGCGG CGTTCTGGGG CAGCATGGGA
GACAAGAAGG GGCGCAAATG GTCGTTGTCG CTGTCCATCT TCATGATGAC GGGCGCGGCG
TTCCTCATCG GCTGCCTGCC GTCGTACGAG ACGATCGGCC TGCTGTCCCC CATCCTGCTG
CTGTGCCTGC GCAGCGTGCA GGGATTCTCG GCTGCAGGCG AGTACTCGGG AGCAGCGGTG
TTCCTGGCCG AGTACGCGCC GGCGAACCAT CGCGGGAAGT ACTGCTCGCT CGTGCCGGCA
TCCACCGCGG CGGGCCTGTT GGCGGGCTCC ACCGCAGCGC TCATCATCAA GGCGCTGCTG
CCCGAAGCCG ACGTGATCTC ATGGGGATGG CGCATTCCGT TCTTGCTGGC CGGACCCCTG
GGGCTCGTGG CGCACTACAT CCGCACGAAG CTCGAGGATT CCCCCACCTA CCAGCAGATG
ACCTCGACGG CCGATCCGGC CAAAGAGGCC CCGCGGCCTA CCCGCCTCGT GTTCAAGAAG
TACAAGAAGC GCCTTGCGAC CAGCATCGCG GCGACCATGG TGAACTCGGT CGGCTTCTAC
CTCGTGCTCA CCTACCTGCC CACGTATCTG ACCAGCTACA CGGCGATGGA AGCCTCGGCG
GCCCAGCTTG CCACCGACAT CGCGCTGGTC ACGTACATCT TCATCATCTT CGGCGCCGGA
AAGATATCCG ACATCGTAGG ACGTAAGAAA ATGCTGCTGG GCTCGTGCGT GGCGTTCATC
CTGCTCAGCA TCCCCGCCTT CATGATGCTG GAGACGGCTC AGCTGCCCAT CGTCATCGCA
GCGGAGCTCA TCATGTGCGT GACGCTCTCG TTCAATGACG CGAACATCGC CTGCTACCAG
GCGGAGATGT TCCCCACGGA AGTGCGTTAC ACCGGCGCCG CGCTGGGGTC GAACATCGCC
TACGTGGTGT TCGGCGGCAC GGCCTCGATG GTGGCCACCG CGCTCATCGA CGCCACGGGC
AACGGCCTCA TGCCCGCGTA CTATATGATG GGCATCTGCC TTGTGGCGGG CATCATCCTG
CTGTTCACGG CGCACGAGTA CGCCGGCAAG GAATTGAACG ACATCGAGTA G
 
Protein sequence
MSAAAVTEAQ PKVPFKVAIS SFLGNFIEWF DYATYTYFAI TIGIVFFPES AVNSTLLAFA 
VFALSFVFRP LGAAFWGSMG DKKGRKWSLS LSIFMMTGAA FLIGCLPSYE TIGLLSPILL
LCLRSVQGFS AAGEYSGAAV FLAEYAPANH RGKYCSLVPA STAAGLLAGS TAALIIKALL
PEADVISWGW RIPFLLAGPL GLVAHYIRTK LEDSPTYQQM TSTADPAKEA PRPTRLVFKK
YKKRLATSIA ATMVNSVGFY LVLTYLPTYL TSYTAMEASA AQLATDIALV TYIFIIFGAG
KISDIVGRKK MLLGSCVAFI LLSIPAFMML ETAQLPIVIA AELIMCVTLS FNDANIACYQ
AEMFPTEVRY TGAALGSNIA YVVFGGTASM VATALIDATG NGLMPAYYMM GICLVAGIIL
LFTAHEYAGK ELNDIE