Gene Elen_1848 details

Gene Information

Locus tag: Elen_1848
Symbol:
ID: 8416152
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 2172079
End bp: 2173344
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 64%
IMG OID: 645024818
Product: major facilitator superfamily MFS_1
Protein accession: YP_003182201
Protein GI: 257791595
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID:
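
The start, end, gene length and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the record does not state this, but it matches the listed length) and that the final codon is a stop codon that is not counted in the protein length. Variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2172079
end_bp = 2173344

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 nt; the last codon is assumed to be the stop.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")      # listed in the record as 1266 bp
print(protein_length, "aa")   # listed in the record as 421 aa
```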


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.815594
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAGTACCG TAGCTAAGAA AGAGAAGATG AGCGTGCGCC ATATGCTCGT CGTTCTCACG 
GGCATCATGA TCACCTTCGG GTGCTCGGCC CTGTGCTTTT CCACGTGGGG CCTGTTCCAG
CCGGTCGTCG CCGAGGGCCT GGGCGTCGAG ACCACGGCGT TCGCCATGTA CGTCACGGTG
ATGTACCTGA CGATGACCGT CGCCTCGCCG TTCATGGGCA AGCTGCTGCA GACCGTGGAC
ATCCGCATCA TCCTGTCCGT TTCGGCCTGC TTGGTGGGCG GCGCGTTCCT GCTCATGAGC
GTTTCGAACG AGATCTGGAT GTTCTACCTG GCCGCCGTGC TGCTGGGCCT GGGCGAGATC
TCCATCCTGT GGCTGGCCAT CCCCACGCTG ATCAACCGCT GGTTCGCCGA CAAGGCGGGC
ACGTTCATCG GTCTGTGCAT GGCGTTCACC GGCATCGGCG GCGCCGTGTG GTCCGCCGTG
TTCACGGGCC TGCGCGCCGG CGGCATGGAT TTCCACACCA TCTACCTGAT CTGGGCCGTC
ATCGCCTTGG TCACCTCGCT GCCGTTCACG CTGTTCTGCG TGCGCAGCAA GCCCGAGGAC
TGTGGTCTGG CCCCGTACGG CGCCTCCGTG GTTGCCGGCC AGGCTCCGGC CAAGCCCACC
GGCCTGTCCG CTGCGGCCGC CATGAAGACC CCGGCGTTCT ACTCCGTGTG CGTGTTCGCG
GGCCTCATCA ACATCGCCGT CCTCATCGCC ATGCAGTTCC CCACCTACAC GAAGTCCCTC
ACCGACGTCG CGTTCGACGT GCTGGTGGTC GGCGGCGTCA TGACCACGGT CATGATGGTG
GGCCAGGCGC TGTTCAAGCT CATCCTGGGC GTGGTCGCCG ACCGCAACGC CAAGGGCGCT
TTGGTGTTCG CGTTCGTCTG CGGCGTCGCC GGCGTGCTGC TCTGCTGGTT CGGCATTGCT
TCCGAGTACA TCCTGTACAG CGGCGCATTC ATCTTCGGCG CGTTCTACGC CACGGCCGTC
GTGCTGGTGC CGGTCATCGT GCGCCAGTCG TTCGGTTCGC GCGACTACTC CGTGATCTAC
TCCCGCGTGA GCACCGTGTT CAACCTGATC GCCGCTTTCG CATCGATGAT CTGGGCTTGG
ATCGGTTCCA GCTTCGGCTT CAACGCCGTG TTCATCGTCG GCTTGGTGCT GCTGGTCCTC
ATCCTTCTGC TCGGCTTCTA CACGTTCGCG AACGCCAAGA AGTTCAAGAG CCAGTGGACC
GAATAG
 
Protein sequence
MSTVAKKEKM SVRHMLVVLT GIMITFGCSA LCFSTWGLFQ PVVAEGLGVE TTAFAMYVTV 
MYLTMTVASP FMGKLLQTVD IRIILSVSAC LVGGAFLLMS VSNEIWMFYL AAVLLGLGEI
SILWLAIPTL INRWFADKAG TFIGLCMAFT GIGGAVWSAV FTGLRAGGMD FHTIYLIWAV
IALVTSLPFT LFCVRSKPED CGLAPYGASV VAGQAPAKPT GLSAAAAMKT PAFYSVCVFA
GLINIAVLIA MQFPTYTKSL TDVAFDVLVV GGVMTTVMMV GQALFKLILG VVADRNAKGA
LVFAFVCGVA GVLLCWFGIA SEYILYSGAF IFGAFYATAV VLVPVIVRQS FGSRDYSVIY
SRVSTVFNLI AAFASMIWAW IGSSFGFNAV FIVGLVLLVL ILLLGFYTFA NAKKFKSQWT
E
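
The gene sequence begins with TTG, not ATG, yet the protein begins with M. This is consistent with translation table 11, where TTG can act as an initiation codon and is then read as methionine. The sketch below is a minimal illustration, not the pipeline that produced this record: it builds the codon table from the usual TCAG ordering, treats the first codon as M (an assumption that holds for this entry), and translates only the first 60 nt and the last 6 nt copied from the sequence above. The function name `translate` is hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order (shared by tables 1 and 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str, is_cds_start: bool = False) -> str:
    """Translate DNA codon by codon; '*' marks a stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = [CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3)]
    if is_cds_start and protein:
        # Assumption: the initiator codon (TTG here) is read as methionine.
        protein[0] = "M"
    return "".join(protein)


# First 60 nt and last 6 nt of the gene sequence listed in this record.
first_60 = "TTGAGTACCG TAGCTAAGAA AGAGAAGATG AGCGTGCGCC ATATGCTCGT CGTTCTCACG"
last_6 = "GAATAG"

print(translate(first_60, is_cds_start=True))  # MSTVAKKEKMSVRHMLVVLT
print(translate(last_6))                       # E*
```

The first output matches the opening 20 residues of the protein sequence, and the second shows the final glutamate followed by the TAG stop codon.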