Gene Elen_3023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_3023 
Symbol 
ID8417357 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp3510590 
End bp3511528 
Gene Length939 bp 
Protein Length312 aa 
Translation table11 
GC content66% 
IMG OID645026002 
Productprotein of unknown function DUF6 transmembrane 
Protein accessionYP_003183355 
Protein GI257792749 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism
[R] General function prediction only 
COG ID[COG0697] Permeases of the drug/metabolite transporter (DMT) superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.66818 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACAAGG CCTTCCTCCT GGTGGCGGCT GCCATCTGGG GATTGGGCAC CGTGGTCATC 
AAGTCGACCG TCGACGAGTT CCCGCCCGCA TGGCTCGTCG GCGTGCGCTT CACCGTGGCG
GGCATCATTT TGGGCATCGT CATGTTGCCG CGGTTCAGAA AAGCGCTGGA CCTCGACCAC
CTGAAAAAGG GCGCCATCCT GGGCGCGTTC CTGTTCCTCT CGTACTGGGC GAACTCCACG
GGACTCACCG ACACCACCGC CTCGAACAGC GCGTTCCTCA CATCGCTGTA CTGCGTGATC
ATCCCGTTTC TCGGATGGGC GCTGCGCGGG CCGCGCCCGA CTCGCTTCAA CATCGCCGCC
GCCCTCGTGT GCGTGGCCGG CGTGGGCTGC GTCTCGTTCG CGGGGCTCTC GGGGTTCTCG
CTGCGCTTCG GCGACCTGAT CACGCTGCTG TCGGCGTTCT TTCTCAGCCT GCATGTGCTG
TACACGGCGA AGTACGCGCG CGGTCGCGAC ATGACGCTGC TCACGGTCGT GCAGTTCCTG
GTGGCCGGCG TTCTGGGCTT CGGCGCCGGG CTTGCGTTCG AGCCCATGCC CGCGTTCGCC
AGCTTGGGGC TGGACACGTG GGTGAGCCTG GGGTACTTGG CCGTGTTCGC CTCGTGCATC
GCGCTGCTGC TGCAGAACTT CGCCGTCGCG CACGTCGACC CCGCGCCCGC ATCGCTGTTC
CTGGCAACCG AGTCGGTGTT CGGCGTGACG TTCTCGGTGC TGTTCTTGGG CGAGATCCTC
ACCGGCCCGC TGTTCGCCGG GTTCGCCCTC ATCTTCGCCG GCATCGTGAT CAGCGAATAC
CTCCCCTTGC GCGCAGAGAA GAAACGCCGA GCGCGCGCGA TGAAACCGCA GGAAGCGGAA
ACCTTCCCGT TCGAAGACGA CCCCGAGCGG GAAGCATAG
 
Protein sequence
MYKAFLLVAA AIWGLGTVVI KSTVDEFPPA WLVGVRFTVA GIILGIVMLP RFRKALDLDH 
LKKGAILGAF LFLSYWANST GLTDTTASNS AFLTSLYCVI IPFLGWALRG PRPTRFNIAA
ALVCVAGVGC VSFAGLSGFS LRFGDLITLL SAFFLSLHVL YTAKYARGRD MTLLTVVQFL
VAGVLGFGAG LAFEPMPAFA SLGLDTWVSL GYLAVFASCI ALLLQNFAVA HVDPAPASLF
LATESVFGVT FSVLFLGEIL TGPLFAGFAL IFAGIVISEY LPLRAEKKRR ARAMKPQEAE
TFPFEDDPER EA