Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_3023 |
Symbol | |
ID | 8417357 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 3510590 |
End bp | 3511528 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 645026002 |
Product | protein of unknown function DUF6 transmembrane |
Protein accession | YP_003183355 |
Protein GI | 257792749 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism [R] General function prediction only |
COG ID | [COG0697] Permeases of the drug/metabolite transporter (DMT) superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.66818 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACAAGG CCTTCCTCCT GGTGGCGGCT GCCATCTGGG GATTGGGCAC CGTGGTCATC AAGTCGACCG TCGACGAGTT CCCGCCCGCA TGGCTCGTCG GCGTGCGCTT CACCGTGGCG GGCATCATTT TGGGCATCGT CATGTTGCCG CGGTTCAGAA AAGCGCTGGA CCTCGACCAC CTGAAAAAGG GCGCCATCCT GGGCGCGTTC CTGTTCCTCT CGTACTGGGC GAACTCCACG GGACTCACCG ACACCACCGC CTCGAACAGC GCGTTCCTCA CATCGCTGTA CTGCGTGATC ATCCCGTTTC TCGGATGGGC GCTGCGCGGG CCGCGCCCGA CTCGCTTCAA CATCGCCGCC GCCCTCGTGT GCGTGGCCGG CGTGGGCTGC GTCTCGTTCG CGGGGCTCTC GGGGTTCTCG CTGCGCTTCG GCGACCTGAT CACGCTGCTG TCGGCGTTCT TTCTCAGCCT GCATGTGCTG TACACGGCGA AGTACGCGCG CGGTCGCGAC ATGACGCTGC TCACGGTCGT GCAGTTCCTG GTGGCCGGCG TTCTGGGCTT CGGCGCCGGG CTTGCGTTCG AGCCCATGCC CGCGTTCGCC AGCTTGGGGC TGGACACGTG GGTGAGCCTG GGGTACTTGG CCGTGTTCGC CTCGTGCATC GCGCTGCTGC TGCAGAACTT CGCCGTCGCG CACGTCGACC CCGCGCCCGC ATCGCTGTTC CTGGCAACCG AGTCGGTGTT CGGCGTGACG TTCTCGGTGC TGTTCTTGGG CGAGATCCTC ACCGGCCCGC TGTTCGCCGG GTTCGCCCTC ATCTTCGCCG GCATCGTGAT CAGCGAATAC CTCCCCTTGC GCGCAGAGAA GAAACGCCGA GCGCGCGCGA TGAAACCGCA GGAAGCGGAA ACCTTCCCGT TCGAAGACGA CCCCGAGCGG GAAGCATAG
|
Protein sequence | MYKAFLLVAA AIWGLGTVVI KSTVDEFPPA WLVGVRFTVA GIILGIVMLP RFRKALDLDH LKKGAILGAF LFLSYWANST GLTDTTASNS AFLTSLYCVI IPFLGWALRG PRPTRFNIAA ALVCVAGVGC VSFAGLSGFS LRFGDLITLL SAFFLSLHVL YTAKYARGRD MTLLTVVQFL VAGVLGFGAG LAFEPMPAFA SLGLDTWVSL GYLAVFASCI ALLLQNFAVA HVDPAPASLF LATESVFGVT FSVLFLGEIL TGPLFAGFAL IFAGIVISEY LPLRAEKKRR ARAMKPQEAE TFPFEDDPER EA
|
| |