Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0162 |
Symbol | |
ID | 8414446 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 225817 |
End bp | 226974 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 645023142 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003180545 |
Protein GI | 257789939 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCGCCG ACGAGGAACG GAGCGGCCCG AGCCGCACGC AGGCAACAGG ATCATCGAGT GCCGTCACGG CGCTCGTCGC CATTGCGGCC TTCCTCCTGT TCGGGCTCAA CAACACCGAG ACGACCGCGC TTCCCACCTA CGTCATGTCG CTTGGCGCCG GACCGTTCGT CGCCAGCTTG CAGAACAGCC TGTTCGTGCT GCTCGCCGTC CTCCTGCGCA TCCCGCTCGA GCGCGTGGTG GAGCAGCGAG GAAGCCGCTT CGCCATGATC GCGGGAGCGT TGGGGTACAC CGTGCCGTGC CTGGCCCTCG TCGGTTGCTC CGAGCTGTGG CAGGTCGTGG TCCTGCGCCT GGCGCAGGCG TTCGGGCTGG CGCTGTTCCA ACCGAGCGTG GCGCAGTACC TCGCAGCCAC CTCCCCCGCC TCCAAGCTGG GGAGGCGGTT GGGAATCGTG CGGTTCGCAA CCACCGCCTC GCTGATGGCG GGGCCGGTGA CCATCTTCCC GCTCATCGAC GCGCACGGGT ACGGGGCGTT CTTCGTCGCG CTCGCGCTTG TGGGAACGTG CGGCACGGCC GTCGCCCTCG CGCTCCCCGC GTCCTCTCCC CTGGGCCAGC GGGTGCTCCT CCTCGCCAGC CCGCTCCTGC TCGCGTCGGG CTACAGCGTG GTCATGAACT TCGGCCAGAC CCTCGCCGAA GAAGCGCTCG CATCCTGCAA CGACGGCATG CTGTTCGCGT TCCTCAGCAC CGGAGGCCTC GTCGGCAGCC TCGTCGCCGG ATGGGCGACC GATCGGTTCG GCGCGAAGCG CTCGGTGGGC TGCACCATCG CCGCCAACGG CCTGGGGCTG CTGCTCATGG CGCTGGGCCG CAGCCCCGAA GCGGTCCTCG CGGGGGCGTT CCTGTGCGGA GCGGGATACT TCGGCGCCAC GGCCACGCTC GTCGCCGCCG CGGGCGCATC GTCGGGCGGC GGCGCCGGCA CGTTCCTGGC GCGGCAGCAA AGCGCGCTCG ACCTGGGAAT GATCGCGGGA GGGCTGCTGG CCGGAGCCAT GATGCAGGGC GGCTTGCCGG TTTCCGCGAC CTATCTGGCA ACCTCGGCGG TCGCCGGGGC GGGTCTGGTC GCGTGGGGTA TGATGTATCC GAACACGAAG GGGAAGCCGA AGCGCTAG
|
Protein sequence | MGADEERSGP SRTQATGSSS AVTALVAIAA FLLFGLNNTE TTALPTYVMS LGAGPFVASL QNSLFVLLAV LLRIPLERVV EQRGSRFAMI AGALGYTVPC LALVGCSELW QVVVLRLAQA FGLALFQPSV AQYLAATSPA SKLGRRLGIV RFATTASLMA GPVTIFPLID AHGYGAFFVA LALVGTCGTA VALALPASSP LGQRVLLLAS PLLLASGYSV VMNFGQTLAE EALASCNDGM LFAFLSTGGL VGSLVAGWAT DRFGAKRSVG CTIAANGLGL LLMALGRSPE AVLAGAFLCG AGYFGATATL VAAAGASSGG GAGTFLARQQ SALDLGMIAG GLLAGAMMQG GLPVSATYLA TSAVAGAGLV AWGMMYPNTK GKPKR
|
| |