Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2800 |
Symbol | |
ID | 8417126 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 3250177 |
End bp | 3251937 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 645025775 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003183136 |
Protein GI | 257792530 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00711] drug resistance transporter, EmrB/QacA subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0579957 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.00000851422 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCTCCGT CTTCTTCCTC ATCTGCTTCG TCCGCCTCGC GCGGACGCGG CAAGGGGTTC GCGCTTGTTG CGGCCGTGTA TCTGCTGGGC CTGTTCATCG GGGCGCTCGA TACCGGCATC GTCACGCCTG CCCGCACGGT CATCCAGAGC GATCTGGGCA TCGGCGAGCA GATGGGCGTG TGGATCATCA CCATCTATAC GCTTGCCTAC GCGGCCGCCA TTCCGGTGAT GGGCAAGCTG GCGGACCGTT CGGGACGCAA GTACGTGTAC CTTGCGAGCA TCCTGCTGTT CGGTGTCGGG TCGCTTCTAT GCGGGTTGGC GCAGGACGTG GGGAGCTTTT GGATGCTGTT AGCCGCGCGC GCCGTGCAGG CGGTGGGCGG AGGCGGCATC GTGCCTGTTG CCACGGCCGA GTTCGGCACG ACGTTTCCTC CCGAGAAGCG CGGGCTGGCG TTGGGTCTGG TAGGCGGCGT GTACGGCATT GCCAACATCT TCGGAGCGTC GGCCGGCAGC CTGATCCTAT CGGTGTTCGG GCAGGCCAAC TGGCAGTTCA TCTTCTACGT GAACGTTCCC ATCTGCGCCT TCATCGTGGT GGCGGGGCTG TTCGTGCTGC CGAACACGCG AGCCGAGCAG GTGAAGCCCA TCGACGGGTG GGGCATTGCA GTGCTGGTGG CGATGGTGTT GTCGCTGCTG TACGGACTGA AGAACCTCGA TTTCTTCGAT CTGGGAGCAT CTGCGACCTC GTCTGACGTG TGGCCGTTCT TGCTCGCGTT CGTCGTGCTG CTTCCGGTGT TCGTGCTGAT CGAGCGCCGC GCGGCCGACC CGGTGCTCAA CCTGTCGTAC TTCCGCGACC GCGACATCGT GATCACGTTG GTGCTGTCGG TGATCACCGG CGTCATCCTG ATGGGCATCA TCTTCATCCC GCAGTTCGCC GAGAACGTGC TGAAACTGCC CTCGGGCAGC GGCGGCTACG TAGTCATCGT GCTGGCGGCG TTCGCCGGGG TGGGAGCGCC GGTGTCGGGC AAGCTCATCG ATCGCTTCGG CGTGAAGGCG GTGCTCGCGT TCGGGCTGGC GGCTTCGGCT GCGGGAGCGA TGTTCCTGGC GCTGGTGGCC ACGCAGTTCG CGAATATGGC GACGCTCATC ATCAGCCTCG TGGCCATCGG CATCGGGATG GGCTTCACGA TTGGAACGCC GCTCAACTAC ATGATGCTGG CGAAGACGAA GGAGCGCGAG GCGAACTCGG CGCTGGCGAC GCTGTCGCTG GTGCGCTCGG TGGGCACGGC GGTCGCGCCT GCCGTGCTGG TTGCGTTCAT CGCGCATGCG GGCATGGCCA TTCCCGATCG CATCATGGGC GTTCTGCCCG ATGCGCCGGG CGGGCAATCG ATTGCGCAGC TAGCGAGCGG GCAAGCGCAG GGAGGCGATG GCGCTGGGTT GCCCGACGAT CTTCAACAGC TGATGAAGGG GTCGGATGTG ACCACGATCG TCGCGAACGT GAAGACGCTG GCAAAAACCG AGATCGAACA GGAAGCTGCT TCGGCGGGCA TGCCGGCCGA AGCCGTCGAC GCGGCGGAGC AACAGTACCT CGCCGCCATC GACGATCGGG CAGGCGACAT AGAAAGCACG TTCCAGAGCA CGGTCGACGA AGGCTTTCGC GGTGCGTTTC TGCTGGTGGG GATTTGCTCG CTCGTGGGGT TGGCGCTGCT TGCGCTCTAC CGGGAAGACA GGCCGCGGCC CGGGCAGGCG AAGGGGGAGC CGACGCGCTG A
|
Protein sequence | MAPSSSSSAS SASRGRGKGF ALVAAVYLLG LFIGALDTGI VTPARTVIQS DLGIGEQMGV WIITIYTLAY AAAIPVMGKL ADRSGRKYVY LASILLFGVG SLLCGLAQDV GSFWMLLAAR AVQAVGGGGI VPVATAEFGT TFPPEKRGLA LGLVGGVYGI ANIFGASAGS LILSVFGQAN WQFIFYVNVP ICAFIVVAGL FVLPNTRAEQ VKPIDGWGIA VLVAMVLSLL YGLKNLDFFD LGASATSSDV WPFLLAFVVL LPVFVLIERR AADPVLNLSY FRDRDIVITL VLSVITGVIL MGIIFIPQFA ENVLKLPSGS GGYVVIVLAA FAGVGAPVSG KLIDRFGVKA VLAFGLAASA AGAMFLALVA TQFANMATLI ISLVAIGIGM GFTIGTPLNY MMLAKTKERE ANSALATLSL VRSVGTAVAP AVLVAFIAHA GMAIPDRIMG VLPDAPGGQS IAQLASGQAQ GGDGAGLPDD LQQLMKGSDV TTIVANVKTL AKTEIEQEAA SAGMPAEAVD AAEQQYLAAI DDRAGDIEST FQSTVDEGFR GAFLLVGICS LVGLALLALY REDRPRPGQA KGEPTR
|
| |