Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0371 |
Symbol | |
ID | 8414655 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 479020 |
End bp | 480297 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645023348 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003180751 |
Protein GI | 257790145 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00880] Multidrug resistance protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACTAC ACTCGTGGCT CGCACGGCCT CAACGGCGGT TGGGGCCGCT GGGGCTCATC GTGCTGCTGG TCATCACGTC GCTGGTCACG CCGCTTTCGC TGGACATGTA CACGCCGGCC GTCCCGCACA TGACCGAGCA TTTCAACACG TCGGAGAGCA TGGTGAACCT CACGCTGGTG GGCTACTTCC TGTTCTTCGC CGTCGGGCTG CTGGCGTTCG GGCCCGCAAG CGACCGCTAC GGACGCAAAC CCGTGCTGCT GGCAGGCATT CTCACGTATG CGCTGGCCAG CGCTTTATGC GCGCTGTCAG TGGACATCGT CATGCTCATC GCCACGCGCA TCCTGCAAGC CTTGGGCGCC GGGGCGGTGA GCGCGGTGTC CACGGCGGTG GTGAAGGACG CCGTCGTCCC CGAACGGCGC GAGGCTCTGC TGTCCGTCGT GCAGGTGATG TTCGTGGTAG GGCCCGTGCT GGCGCCGGTG GCGGGCGCGC TCATCCTGCA GGTCGCCGAC TGGCGCATGA CGTTCTGGGT ACTGGCGGGC ATCGGCCTTC TGTGCGCCGG GCTGGCGCTG CTGTTCGACG AGACGCTTCC CGTCAGCGAA CGCTACGAGG GCACCGTGCT GGGAAGCGTG AAGCAGCTGG GCGCGGTGGC GCGCAACAAG GGGTTCTCGG CGTTCCTGGG CATCGTCGGG CTGTACAACC TGCCGTTCAT GGCGTACATC GCCGTCGGTT CGTACGTGTA CATCACGTTC TTCGGGCTGA CCGAGCTGGA GTACAGCATG TACTTCGCGT TCGCCGCGCT GCTGACGGCT GCGGGGCCGT TCATCTGGCT TGCGGCCTCG CGGTTCATGT CCGCGCGGCG GTTCACCAGC ATCCTGCTGG GCATCGCGCT GGCGTCCGGC GCGGCCATGC TGGCCGTGGG CCAAGCGAGC CCGCAGCTGT TCTGCATCAC GTTCCTTGCG TTCGCGCTGA CGGAGGCTGC CGTTCGGCCG TACAGCACCA ACGTCCTGCT GTCGCAGCAG GAAGGCGATA CCGGAGCCGC ATCGTCGCTG ATCAACTTCG CGCACACCGC CATCGGCTGC GTCGGCATGC TTGCCGCCGT ACTGCCGTGG CCGAACTACG TGGTAGGCGT GGGCGTCATC ATCGTCGGTT CGATGGGCGT CGCCATCGCC GGCTGGGTGG CTCTGCTGCG CTCGAACGTA CCGCTCCGAG GCATCAAAGA TGCAGGAGAC GAACCAACAG CGGCACCCGA CAGCGACCTT GCCCCGCAAG AACGCTAG
|
Protein sequence | MALHSWLARP QRRLGPLGLI VLLVITSLVT PLSLDMYTPA VPHMTEHFNT SESMVNLTLV GYFLFFAVGL LAFGPASDRY GRKPVLLAGI LTYALASALC ALSVDIVMLI ATRILQALGA GAVSAVSTAV VKDAVVPERR EALLSVVQVM FVVGPVLAPV AGALILQVAD WRMTFWVLAG IGLLCAGLAL LFDETLPVSE RYEGTVLGSV KQLGAVARNK GFSAFLGIVG LYNLPFMAYI AVGSYVYITF FGLTELEYSM YFAFAALLTA AGPFIWLAAS RFMSARRFTS ILLGIALASG AAMLAVGQAS PQLFCITFLA FALTEAAVRP YSTNVLLSQQ EGDTGAASSL INFAHTAIGC VGMLAAVLPW PNYVVGVGVI IVGSMGVAIA GWVALLRSNV PLRGIKDAGD EPTAAPDSDL APQER
|
| |