Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1023 |
Symbol | |
ID | 8415313 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 1239245 |
End bp | 1240762 |
Gene Length | 1518 bp |
Protein Length | 505 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 645023987 |
Product | efflux transporter, RND family, MFP subunit |
Protein accession | YP_003181384 |
Protein GI | 257790778 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0845] Membrane-fusion protein |
TIGRFAM ID | [TIGR01168] Gram-positive signal peptide, YSIRK family [TIGR01730] RND family efflux transporter, MFP subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.391562 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTCCGA TCAACAACCG TCGTCAGGGC GCTCCGGACG GCACCGAGCC TTTGAGCGCT CAACCGAGCC CCTTCGATCA GAACGCTTTC GATCGCATCG CGTTCGAGCC GATCGAGGGC CTGCCGGGGC CGACCGACCT CGTCGACGCC GAGGCGCTCG CCGACATGCG GGCGTTCAAC GACCTCAAGG CGAAGCGCAA GAAGCAGCGC AAGCGCAAGA TCATCATCGG CGCCGTTTCG GCCGTGGCGG TGCTCGCCGT CGCGGGCGGC GCGTTCGCGT GGTACGCGGC CGACCAGGCG GCCAAGGCGC TGCAGGACAT GGCCCCGCAG ACCGGCTTCG TGGAGCAGGG CACGTTCGTC GAGACGGTGT CGGCGTCCGG CAACCTGCAA CCTGTCGCGT CGGTGAGCGC CACGCCCGAG GTGGACGGCA TCGTGGGCGA GGTGCTCGTG GCCGAAGGCG ACGCGGTGGC CGAGGGCCAG ACGCTGTTCA CCGTGGTCAA CGACGAGCTG GACAAGGCGG TGAACCAGGC GGCCCAGGGC ATCGAGGAGG CCAAGAACGG CGTGGCCCAG GCCCAGAACG CCGTGAACGA CGCCTACCAC GCCAAGTCCG CCGGCCAGCA GGCCGCCGCG AACGCCCAAG CCCAGGCGCA AGCCGCCGCC GCGGCCGCCA AGGAGGCCGG GGGAGCGGCG GCCGAGTCGT TCGCCGGCGA GCAGGCTTCC TTCGACGAGT CCAGCGCCGA CTCCGCCATC AGGTCGGCCG AGCTCGGCTT GAGCAACGCG AACCTCGCGC TGCAGAACGC GCAGAGCGCC TACGACGAGG CCGTCGCGCG CGCCGCCAAG CGCACGGTGA CCGCCTCCAT CGCCGGCAGC GTCGTCGCGG TGAACATCGA GCCCGGCAAG GCGCTCGGGG CCACGGCGAG CGCCGCGACC TCGCCCGTGC AGATCGCCGA CCTGTCGCAG ATGACCGTCT CCATCAACGT GAACGAGATC GACATCCTGA AGATCACCGC CGATCAGACC GCCGAGGTCA CGTTCACCGC CGCCCCCGAC CTCACGCTGC CCGCCACCGT GGTGAGCATC GCCACCACGT CGGCCGGCTC CGGCGATGCG TCGGGCGGCG CCATGTACGG CGGCATGGGC GGTGCCGTCA CGTACGCGGT GAAGCTGCTC ATCGCCGAGC CCGACCCGCG CCTCAAGCCC GGCATGACCG CCAAGGCCAC CATCACCACG ACCACCATCG AGAACGCGCT CATGGTGCCC ATCTCGGCCG TGCAGTCCGA CGGCGCGGGC GGCAGCTTCG TGATGGTGCT CACCGACCCC GAGACGCAGG AGATGGACGC GCGCACGGTG GAGGTCGTCG CGTCCGACGG CCTCACGTCC GTCGTGAAGG GCCAGGTGAA AGCCGGCGAC GAGGTGGTCG TCGGCGGAGG CATGGGCGGC GCGGTGGACG GCATGGGCAT GGCGGGCGAC GGCGGGATGG CCGCGGTCGA CGCCGGCGGC AGCGTCATGG TGGGGTAG
|
Protein sequence | MGPINNRRQG APDGTEPLSA QPSPFDQNAF DRIAFEPIEG LPGPTDLVDA EALADMRAFN DLKAKRKKQR KRKIIIGAVS AVAVLAVAGG AFAWYAADQA AKALQDMAPQ TGFVEQGTFV ETVSASGNLQ PVASVSATPE VDGIVGEVLV AEGDAVAEGQ TLFTVVNDEL DKAVNQAAQG IEEAKNGVAQ AQNAVNDAYH AKSAGQQAAA NAQAQAQAAA AAAKEAGGAA AESFAGEQAS FDESSADSAI RSAELGLSNA NLALQNAQSA YDEAVARAAK RTVTASIAGS VVAVNIEPGK ALGATASAAT SPVQIADLSQ MTVSINVNEI DILKITADQT AEVTFTAAPD LTLPATVVSI ATTSAGSGDA SGGAMYGGMG GAVTYAVKLL IAEPDPRLKP GMTAKATITT TTIENALMVP ISAVQSDGAG GSFVMVLTDP ETQEMDARTV EVVASDGLTS VVKGQVKAGD EVVVGGGMGG AVDGMGMAGD GGMAAVDAGG SVMVG
|
| |