Gene Information |
Locus tag | Elen_1158 |
Symbol | |
ID | 8415448 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 1390809 |
End bp | 1392119 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 645024120 |
Product | General substrate transporter |
Protein accession | YP_003181517 |
Protein GI | 257790911 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
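The coordinate and length fields above agree with one another, as the short sketch below shows. It assumes the coordinates are 1-based and inclusive, and that the gene length counts the stop codon; both assumptions fit the listed values but are not stated in the record. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes the stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length = gene_length(1390809, 1392119)
print(length)                  # 1311
print(protein_length(length))  # 436
```

The strand (`-`) does not change this arithmetic; it only determines which end of the interval carries the start codon.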
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCGG CAGCAGTAAC GGAGGCACAA CCGAAAGTGC CGTTCAAGGT GGCGATCTCT TCGTTTTTGG GCAACTTCAT CGAATGGTTC GACTACGCCA CCTATACGTA TTTCGCCATC ACCATCGGCA TCGTGTTCTT CCCCGAGTCG GCGGTGAACT CCACGCTGCT CGCGTTCGCG GTGTTCGCGT TGTCGTTCGT GTTCCGGCCG TTAGGGGCGG CGTTCTGGGG CAGCATGGGA GACAAGAAGG GGCGCAAATG GTCGTTGTCG CTGTCCATCT TCATGATGAC GGGCGCGGCG TTCCTCATCG GCTGCCTGCC GTCGTACGAG ACGATCGGCC TGCTGTCCCC CATCCTGCTG CTGTGCCTGC GCAGCGTGCA GGGATTCTCG GCTGCAGGCG AGTACTCGGG AGCAGCGGTG TTCCTGGCCG AGTACGCGCC GGCGAACCAT CGCGGGAAGT ACTGCTCGCT CGTGCCGGCA TCCACCGCGG CGGGCCTGTT GGCGGGCTCC ACCGCAGCGC TCATCATCAA GGCGCTGCTG CCCGAAGCCG ACGTGATCTC ATGGGGATGG CGCATTCCGT TCTTGCTGGC CGGACCCCTG GGGCTCGTGG CGCACTACAT CCGCACGAAG CTCGAGGATT CCCCCACCTA CCAGCAGATG ACCTCGACGG CCGATCCGGC CAAAGAGGCC CCGCGGCCTA CCCGCCTCGT GTTCAAGAAG TACAAGAAGC GCCTTGCGAC CAGCATCGCG GCGACCATGG TGAACTCGGT CGGCTTCTAC CTCGTGCTCA CCTACCTGCC CACGTATCTG ACCAGCTACA CGGCGATGGA AGCCTCGGCG GCCCAGCTTG CCACCGACAT CGCGCTGGTC ACGTACATCT TCATCATCTT CGGCGCCGGA AAGATATCCG ACATCGTAGG ACGTAAGAAA ATGCTGCTGG GCTCGTGCGT GGCGTTCATC CTGCTCAGCA TCCCCGCCTT CATGATGCTG GAGACGGCTC AGCTGCCCAT CGTCATCGCA GCGGAGCTCA TCATGTGCGT GACGCTCTCG TTCAATGACG CGAACATCGC CTGCTACCAG GCGGAGATGT TCCCCACGGA AGTGCGTTAC ACCGGCGCCG CGCTGGGGTC GAACATCGCC TACGTGGTGT TCGGCGGCAC GGCCTCGATG GTGGCCACCG CGCTCATCGA CGCCACGGGC AACGGCCTCA TGCCCGCGTA CTATATGATG GGCATCTGCC TTGTGGCGGG CATCATCCTG CTGTTCACGG CGCACGAGTA CGCCGGCAAG GAATTGAACG ACATCGAGTA G
Protein sequence | MSAAAVTEAQ PKVPFKVAIS SFLGNFIEWF DYATYTYFAI TIGIVFFPES AVNSTLLAFA VFALSFVFRP LGAAFWGSMG DKKGRKWSLS LSIFMMTGAA FLIGCLPSYE TIGLLSPILL LCLRSVQGFS AAGEYSGAAV FLAEYAPANH RGKYCSLVPA STAAGLLAGS TAALIIKALL PEADVISWGW RIPFLLAGPL GLVAHYIRTK LEDSPTYQQM TSTADPAKEA PRPTRLVFKK YKKRLATSIA ATMVNSVGFY LVLTYLPTYL TSYTAMEASA AQLATDIALV TYIFIIFGAG KISDIVGRKK MLLGSCVAFI LLSIPAFMML ETAQLPIVIA AELIMCVTLS FNDANIACYQ AEMFPTEVRY TGAALGSNIA YVVFGGTASM VATALIDATG NGLMPAYYMM GICLVAGIIL LFTAHEYAGK ELNDIE
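The gene sequence is listed in coding orientation (it begins with ATG and ends with TAG), so it can be translated directly without reverse-complementing. The sketch below is a minimal illustration, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in permitted start codons; this gene starts with ATG, so alternative starts are not handled. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
import itertools

# Codon table in the conventional T, C, A, G ordering (standard code / table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove the spaces that split the listing into blocks of ten."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    return 100.0 * sum(seq.count(base) for base in "GC") / len(seq)


def translate(cds: str) -> str:
    """Translate a CDS in coding orientation, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record above.
prefix = clean("ATGAGCGCGG CAGCAGTAAC GGAGGCACAA")
print(translate(prefix))  # MSAAAVTEAQ
```

Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared with the protein sequence and the GC content field in the record.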