Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0146 |
Symbol | |
ID | 8414430 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 202217 |
End bp | 203512 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 645023126 |
Product | General substrate transporter |
Protein accession | YP_003180529 |
Protein GI | 257789923 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGATCTA ACGACCAACG TTCCACTATC AGAAAAGTAG CCGTTTCATC GTTCCTTGGG AACTTCATCG AATGGTTCGA CTACGCAACC TATTCGTACT TCGCAGTGGT CATCGCCAAC GTGTTCTTCC CTGGCGACGA CCCGGCGCTT GCGCTCATGC AGACGTTCGC GGTGTTCGCG CTTTCGTTCC TGCTGCGCCC CATCGGCGCC ATCTTCTGGG GCAGCATGGG CGACAAGAAG GGCCGCAAGT GGGCCCTGTC CACGTCCATC TTCCTGATGA CCGGCGCTAC CTTCTGCATC GGCTTGCTGC CGGGCTACCT GGCCATCGGC CTCGCCGCCC CCATCTTGCT GCTGCTGCTC CGCATGATCC AGGGTTTCTC GGCTTCCGGC GAGTATGCCG GCGCTGCTAC GTTCCTGGCT GAGTACGCTC CCTCGGACAA GCGCGGCGTG TACTGCTCGC TCGTGCCTGC TTCGACGGCC GTGGGCCTGT TGGTGGGCTC TACGTTCGCC ACGGTCATGT ACACGATCCT GCCTGACGCA GCCGTGAACG AGTGGGGCTG GCGCATCCCG TTCCTGTTGG CAGGACCGCT GGGTCTCGTC GCTCACTACA TTCGTACCAA GCTGGAGGAT TCGCCGACGT ATCAGGAGAT GCAGGCTGCT ATCTCCAAGA CCGACGACTC GAATCATCCT ATCCGCGACC TGTTCCGTCA TCACACGAAG GAGCTCATCA TCTCCTTCGG TGCCGCCATG CTGAACGCGG TGGGCTTCTA CGTGGTTCTC ACGTACTTGC CGGTCTACCT GGAGACGGTG GTCATGATGC CGGCGAACGA ATCGTCGCTG ATCACCACTA TCTGCCTGGT CGCGTACGTG GCGTTCATCT TCGGCATGGG GCATCTGTCC GATAAGTTCG GGCGCAAGAA GATGCTCATC ATCGCCTGTG TGTCGTTCAT CGTGCTGACG GTGCCGGCGT TCATGCTGTT GAACACTGCG CAATTCTTCA CGGTGCTGGC CGTCGAGCTG GTGCTGTGCC TGGCGCTTAC CATCAACGAC GGTACGCTGT CCAGCTATCT GACCGAGACG TTCCCGACGG CGGTGCGCTA CACCGGCTTC GCCCTGTCGT TCAACTTGGC GAACGCCCTG TTCGGCGGTT CCGCCTCGTT CATCTCGACT GCTCTGATCG CCGCTACCGG CTCCGGTCTC GCGCCTGCCT GGTACATGGT CGGCGTGTCT TGTGTCGCGC TGGTGGCGAT GATACTATCG CACGAACATA CGGGTAAGGA TCTGTCCGAA GTGTAA
|
Protein sequence | MGSNDQRSTI RKVAVSSFLG NFIEWFDYAT YSYFAVVIAN VFFPGDDPAL ALMQTFAVFA LSFLLRPIGA IFWGSMGDKK GRKWALSTSI FLMTGATFCI GLLPGYLAIG LAAPILLLLL RMIQGFSASG EYAGAATFLA EYAPSDKRGV YCSLVPASTA VGLLVGSTFA TVMYTILPDA AVNEWGWRIP FLLAGPLGLV AHYIRTKLED SPTYQEMQAA ISKTDDSNHP IRDLFRHHTK ELIISFGAAM LNAVGFYVVL TYLPVYLETV VMMPANESSL ITTICLVAYV AFIFGMGHLS DKFGRKKMLI IACVSFIVLT VPAFMLLNTA QFFTVLAVEL VLCLALTIND GTLSSYLTET FPTAVRYTGF ALSFNLANAL FGGSASFIST ALIAATGSGL APAWYMVGVS CVALVAMILS HEHTGKDLSE V
|
| |