Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0351 |
Symbol | |
ID | 8414635 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 452037 |
End bp | 453338 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 645023328 |
Product | General substrate transporter |
Protein accession | YP_003180731 |
Protein GI | 257790125 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2271] Sugar phosphate permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0092051 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.434998 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGGCAG CTACAAAGAA AAAAACTTCG TTAGCCAGGG CGGTGTTCTC GGGGAGCCTT GGGAATATGC TGGAATGGTT CGACTACGGG TTGTACGGGT ACTTCGCAGC GATCATTTCC GCAGATTTCT TCGTTGCCGA CGACCCGATC GTCGGGCTCT TGCTCAGCTT CCTGGTCTTC GGAACGGGCT TTCTCGTACG CCCCATCGGC GGTATTCTGA TCGGCGCGTA CGCCGACAAG CACGGAAGAA TCAAGGCTTT GACGTTGACG ATCCTGTGCA TGGGCATCTG CACCATGCTC ATGGGATGCT TGCCAACGTA TTCCCAGGTG GGACTTTTGG CACCCATCCT GCTGACGGTG CTGCGCCTGC TGCAGGGCCT GGCAACGGGC GGCGAGTTCG GCTCTTCGCT GACGTTCATC TCGGAGTACG GAACGCCGAA CAACAGGGCC TTCCTGTGCA GCTGGCAGCC GTTCAGCGTC GGTTGCGGCC TGCTCCTGGG GTCCACCGCG GGGCTGCTCG TCACCACGCT TCTGCCTGAA GCGGCGCTGT ACGAGTGGGG ATGGCGCGTG CCGTTCCTCT GCGGAATCCT GATCGCCTTC TACGGCGTGC ACATGCGCAA GAACGTCCCC GATTCCCCTG AGTTCCTGAA GGCGAAGGCG GAAGTGAAGG AAGAGGACCA TACGCCGGTC AAAGACCTGT TCCTGCGCTA CAAGAAGTCC ATCATCACGG TGATCGGGCT GCTCGTCGGC TCCAGCGCGA CCTACTACAT CCTGATCACC TATATGCCGA CGTACATCTC GCAGTTCATG GGGACGTCGT TCTCGAGCGC GTTCGTCGTC AATACGTCGG TCATCGCGAT CAACTTGCTG CTCTGTCCCA TCGTGGGCCT GCTCATCGAC AAGGTGGGAA GGCGGAAATG CCTGATCATC GGCTGTCTCG GGTTCCTTAT CCTGTCTTAT CCGGTGTTCT ACGTGCTGAT CCAGCAGACC AACGCTCTCT TGATGATCGG CTTGCTGGGA GTGCTGATCG TGTTCCAGAC GATTCTCGCC GTCGCCATCG TGGTGGTTTC GGCCGAGGTG TTCCCCACCA AGCTCCGCAA CAGCGGCATC GGCTTCTCCT ACAATATCGC GGCGGCGGTC TTCGGCGGCT TGGCGCCCCT TGCGGCAACG GCGCTCATCG CGGCCACGGG CGATCGCCTG TCCATCACCT ACTTGATGAT AGGTTCCGTC CTCATAACGC TCTTGACCAC GATCTTCCTT TTGAAGGGGT ATTACGTCAA GGGAAAGAAC GCGTCGAGCT AG
|
Protein sequence | MVAATKKKTS LARAVFSGSL GNMLEWFDYG LYGYFAAIIS ADFFVADDPI VGLLLSFLVF GTGFLVRPIG GILIGAYADK HGRIKALTLT ILCMGICTML MGCLPTYSQV GLLAPILLTV LRLLQGLATG GEFGSSLTFI SEYGTPNNRA FLCSWQPFSV GCGLLLGSTA GLLVTTLLPE AALYEWGWRV PFLCGILIAF YGVHMRKNVP DSPEFLKAKA EVKEEDHTPV KDLFLRYKKS IITVIGLLVG SSATYYILIT YMPTYISQFM GTSFSSAFVV NTSVIAINLL LCPIVGLLID KVGRRKCLII GCLGFLILSY PVFYVLIQQT NALLMIGLLG VLIVFQTILA VAIVVVSAEV FPTKLRNSGI GFSYNIAAAV FGGLAPLAAT ALIAATGDRL SITYLMIGSV LITLLTTIFL LKGYYVKGKN ASS
|
| |