Gene Information |
Locus tag | Elen_0661 |
Symbol | |
ID | 8414951 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 838959 |
End bp | 839789 |
Gene Length | 831 bp |
Protein Length | 276 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 645023635 |
Product | extracellular solute-binding protein family 3 |
Protein accession | YP_003181032 |
Protein GI | 257790426 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | |
|
|
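The length fields in the table above agree with the coordinates if Start bp and End bp are read as 1-based, inclusive positions. That reading is an assumption; the record does not state its coordinate convention. The sketch below checks the arithmetic. The helper names `span_length` and `coding_length_aa` are hypothetical and exist only for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 838959
END_BP = 839789
GENE_LENGTH_BP = 831
PROTEIN_LENGTH_AA = 276


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)          # True
print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```

The subtraction of one codon reflects that the gene is unspliced and that its final codon is a stop, which is not counted in the protein length.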
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.126579 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.00355564 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAGAAGA AACTCGTAGC GCTCGCGGCG GCTGTCGCCA CGTTGGCGCT GTCTGCGGTG ATGCTGGCCG GTTGCTCCGG CGGCGGAGAT GCGCAGAGCG CCGACAGCGC GGCCACGGAC GGCTCGTTCA CGCTGGCGGT GGGTTTCGAC CAGGGGTATC CGCCGTACGG CTACGTGGGC GACGACGGCC AGTTCACCGG CTTCGACCTG GAGCTTGCCA AGGCCGTGTG CGAGAAGATG GGCTGGGAGC TGAAGCTCGA GCCTATCGAC TGGGACGCGA AGGACGCGCT CATCGGCAGC GGCACCATCA ACTGCATCTG GAACGGCTTC ACCATGGAGA ACCGCGAGAA CGACTACACG TTCTCTGAGC CGTACATGTA CAACGAGCAG GTGGTGGTCG TGAAGAAGGA CAGCGACGCG AAGAAGCTTG AGGATCTGGC CGGCAAGACG GTGCTGACGC AGGTCGATTC GGCGGCGCTG CACGTGCTGG AGGACGAGAA GGGTCAGAAG GCGCTGGCCG ACACGTTCAA GGAGCTGCAG ACCATCGGCG ACTACAACAA CGCGTTCATG CAGCTTGAGT CCGGCATGGT GGACGCCGTT GCGTGCGACC TGTCCATCGC CAGCTACCAG ATGGCGGCCA AGCCCGACAC GTACGTGAAG CTGGGCGTGC TGGCTCCCGA GAACTACGCG GTGGGCTTCA AGAAGGGCGA CACCGAGCTG GCCAAGCAGG TGACCGACGC CCTCAAGGCG CTTGACGAGG ACGGCACCGT CAAGCAGCTG TGCGACAAGT ACGCCGACCA GGGCATCACC TACGACAACT GGGTGCTGTA A
|
Protein sequence | MKKKLVALAA AVATLALSAV MLAGCSGGGD AQSADSAATD GSFTLAVGFD QGYPPYGYVG DDGQFTGFDL ELAKAVCEKM GWELKLEPID WDAKDALIGS GTINCIWNGF TMENRENDYT FSEPYMYNEQ VVVVKKDSDA KKLEDLAGKT VLTQVDSAAL HVLEDEKGQK ALADTFKELQ TIGDYNNAFM QLESGMVDAV ACDLSIASYQ MAAKPDTYVK LGVLAPENYA VGFKKGDTEL AKQVTDALKA LDEDGTVKQL CDKYADQGIT YDNWVL
|
| |
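The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not the database's own pipeline, of how the GC content and the protein sequence could be recomputed from the gene sequence. The function names are hypothetical. The codon table is the standard layout used in the NCBI genetic code listings; translation table 11 differs from the standard code only in its permitted start codons, and this gene begins with ATG, so the sketch does not treat alternative start codons specially.

```python
# Codon table in NCBI order: first, second, third base each cycle through TCAG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between ten-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
prefix = "ATGAAGAAGA AACTCGTAGC GCTCGCGGCG"
print(translate(prefix))  # MKKKLVALAA
```

Passing the full Gene sequence field to `translate` and `gc_fraction` allows the result to be compared with the Protein sequence, Protein Length, and GC content fields of the record.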