Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1212 |
Symbol | |
ID | 8415503 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 1454080 |
End bp | 1455087 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 645024175 |
Product | extracellular solute-binding protein family 3 |
Protein accession | YP_003181571 |
Protein GI | 257790965 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000663729 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.0536092 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCGTT ATAAAACGCT CGCGTCCTGC TGCATGGCGG TCGCGTGCAT GGCGGCCCTG CTCGTCGTTC TGACGGGCTG CTCATCGCAG CAGAGCTACA CTCCCCCCGA GAAGACGCCC ACGCTATCCT CGCCGACCAT CGGCAAGGAC GGTACGCTGC GCGTCGGTGT GAACACCGAC AACCAGCCCC TGGCGGGACA GCCTTCCTCC TCGTCCAAAA TCGTCGGCAT CGACGTGGAC GTGGCGGCGG CGCTGGCTGA CAGCTTCGGG CTGAAGCTCG AGGTCGTCAA CGTGGGATCG GATGCCGAAT CGGCTCTCAA AGAGGGAACG GTCGACATCG TCATGGGCAT CGACAAGTCC GACAGCAGCA CCTCGTTCTG GAAGTCTGAC GCGTACCTGC CTACGGCCGT GGCGTTGTTC TCCGCGCCGT CCAACACGCA GGTTCCCACG AACGTCGTCG AGACGAAGAT CGCCGCGCAG GTGTCGTCGA AGAGCGCTTG GGCGGTGACG AACGAATTCG ACAAGGCAAC CTTCTCCACG ACCGACGACC TCAAGAGCGC GTTCGCCGAG CTGGCCTCGG GCCAGGTGCA GTACGTGGCG GCCGATGCCA TCATCGGGAC GTACGCGGCG CACAGCGCGG GCGACGACGT GCATATCGTG GCGCTCATGC AGCAGGCGGG CGGCTACGGC GTGGGCGTGT CGGATGCGAA CACCGATCTC AAGCAAGCGG TCTCCGAAGC CCTCGCCACG CTGACCGGCA ACGGCACCAT CGGCGTCATC GAGACGAAGT GGCTGGGTAC CGCGCTCGAC CTTTCGTCCA CGCCGCTGAC TGCCGGCGCC ACCAAGTCCA CGGACGCGGG CGCGACCGTT GCTTCGAAGG AGCCGAAAGA CGAGAGCGAA GGCGAGAACG CTGACGGGGA CGCTGCTCCT GCCGACGAAG GCACGGGCGC CGGCGACGAG GTGAACGCGG GCGAGAACGC CGTGCAGCCT GGAGACGTCG CTGCTTAG
|
Protein sequence | MKRYKTLASC CMAVACMAAL LVVLTGCSSQ QSYTPPEKTP TLSSPTIGKD GTLRVGVNTD NQPLAGQPSS SSKIVGIDVD VAAALADSFG LKLEVVNVGS DAESALKEGT VDIVMGIDKS DSSTSFWKSD AYLPTAVALF SAPSNTQVPT NVVETKIAAQ VSSKSAWAVT NEFDKATFST TDDLKSAFAE LASGQVQYVA ADAIIGTYAA HSAGDDVHIV ALMQQAGGYG VGVSDANTDL KQAVSEALAT LTGNGTIGVI ETKWLGTALD LSSTPLTAGA TKSTDAGATV ASKEPKDESE GENADGDAAP ADEGTGAGDE VNAGENAVQP GDVAA
|
| |