Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0734 |
Symbol | |
ID | 8415024 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 924359 |
End bp | 925834 |
Gene Length | 1476 bp |
Protein Length | 491 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 645023705 |
Product | hypothetical protein |
Protein accession | YP_003181102 |
Protein GI | 257790496 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.843476 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAGA ATGTCATAGG GCGCGCGGGC ACGGCTTTCG CGCTTGTCGG CGCTTTGGCG CTTGCGGGTT GCTCGGGTTC TTCGGCGCCC TCGGAGGATG CGGTTTCGTC CGTGCTCGAC CCGTCCGATC CGGTGCAGGT GGAGCTGTGG ACCTATTACA ACGGCACGCA GCAGCAGGCT TTCGAAGACC TCGTCAAGGA CTTCAATGCG ACGAAAGGCA AGGATCTCGG CATCGTGGTG ACCAGCTCCA GCCAAGGCGG CGTCAACGAC CTGGCCTCCG CCGTCACCGA TTCCGCGCAG GAGCTTGTGG GTTCGGAGGC GATGCCCGAC GCGTTCCTGT CGTATTCCGA TACGGCGTCG GTCATAGACG GGTTCGGCAT GGTGGCCGAT CTGTCGGGCT ACCTTACTGA AGAGGAGAAG GCCGGATTCG TGGAGGGCTT TCTGGAAGAG GGCGATCTGA ACGGCAACGG CAGCCTCAAA GTGTTCCCGG TGGGCAAGTC CACCGAGACG CTGCAGATCA ACATGACCGA TTTCCAGACG TTCGCCGATG CCACCGGCAC TTCGCTCGAC GAGATGAGCA CCATTGAGGG CATCGTGAGA GTCGCGGAGC GCTACTACGA GTGGACCGAC GCGCAGACGC CGGGCGTCAT GGGTGACGGC CGTCCGTTCT TCGGACGCGA TGCTATGGCG AACTACCTGA TCACAGGTTC AAAACAGCTG GGTCACGAGA TTTTTGAAAT TGAGAACGGC GTGTGCGCGC TGAACTTCGA CCGCGCTACG ATGAAGACGC TCTGGGACAA CTACTACGTG CCCATGGTGC AGGGCTGGTT CTCGGCGGAG GGCAAGTTCC GTTCGGATGC GGTGAAGACG GGCGATCTCA TCTGCTATGT GGGCTCATCG TCCTCGGTGG TGTACTTCCC GCAAACAGTG ACGGTGGACG ATGCCACGAG CTATCCTATC CAGTTGGACG CGTTGCCCAA CCCGTCTTTC GAGCACGGCA AGCCGTGCTC GCCGCAGCAG GGCGCCGGGT TCGTGGTGAC GAAGTCCGAC GAGAAAAAGG AAACTGCGTG CGTCGAGTTC CTCAAGTGGT TCACCGCTAA AGAGCAGAAC ACCGACTTCT CGGTGAGCGC CGGCTACGTG CCGGTCACCA AGGATGCGCT GACGCTTGAG AACCTCCAGG CGGCGGCCGA GTCGATCGAC GGCGCTTCGG GCAACTATCT GGTGAACCTG CCTGCCACGC TGGATACCAT CGAGGCCGGT GTGTACGCGA ACCCGCCGTT CAAAGGCGGC GTTGAGGCGC GCGCCGTCCT TGACCGCGCG CTGTCGGACA GGGCGGTGGC CGATCGTGCG GCGGTGGTCG AAGCAATGGC GGCGGGAGCA TCGTCCGAAG AGGCTGTGGC TTCGTATCTC GACGACGCAG GGTTCGACGC TTGGCTCGCC GACCTGGAGA CCCAGCTCAG GGAAGCGATC GCTTAA
|
Protein sequence | MKKNVIGRAG TAFALVGALA LAGCSGSSAP SEDAVSSVLD PSDPVQVELW TYYNGTQQQA FEDLVKDFNA TKGKDLGIVV TSSSQGGVND LASAVTDSAQ ELVGSEAMPD AFLSYSDTAS VIDGFGMVAD LSGYLTEEEK AGFVEGFLEE GDLNGNGSLK VFPVGKSTET LQINMTDFQT FADATGTSLD EMSTIEGIVR VAERYYEWTD AQTPGVMGDG RPFFGRDAMA NYLITGSKQL GHEIFEIENG VCALNFDRAT MKTLWDNYYV PMVQGWFSAE GKFRSDAVKT GDLICYVGSS SSVVYFPQTV TVDDATSYPI QLDALPNPSF EHGKPCSPQQ GAGFVVTKSD EKKETACVEF LKWFTAKEQN TDFSVSAGYV PVTKDALTLE NLQAAAESID GASGNYLVNL PATLDTIEAG VYANPPFKGG VEARAVLDRA LSDRAVADRA AVVEAMAAGA SSEEAVASYL DDAGFDAWLA DLETQLREAI A
|
| |