Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1782 |
Symbol | |
ID | 8416086 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2088038 |
End bp | 2089162 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 645024753 |
Product | protein of unknown function UPF0052 and CofD |
Protein accession | YP_003182136 |
Protein GI | 257791530 |
COG category | [S] Function unknown |
COG ID | [COG0391] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01826] conserved hypothetical protein, cofD-related |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTGCCC GTCCGTTCTC CCACGACCCC TCGGCTACGG CGGCGTTCGC CGCACTGCGC GAATCGCAGC CCGTCGTCGC GCGCGACGAG CGCTTGCGCG TCGTCGTCAT CGGCGGCGGC ACCGGCGCGC CCGTGTCCAT ACGCACGCTT TTGTCCATGG GACTGGATAC GAGCGCCGTC GTCGCCATGG CCGACGACGG CGGCTCCACC GGCATCCTGC GCGAGGAAGC CGACGTCACG CCGCCGGGCG ACGTGCGCAA GTGCATCGCC GCAATGGCTG CGGACCCGAA CGACCCTTTG ACCAAGGCGT TCAAGTACCG CTTCTCGTTC GCTCGCAACC ACACGCTGGG CAACCTCATG CTGTCTGCGC TCGAGGATGC GGCCGGGTCG TTTCCCGAGG CCATCTCCAT CTGCGAGCGG CTGCTGGATG CGCGGGGGCA CGTGTATCCC TCCACGCTCG ATCGCGTGAC GCTCACGGCT CGCACGCGCG ACGGCCGCTC CCTCGAGGGG CAGGCCGTGG CCTGCCATTC GCGCACGGCG CTCGAGCGGG TGAGCCTGCG CGCCGCGCAC GAGGTGGTGC CCTACCAGCC GGCGCTCGAG GCTATCCGCG AGGCCGACCT CATCGTTCTG GGCCCGGGCT CGCTGTTCAC GTCCATCATC CCGAACCTGC TCGTGCCCGG CGTGGTGGAT GCGATCCGCG CGTCGAAGGG TTCCACGCTG TTCGTGTGCT CGCTTGCCGA CATGCAGGGG GAGACCTGGG GTCTCACCGC GCGCGAGCAC GTGGAGGCGC TCATGGACCA CGGCACGCGC GGGCTGCTGG ATTACGTGCT GGTGCATACC CCGGTCTCGC TGCGCCCCGA CAGCCCGGCC ACGGGCGTGT TCACCGCTGT GACGGGTGCC GACTCGGAGC ACGCCTCCAC TGCCGACCTC GACGATCTCG TGCTGTCCGG ACGCATCCGC CCCGTGCGCG TATGCTACCA GGACGTGCAG GCCATCCAGG CGCAGGGGCC GGTGGTCATT GCGCGCAACC TCGTCGATCC CATTCATCCA ACCTGGCACG ATCCTGCTGC CTTGCGCGAT GCGTTTGCGG GGGTGTTGAA GCTATGTCGT TCACGGCGGA GGTAA
|
Protein sequence | MVARPFSHDP SATAAFAALR ESQPVVARDE RLRVVVIGGG TGAPVSIRTL LSMGLDTSAV VAMADDGGST GILREEADVT PPGDVRKCIA AMAADPNDPL TKAFKYRFSF ARNHTLGNLM LSALEDAAGS FPEAISICER LLDARGHVYP STLDRVTLTA RTRDGRSLEG QAVACHSRTA LERVSLRAAH EVVPYQPALE AIREADLIVL GPGSLFTSII PNLLVPGVVD AIRASKGSTL FVCSLADMQG ETWGLTAREH VEALMDHGTR GLLDYVLVHT PVSLRPDSPA TGVFTAVTGA DSEHASTADL DDLVLSGRIR PVRVCYQDVQ AIQAQGPVVI ARNLVDPIHP TWHDPAALRD AFAGVLKLCR SRRR
|
| |