Gene Information |
Locus tag | Elen_2841 |
Symbol | |
ID | 8417172 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 3296758 |
End bp | 3298134 |
Gene Length | 1377 bp |
Protein Length | 458 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 645025821 |
Product | putative transcriptional regulator |
Protein accession | YP_003183177 |
Protein GI | 257792571 |
COG category | [K] Transcription |
COG ID | [COG2865] Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen |
TIGRFAM ID | |
|
|
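The coordinate and length fields above can be cross-checked against each other. The sketch below assumes 1-based, inclusive start and end positions and a single terminal stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database tooling.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(3296758, 3298134)
    print(length)                  # 1377
    print(protein_length(length))  # 458
```

With the values from this record, the computed lengths match the reported Gene Length (1377 bp) and Protein Length (458 aa).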
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGAAG CAGAATTGCG CAGCGACCTC GCAACCGGAG AGACCCCCTC CATCGAGTTC AAACGCTGCG GAAACTCGGT TGGAAGAGAC ACGTTCGAGA CCATCTGCTC TTTCGCGAAC AGCTTCGGGG GCAGCATCTA CCTGGGAGTA GAAGACGATG GAAACGCAAT CGGAATCCCC GAAGAGAATA TCGTGCCCGT GAAAAGGAAC GTGACCAACG TGGTTCACAA TCCGAACGTT TTCGACCCGC CGGCCACCCT TGAGTTCGAA GACATCGTTT TCGAGGGCTC GTCGCTTGTT CGGATATGGA TTCCTCCCAG TCCGTCGATC CACCGATACA AAGGGAGGAT ATTCGAGCGC ATCGAGGACG CCGACGTTGT CGTGAGCACC GAAAGCCAGC TTACGGCTTT GTGCGTACGC AAGCAGAACA TCTACACCGA GCAGCGGGTG TTCCCGTATG TGGAAGCAAG TGATTTGCGC ATGGATTTGC TTCCCGCCAT CCGAACGATG GCCACGGGAA AACGAACCCG ACACCCTTGG AACGGCATGA CCGACGAAGA CCTCTTGCAT TCGGCAGGGC TTTTCGGCAA GAACTTCGTC ACGGGCGAAA AAGGATTCAA CCTTGCAGCC GTCTTGTTGT TGGGCGATGC CGACGTCATT CGGTCGCTGT GCCCTTCCTA TAAAACGGAT GCCGTTGTCA GGATAAGCGA CCAGGATCGA TACGACGATC GCGTCATCGT CACAAGCAAT CTGATAGAAG CCTTCGACCA ACTCACCGGC TTTTGCACGA AACATCTGCC GGATCGGTTC CACCTTGAAG GCTCCGTTCG CGTGAGCCCT CGCGACATCA TCGTTCGCGA GGTAATCTCG AACATGCTCG TGCATCGCGA ATACACGAGC CCATTTCCTG CAAAACTCAT CATCGACAAT GAAAGGTTGC GCACGGAAAA TGCAAGTCGC GCCCCTTTCA TGGGTCGAAT CACGCTGAGC GATTTCAATC CTATCCCAAA GAACCCCCTG ATAGGCGCAT TCTTCAACAA CATCGGGCTG GCGGAAGAGC TGGGTTCCGG CACACGGAAC CTGTACAAAT ACACCAAAGT CTACTCTGGG GCGGAACCGG TTCTCAACGA GGGGGCGATT TTCACAACGA CGGTCCCCCT GCACGTCGAG AACGTCGAAC CCGTCGCGGA GCGTCCTCAC GATATGCTTT CTCTTGCAAG ACAAATCGCC TTGGATCGTG GATATGCAAC GGTTTCCGAT CTCGAGCGCA AGGGAGTCGC CCGCAGAACC GCTCAGCGCG AACTGGCAGC ACTCGCCCAA CAGGGGACGC TGCAAGCGAA AGGAAACGGT CGGGCCCGAA AATATTTCCT GCCTTAA
|
Protein sequence | MDEAELRSDL ATGETPSIEF KRCGNSVGRD TFETICSFAN SFGGSIYLGV EDDGNAIGIP EENIVPVKRN VTNVVHNPNV FDPPATLEFE DIVFEGSSLV RIWIPPSPSI HRYKGRIFER IEDADVVVST ESQLTALCVR KQNIYTEQRV FPYVEASDLR MDLLPAIRTM ATGKRTRHPW NGMTDEDLLH SAGLFGKNFV TGEKGFNLAA VLLLGDADVI RSLCPSYKTD AVVRISDQDR YDDRVIVTSN LIEAFDQLTG FCTKHLPDRF HLEGSVRVSP RDIIVREVIS NMLVHREYTS PFPAKLIIDN ERLRTENASR APFMGRITLS DFNPIPKNPL IGAFFNNIGL AEELGSGTRN LYKYTKVYSG AEPVLNEGAI FTTTVPLHVE NVEPVAERPH DMLSLARQIA LDRGYATVSD LERKGVARRT AQRELAALAQ QGTLQAKGNG RARKYFLP
|
| |
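The GC content and protein sequence fields can in principle be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method. It assumes the sequence is pasted as shown above (blocks of ten bases separated by spaces) and uses the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code. Alternative start codons are not handled, which does not matter here because the gene starts with ATG. All names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 1, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence in this record.
    print(translate("ATGGACGAAG CAGAATTGCG CAGCGACCTC"))  # MDEAELRSDL
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison with the Protein sequence and GC content fields of this record.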