Gene Elen_2841 details

Gene Information

Locus tag: Elen_2841
Symbol:
ID: 8417172
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 3296758
End bp: 3298134
Gene Length: 1377 bp
Protein Length: 458 aa
Translation table: 11
GC content: 55%
IMG OID: 645025821
Product: putative transcriptional regulator
Protein accession: YP_003183177
Protein GI: 257792571
COG category: [K] Transcription
COG ID: [COG2865] Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen
TIGRFAM ID:
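
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption that is consistent with the listed gene length) and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3296758
END_BP = 3298134
GENE_LENGTH_BP = 1377
PROTEIN_LENGTH_AA = 458


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3298134 - 3296758 + 1 gives 1377 bp, and 1377 / 3 - 1 gives 458 residues, matching the listed values.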


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGAAG CAGAATTGCG CAGCGACCTC GCAACCGGAG AGACCCCCTC CATCGAGTTC 
AAACGCTGCG GAAACTCGGT TGGAAGAGAC ACGTTCGAGA CCATCTGCTC TTTCGCGAAC
AGCTTCGGGG GCAGCATCTA CCTGGGAGTA GAAGACGATG GAAACGCAAT CGGAATCCCC
GAAGAGAATA TCGTGCCCGT GAAAAGGAAC GTGACCAACG TGGTTCACAA TCCGAACGTT
TTCGACCCGC CGGCCACCCT TGAGTTCGAA GACATCGTTT TCGAGGGCTC GTCGCTTGTT
CGGATATGGA TTCCTCCCAG TCCGTCGATC CACCGATACA AAGGGAGGAT ATTCGAGCGC
ATCGAGGACG CCGACGTTGT CGTGAGCACC GAAAGCCAGC TTACGGCTTT GTGCGTACGC
AAGCAGAACA TCTACACCGA GCAGCGGGTG TTCCCGTATG TGGAAGCAAG TGATTTGCGC
ATGGATTTGC TTCCCGCCAT CCGAACGATG GCCACGGGAA AACGAACCCG ACACCCTTGG
AACGGCATGA CCGACGAAGA CCTCTTGCAT TCGGCAGGGC TTTTCGGCAA GAACTTCGTC
ACGGGCGAAA AAGGATTCAA CCTTGCAGCC GTCTTGTTGT TGGGCGATGC CGACGTCATT
CGGTCGCTGT GCCCTTCCTA TAAAACGGAT GCCGTTGTCA GGATAAGCGA CCAGGATCGA
TACGACGATC GCGTCATCGT CACAAGCAAT CTGATAGAAG CCTTCGACCA ACTCACCGGC
TTTTGCACGA AACATCTGCC GGATCGGTTC CACCTTGAAG GCTCCGTTCG CGTGAGCCCT
CGCGACATCA TCGTTCGCGA GGTAATCTCG AACATGCTCG TGCATCGCGA ATACACGAGC
CCATTTCCTG CAAAACTCAT CATCGACAAT GAAAGGTTGC GCACGGAAAA TGCAAGTCGC
GCCCCTTTCA TGGGTCGAAT CACGCTGAGC GATTTCAATC CTATCCCAAA GAACCCCCTG
ATAGGCGCAT TCTTCAACAA CATCGGGCTG GCGGAAGAGC TGGGTTCCGG CACACGGAAC
CTGTACAAAT ACACCAAAGT CTACTCTGGG GCGGAACCGG TTCTCAACGA GGGGGCGATT
TTCACAACGA CGGTCCCCCT GCACGTCGAG AACGTCGAAC CCGTCGCGGA GCGTCCTCAC
GATATGCTTT CTCTTGCAAG ACAAATCGCC TTGGATCGTG GATATGCAAC GGTTTCCGAT
CTCGAGCGCA AGGGAGTCGC CCGCAGAACC GCTCAGCGCG AACTGGCAGC ACTCGCCCAA
CAGGGGACGC TGCAAGCGAA AGGAAACGGT CGGGCCCGAA AATATTTCCT GCCTTAA
 
Protein sequence
MDEAELRSDL ATGETPSIEF KRCGNSVGRD TFETICSFAN SFGGSIYLGV EDDGNAIGIP 
EENIVPVKRN VTNVVHNPNV FDPPATLEFE DIVFEGSSLV RIWIPPSPSI HRYKGRIFER
IEDADVVVST ESQLTALCVR KQNIYTEQRV FPYVEASDLR MDLLPAIRTM ATGKRTRHPW
NGMTDEDLLH SAGLFGKNFV TGEKGFNLAA VLLLGDADVI RSLCPSYKTD AVVRISDQDR
YDDRVIVTSN LIEAFDQLTG FCTKHLPDRF HLEGSVRVSP RDIIVREVIS NMLVHREYTS
PFPAKLIIDN ERLRTENASR APFMGRITLS DFNPIPKNPL IGAFFNNIGL AEELGSGTRN
LYKYTKVYSG AEPVLNEGAI FTTTVPLHVE NVEPVAERPH DMLSLARQIA LDRGYATVSD
LERKGVARRT AQRELAALAQ QGTLQAKGNG RARKYFLP
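
The relationship between the gene sequence, the protein sequence, and the GC content field can be reproduced with a short script. This is a minimal sketch rather than the database's own method. It assumes the sequences are pasted with the whitespace grouping shown above. It relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard code, so alternative start codons are not handled specially. The helper names (`clean`, `gc_content`, `translate`) are hypothetical.

```python
from itertools import product

# Codon assignments in TCAG order. Translation table 11 uses the same
# codon-to-amino-acid mapping as the standard code ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    if len(s) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(s), 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    prefix = "ATGGACGAAG CAGAATTGCG CAGCGACCTC"
    print(translate(prefix))
    print(round(gc_content(prefix), 1))
```

Passing the full gene sequence to `translate` should yield the protein sequence listed above, and `gc_content` on the same input can be compared with the GC content field.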