Gene Elen_1650 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1650 
Symbol 
ID8415949 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1951612 
End bp1952604 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content71% 
IMG OID645024619 
ProductTIM-barrel protein, nifR3 family 
Protein accessionYP_003182007 
Protein GI257791401 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0042] tRNA-dihydrouridine synthase 
TIGRFAM ID[TIGR00737] putative TIM-barrel protein, nifR3 family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.464846 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.0220296 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGGATA TGCACGCGTT CTTCCGGAGC CGCCGTCTGC TGCTCGCCCC CATGGCGGGC 
GTGAGCGACG AGGCGTTCCG CACGCTCTGC CGCGAGCAGG GCGCCGACCT CACCTACACC
GAGATGGTGT CGGCCAAGGG TCTTTCCTAC GCGAACGAGA AGACGCGGCA TCTGCTGCGC
CTCGCGCCGG GGGAGGACCA GGTGGCCGTG CAGCTGTTCG GCCACGAGCC CGACACGATG
GCCGCCCAAG CTGCGTGGAT CGAGCGCGAG ATGGGGCAGT CGCTGGCCTA CCTCGACATC
AACATGGGCT GTCCGGCCAG AAAGATCGTG TCGAAGGGCG ACGGCTCGGC GCTCATGAAG
GATCCGGCGC TGGCCGCCTC CATCGTGCGC GCGGTGCGCG CAGCGGTGAG CCGCCCCGTC
ACGGTGAAGT TCCGTCGAGG GTGGGCCGAG GGCGACGAAA CCGCCCCCGA GTTCGCTCGG
AGGATGGAGG ACGCGGGCGC CTGCGCCATA GCCGTGCACG GGCGCTATGC CGAGCAGCTG
TACCGCGGGC GCGCCGAGTG GGGCGCTATC GCGCGGGTGA AGGAGGCCGT GAGCGTGCCC
GTCGTGGGCA ACGGCGACGT GAGAAGCGGC GCCGACGCCG TGGCCATCAC GGAGCGCACC
GGCTGCGACG CCGTGATGAT CGCGCGTGCC GCCGAGGGAA ACCCTTGGCT GTTCGCCCAG
GCCAAGGCTG CGCTTGCGGG CGAGCCGGAG CCTGACGGGC CCACCGTGGA GGAGCGCATC
GCGCTCGCGC GCCGCCACGC GCGGCTGCTC AGCGCGCGTG AAGGCAGGAA CATCGTGCGC
ATGCGCAAGC ACGCTATGTG GTACCTGGCG GGGCTGCCCG GCGCCGCCGC CGCGCGCGGC
AGGATCAACG GCTGCGTGTC CGTGGAGGAT TTCGACGAGG TGTTCGACGA GCTGCTGGCA
TGCGCGAGGG AGCATGCCGC CGCCCACGAG TGA
 
Protein sequence
MEDMHAFFRS RRLLLAPMAG VSDEAFRTLC REQGADLTYT EMVSAKGLSY ANEKTRHLLR 
LAPGEDQVAV QLFGHEPDTM AAQAAWIERE MGQSLAYLDI NMGCPARKIV SKGDGSALMK
DPALAASIVR AVRAAVSRPV TVKFRRGWAE GDETAPEFAR RMEDAGACAI AVHGRYAEQL
YRGRAEWGAI ARVKEAVSVP VVGNGDVRSG ADAVAITERT GCDAVMIARA AEGNPWLFAQ
AKAALAGEPE PDGPTVEERI ALARRHARLL SAREGRNIVR MRKHAMWYLA GLPGAAAARG
RINGCVSVED FDEVFDELLA CAREHAAAHE