Gene Elen_1057 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1057 
Symbol 
ID8415347 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1280564 
End bp1281688 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content66% 
IMG OID645024020 
Productaldo/keto reductase 
Protein accessionYP_003181417 
Protein GI257790811 
COG category[R] General function prediction only 
COG ID[COG1453] Predicted oxidoreductases of the aldo/keto reductase family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.000286681 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGACGAGA TCGGGAAGCT GGGCTTCGGG TTCATGAGGC TGCCCGTCGT CGAGGGCGAC 
GGCGGCAAGG AGATCGACAT CGAGCAGGTC AAGCATATGG TAGACCTGTT CATGGACGCG
GGCTTCACCT ATTTCGACAC GGCGCGCGGC TACCATAACG GCCGGTCGGA GGCGGCGCTG
CGCGAGGCCG TCGTCGAGCG CTATCCGCGC GAATCGTTCC AGGTTGCCAC GAAGCTGCCG
GCGTGGCTGG CGAAAAACGC CGATCATGCG CGCGCCATGT TCGACAAGTC GCTGCGCGAG
ACGGGGGCGG GTTATTTCGA CTTCTTCTTG CTGCATAACC TGGGCGAGGA GCGCACGCGC
CTGTTCGACG ACTTCGGCCT GTGGGACTTC CTGCACGAGA AGAAGGAGGC GGGTCTGATC
CGCAACCTGG GCTTCTCCAT CCACGACAAG GCTGCCGTGT TGGAAGAGGT GCTGGAGGCG
CATCCCGAGG TGGACTTCGT GCAGCTGCAG ATCAATTACG CCGACTGGGA GAGCGAAACC
GTCGAATCGC GCAAGTGCTA CGAGGCTGCG CGCGCCCACG GGCTGCCTGT CGTGGTGATG
GAGCCCGTTA AGGGCGGCTC GCTCGTGCAT CTGCCCGAGG AGGCGGCGGA CGTGCTGCGC
GCCGTGAATC CGGACGAGTC GCTGCCCTCG TGGGCGCTGC GGTTCGCCGC GTCGCTGCCG
GGCGTGCTCA CCGTGCTGTC GGGTATGTCC ACACCCGATC AGGTGCGCGA TAACACGCAG
ATCATGCGTT CGTTCCATCC GCTGGCGCAC GAGGAGGACG AGGCGCTGGC GCGCGTGCGG
GCCATCCTCG ACGGCGTGCC TACCGTGCCG TGTACCGACT GCCGCTACTG TTTGAAGAAC
TGCCCGAAGG GCGTGCGCAT CCCGGCGGCC CTCGCGTCGC TCAACATCCT AGAGCTGTTC
CACGACATGC ACCGGGCGCA GGAGAACTAC GATTGGAACG CGTCGAGCGG CCCCGCGTCC
ACGTGCGTCG GGTGCGGCGC GTGCGAAAGC GTGTGCCCGC AGCATATCGA GATCGTGAAG
GAGCTGGGTC GCGCCGCCGA GCTGTTCGAG AAGAAGCCGG CATGA
 
Protein sequence
MDEIGKLGFG FMRLPVVEGD GGKEIDIEQV KHMVDLFMDA GFTYFDTARG YHNGRSEAAL 
REAVVERYPR ESFQVATKLP AWLAKNADHA RAMFDKSLRE TGAGYFDFFL LHNLGEERTR
LFDDFGLWDF LHEKKEAGLI RNLGFSIHDK AAVLEEVLEA HPEVDFVQLQ INYADWESET
VESRKCYEAA RAHGLPVVVM EPVKGGSLVH LPEEAADVLR AVNPDESLPS WALRFAASLP
GVLTVLSGMS TPDQVRDNTQ IMRSFHPLAH EEDEALARVR AILDGVPTVP CTDCRYCLKN
CPKGVRIPAA LASLNILELF HDMHRAQENY DWNASSGPAS TCVGCGACES VCPQHIEIVK
ELGRAAELFE KKPA