Gene Elen_0666 details

Gene Information

Locus tag: Elen_0666
Symbol:
ID: 8414956
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 842835
End bp: 843866
Gene Length: 1032 bp
Protein Length: 343 aa
Translation table: 11
GC content: 67%
IMG OID: 645023640
Product: short-chain dehydrogenase/reductase SDR
Protein accession: YP_003181037
Protein GI: 257790431
COG category: [R] General function prediction only
COG ID: [COG0300] Short-chain dehydrogenases of various substrate specificities
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.019294
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.000464867
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCACAAA CCGCATTGGT CACCGGCGCG ACGAGCGGCA TCGGAGAGGC CCTCTGCCTG 
CTGTTCGCCA GCGACGACTT CGATCTGGTC ACCGTCGCAC GCGACAAGGA GGCTCTGGAA
AAGCAAGCCG ACGAGCTGCG CAGGCTCGAC ATCGATGTGC TTCCCATCGC CTGCGACCTC
TCCGACCCCG ACGCTGCGCG CACCATCTTC AAGCGCGTCC AGGAGGCGGG CAAGGAGATC
GAGATCCTCG TCAACGACGC GGGCTACAGC CCCGCCGGCC AGTTCTCCGA CCTGCCCATC
GCCGACATCC GCTCGATGAT CCAGGTCAGC GTCACCAGCC TCGCCGAGCT GACCAGCGTG
TTCCTGCATC CCATGCTGGA GCGCGGCCAC GGCCGCATCC TCAACATGAG CTCGATGATG
GCGAAAACGC CGTGCCCCTA CAACGCGCTG TACGGCGCGG CGAAGGTGTT CGTGCTGTCG
TTCTCAACCG CGCTGGCGCG CGAGCTCAAG CACACCGGCG TGTCGGTGAC CACCGTCTGC
CCCGGCGCCA CGCGCACGAA CTTCCCGAAG AACGCCGGCA TCGAGGACGC GCCCGCGTGG
AAGTACTTCT CCATGGATAC CGACGAGACG GCCATCCGCG TGTACCGCGC GCTCATGCGC
GGCGAGCGCT GCGCCGTGAC GGGCTGGTAC AACAAAGTGG GTTCGATCTC GGTGCGGCTC
ATGCCCATGG GCATGCAGCT GCTGGCCGGC GAGTGGCTGA TGGGCGCGCG CAAGCATCCG
CTCGACCACG AGGGAACCGA AGAGGGCTCG CACGAGGAGC ATCCGCACGG CCGGAAGCAG
GGTCAAGCCG GGACGCAGGC CTGCGGATGC GGCCATGCGC ACGATCACGG GGCCGGTCAC
GAAGACGGGC ATCCTCGCGG ACGCGAGGAC GATCAGGCTC GGAGGCACGA CCACCATCAG
CACGGCCAGC GCGAACACGG CGGCATGGGC GGCACCGCGC CTGTCTGCCG CATCAGCTGG
GATCGCTGGT AG
 
Protein sequence
MAQTALVTGA TSGIGEALCL LFASDDFDLV TVARDKEALE KQADELRRLD IDVLPIACDL 
SDPDAARTIF KRVQEAGKEI EILVNDAGYS PAGQFSDLPI ADIRSMIQVS VTSLAELTSV
FLHPMLERGH GRILNMSSMM AKTPCPYNAL YGAAKVFVLS FSTALARELK HTGVSVTTVC
PGATRTNFPK NAGIEDAPAW KYFSMDTDET AIRVYRALMR GERCAVTGWY NKVGSISVRL
MPMGMQLLAG EWLMGARKHP LDHEGTEEGS HEEHPHGRKQ GQAGTQACGC GHAHDHGAGH
EDGHPRGRED DQARRHDHHQ HGQREHGGMG GTAPVCRISW DRW
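
The length fields in this record can be cross-checked against the coordinates and sequences listed above. The following is a minimal sketch, not part of the source database: the helper names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
def clean_sequence(text: str) -> str:
    """Join a sequence printed in space-separated blocks over several lines."""
    return "".join(text.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in seq)
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    length = gene_length(842835, 843866)
    print(length)                  # 1032, matching the Gene Length field
    print(protein_length(length))  # 343, matching the Protein Length field
```

Passing the full gene sequence text to gc_percent gives a value that can be compared with the GC content field, and len(clean_sequence(...)) gives the sequence length for comparison with the Gene Length and Protein Length fields.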