Gene Elen_1154 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1154 
Symbol 
ID8415444 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1386763 
End bp1388271 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content68% 
IMG OID645024116 
Productradical SAM family protein 
Protein accessionYP_003181513 
Protein GI257790907 
COG category[R] General function prediction only 
COG ID[COG4277] Predicted DNA-binding protein with the Helix-hairpin-helix motif 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.848409 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCTGG TTGCCAAGCT CGAGATTCTC GCCGACGCTG CTAAGTACGA CGTTGCCTGC 
ACGTCTTCGG GCATCGACCG CGACGCCCAG AAGGGCAAGC TCGGCAACAC GCTGGCCGCC
GGCTGCTGCC ACAGCTTCGC GGCCGACGGG CGCTGCATCA CGCTGCTCAA GGTGCTCATG
ACCAACGTCT GCGTGTACGA TTGCGCCTAC TGCGTGAACC GCGCGTCGAA CGAGGTGCCG
CGCGCCGCGT TCAAGCCGCG CGAGCTGGCC GACCTCACCA TCGCGTTCTA CCGCCGCAAC
TACATCGAGG GCCTGTTCCT CAGCTCAGGC GTCATCCGCA ACCCCGACTA CACCACCGAG
CTCATGATAC AGACGCTGTC CATCCTGCGC GAGGAGCACG GCTTCCGCGG CTACATCCAC
GCGAAAGCGG TGCCCGGCAC CTCGCCCGAG CTCGTGCAGC AGCTGGGGCA CTTGGCCGAC
CGCATGAGCG TGAACATGGA GCTGCCCTCC CAGAAGAGCC TGCAGCTGCT CGCGCCCCAG
AAGGACAAAC AGCGCATCAT CGCGCCCATG CGCCAGATCC GCGACAACAT CGCCGTGGAC
AAGGACACGC GCGCGCTCGT GCGCAAGCAG ACCACCTACA TGAGGCAGAT CCGCCCCAAG
AAGAAGGAGC GCGCCTTCGT GCCGGCCGGG CAGTCCACGC AGATGATCGT AGGAGCCTCG
CCCGAAAGCG ACTTCCAGAT CCTCAACCTG TCGGCCGCGC TCTACCGCAC GCTGTCGCTC
AAGCGCGTGT TCTTCAGCGC CTACACGCCG GTGAACGACG ACAAGCGCCT GCCCGGCACC
GACGCCGTCC AGCTCAACCG CGAGCATCGG CTGTACCAGG CCGACTGGCT GCTGCGCTTC
TACCGCTTCG ACGTCACCGA GATCATCGAC GAGGACAACC CCTTCCTCGA TCCCGACCTC
GACCCGAAGG CGAACTGGGC CATCAACCAC CTGGACTTCT TCCCCGTGGA GGTGAACACC
GCTCCGCTCG AGGCGCTGCT GCGCGTGCCC GGCATCGGCG TGCGCGGAGC GAACCTCATC
GTGCGCGCGC GGCGCACCAC CTGCCTGCGC GAGCCCGAGC TGCGCAAGCT GGGCATCGCG
TACAAGCGCG CCCGCTTCTT CATCACGTGC AGCGGCAGCT ACTCGGGGCG CGGCGTCGAC
TTCTCGCGCG AAGGGCTGCG CGCGCAGCTT GCCGCGCCCA TCAAGGGCGG CAACCACGGG
CGGCGCGCCG ACAAGACCAC ACCGGGTCAG ATGAGCCTGT TCGAGAGCGT CGAGACGCCC
GAGAAGGCCC GCATCGCGGG CGGGTCGGGC GCACGCGCGC TGGAAAGCGG CGATGCGACC
GCAGCGGCGT CTTGCAGCGA TGCGGAGCGC TCCTCGAACG CGGCGTCGTC GTCCGGCCGC
GCCGCGAGCG CCGACGGGAC GTACGGTTGG CAGCGAGCCC TCGAAACGCC GGAAGCCGTG
TGCGCATGA
 
Protein sequence
MDLVAKLEIL ADAAKYDVAC TSSGIDRDAQ KGKLGNTLAA GCCHSFAADG RCITLLKVLM 
TNVCVYDCAY CVNRASNEVP RAAFKPRELA DLTIAFYRRN YIEGLFLSSG VIRNPDYTTE
LMIQTLSILR EEHGFRGYIH AKAVPGTSPE LVQQLGHLAD RMSVNMELPS QKSLQLLAPQ
KDKQRIIAPM RQIRDNIAVD KDTRALVRKQ TTYMRQIRPK KKERAFVPAG QSTQMIVGAS
PESDFQILNL SAALYRTLSL KRVFFSAYTP VNDDKRLPGT DAVQLNREHR LYQADWLLRF
YRFDVTEIID EDNPFLDPDL DPKANWAINH LDFFPVEVNT APLEALLRVP GIGVRGANLI
VRARRTTCLR EPELRKLGIA YKRARFFITC SGSYSGRGVD FSREGLRAQL AAPIKGGNHG
RRADKTTPGQ MSLFESVETP EKARIAGGSG ARALESGDAT AAASCSDAER SSNAASSSGR
AASADGTYGW QRALETPEAV CA