Gene Elen_0144 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_0144 
Symbol 
ID8414428 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp200107 
End bp201246 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content58% 
IMG OID645023124 
Productprotein of unknown function DUF6 transmembrane 
Protein accessionYP_003180527 
Protein GI257789921 
COG category[R] General function prediction only 
COG ID[COG2962] Predicted permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.6072 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGACA ACGCTAACAA GGAAATCAAG ACCGCGACAG CCGATGACGG CTCGACCGGC 
TACGAGATGA AGATCGGTCT TGGCACCATT GCCATGCTGA TCTCGGCGAC GGGCATGGGC
TTGGTGCCCC TGTTCAGCCG CTGGGCAACT CGTACGGACA TGTTCGACGG GGCGCTGGGT
CTGAACGCTG GCGACTCCAT CGGTGCTTTG ATGGCCGTGG GCCGCATGAG CATGGGCGTG
CTGTTCTTCG TCGTCATCAT GTTCGCTACG GGCAAGGTAG AGACGTTCAA GAAACTCAAG
CTGACGCCGG CCATCGCGTT GGGCGGCTTG ATGATCGGCA TGTCGCTGGC GTGCTACGTG
ACGTCTACGC TGTTGACCAC CATCTCGAAC GCTGTTCTGT TCATCTACAT CGGTCCTGTC
GTTTGCGTAG TGCTCGCGCG CATCTTCCGC AAGGAACCCA TGTCTGCTTT ACAGTGGGTA
TGCCTGGTTG CGGTGTTCAT CGGCATGTTG TTCGGCAACA ACCTGATGGG TTTCAACGAG
TCTGGCTTCT TCGTAGACTT CAACCTGGTT CCGTCTACGC CTGAGTTCCC GCAGAAGGGT
CTCGGCGACG CCTTCGGCCT GGCTTCCGGC TTCTTCTACG GCGCTTCGAT GTTCTTCAAC
GGCTACCGTA AGGACGCCGA CACCACGGCT CGTGGTGTGT GGAACTTCAT CTTCGCCGTC
CTGGGCGCTG GTGTTATCAC CGTCGTCCTG AACTCGCTCG GTGCAAACCC CGGCATGGAG
AACTGGGCTC TCAACATCCA CTTCACCGCA TTCAACTGGA TCGGTGCCCT GCTTTTGTGG
GTCATCTGCG GTCCTGTGGC TCTGGGCTTC TTGCTGGTGG CTGGCCGCAA CCTGCCGGCT
GCTGACTACG GCACCATTGC GTACTGGGAA GTTCCCGTGG CCATCTTCGT GGGTCTGGTC
GTGTTCGGCG AGGCCCTGAC GGTTAACACG ATTCTCGGTG GCATTCTCAT CATCGGCGGC
GGCGCTATCC CCTCTATCAA GGGCATGCTT TCCGCTCGCA AGATGAGAAA AGAAGAGGAG
ATTTGCGAGA ATCTTGCCGC TCGCTTGGAA GAGGAAGAAG TCAAGGAGCA CCTGCAGTAG
 
Protein sequence
MADNANKEIK TATADDGSTG YEMKIGLGTI AMLISATGMG LVPLFSRWAT RTDMFDGALG 
LNAGDSIGAL MAVGRMSMGV LFFVVIMFAT GKVETFKKLK LTPAIALGGL MIGMSLACYV
TSTLLTTISN AVLFIYIGPV VCVVLARIFR KEPMSALQWV CLVAVFIGML FGNNLMGFNE
SGFFVDFNLV PSTPEFPQKG LGDAFGLASG FFYGASMFFN GYRKDADTTA RGVWNFIFAV
LGAGVITVVL NSLGANPGME NWALNIHFTA FNWIGALLLW VICGPVALGF LLVAGRNLPA
ADYGTIAYWE VPVAIFVGLV VFGEALTVNT ILGGILIIGG GAIPSIKGML SARKMRKEEE
ICENLAARLE EEEVKEHLQ