Gene Elen_2145 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_2145 
Symbol 
ID8416467 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2526464 
End bp2528077 
Gene Length1614 bp 
Protein Length537 aa 
Translation table11 
GC content64% 
IMG OID645025132 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003182497 
Protein GI257791891 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACAGC CTTTGAAAGC AACTTTGTCT CGCCGCACGT TCTTGGCGGG CAGCGCCGTC 
GCGGCGGCTG CAGCAGGCCT GACTCTGGCC GGCTGCGGTG GCGGCGGCGA AACGACGGAC
ACCCCGTCGA CCGATGCCGG CACCGACGCC GGCGCAGCGG CCCAGGGCGG CACGCTGACC
GGCGCCATGG CCTACACGAG CACGAACGTC AACCCGATCG GCAACAGCTC CGCGCTGATG
CTGGCCGCCA CGTGGCATGT GTTCGAGGGC CTGTACGACC TCGATCTGCA CACCTACAAG
ACCTATAACG CGCTTGCCGC CGGCGAGCCC ACGAAGGTCT CCGACACCGA GTACGAGGTT
GCCCTGCGCG ACGGTGCCAA GTTCTCCGAC GGCACGGACG TCACCACCGC CGACGTGGTG
AACGCGTTCG AGAAGAACAT GGCCGACGCC ACCTACGGCG CCTTCCTCGA ATTCATCGAC
ACGGTGTCTG CGAAGGACGA CAAGACCGTC TCCTTCACGC TGAAGTACCC CTTCGACAGC
CTGCTGAAGG GCCGTCTGAG CGTGGTCAAG GTGTTCCCCG CCTCGCTGAC CGAGGATGAT
CTGAAGACGA AGCCGATCGG TTCCGGCCCG TGGGTGTACG ACACCATCAA CGGCGACGAC
GGCGGCTCCA TCGAGTTCGT GCCGAACACG AACTACAACG GCAAGTACGC CGCTACGGCC
GACAAGATGC ACTGGGACAT CCTGCTCGAC GACACCTCCC GCACCACCGC GCTGCAGGAG
GCCACGGTCC AGGTCATGGA GAACGTGCCC GACGCCAACG CCGAACAGCT CATGGCCGCC
GGCGCGTCCG TCGACTACAT CCAGGGCTTC AACCAGCCGT TCTTCATGTT CAACACGCTC
AAGAAGCCGT TCGACGATAA GCGCGTCCGC CAGGCGTTCT ACTATGCCGT GGACGTGGAC
AAGCTGATCT CCAACGCCAT GGCCGGCCAT GCCGCGAAGG TGACGAGCTT CCTGCCCGAG
AGCCACGAGA ACTACCACAA GGCTTCCACG GTGTACACCT ACGACCCCGA GAAGGCCAAG
AGCCTGCTTT CCGAGGCCGG CGTCACCGAC CTGAGCTTCG AGCTGATGAC GAACAACAAC
TGGGTGAAGA ACCTGGCCGC CGGCATCAAG AACGACCTCG ATGCCATCGG CGTGAACTGC
ACCATCAACG AGACGAAGAT CGACTGGGCG TCTCTGGCCG AGTCGGCCGA CGTGCTGCCC
TACGACGTCA TGCTGACCCC GGGCGACCCG ACCTGCTTCG GCAACGACCC CGACCTGCTG
ATGTCCTGGT GGTACGGCGA CAACGTGTGG ACCCAGGGCC GCAGCTGCTG GAAGAAGGCC
GGCGACGGCA AGTTCGACGA GCTGCAGACC CTCATGCAGC AGGCTCGCGA GGCCACCGGC
AACGAGCAGC AGGAGCTGTG GAACAAGTGC TTCGACCTTC TGGCCGAGGA AGTTCCGCTG
TACCCGCTGT TCCACCGCGA GCTGGCCACG GGCTACCAGG AGACGCAGAT CACCGGCTTC
GAGCCCATCG CCACGACGGG CCTCGTGTTC CTCGGTGCGA GCGTGAAGGC GTAA
 
Protein sequence
MEQPLKATLS RRTFLAGSAV AAAAAGLTLA GCGGGGETTD TPSTDAGTDA GAAAQGGTLT 
GAMAYTSTNV NPIGNSSALM LAATWHVFEG LYDLDLHTYK TYNALAAGEP TKVSDTEYEV
ALRDGAKFSD GTDVTTADVV NAFEKNMADA TYGAFLEFID TVSAKDDKTV SFTLKYPFDS
LLKGRLSVVK VFPASLTEDD LKTKPIGSGP WVYDTINGDD GGSIEFVPNT NYNGKYAATA
DKMHWDILLD DTSRTTALQE ATVQVMENVP DANAEQLMAA GASVDYIQGF NQPFFMFNTL
KKPFDDKRVR QAFYYAVDVD KLISNAMAGH AAKVTSFLPE SHENYHKAST VYTYDPEKAK
SLLSEAGVTD LSFELMTNNN WVKNLAAGIK NDLDAIGVNC TINETKIDWA SLAESADVLP
YDVMLTPGDP TCFGNDPDLL MSWWYGDNVW TQGRSCWKKA GDGKFDELQT LMQQAREATG
NEQQELWNKC FDLLAEEVPL YPLFHRELAT GYQETQITGF EPIATTGLVF LGASVKA