Gene Elen_0734 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_0734 
Symbol 
ID8415024 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp924359 
End bp925834 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content62% 
IMG OID645023705 
Producthypothetical protein 
Protein accessionYP_003181102 
Protein GI257790496 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.843476 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGA ATGTCATAGG GCGCGCGGGC ACGGCTTTCG CGCTTGTCGG CGCTTTGGCG 
CTTGCGGGTT GCTCGGGTTC TTCGGCGCCC TCGGAGGATG CGGTTTCGTC CGTGCTCGAC
CCGTCCGATC CGGTGCAGGT GGAGCTGTGG ACCTATTACA ACGGCACGCA GCAGCAGGCT
TTCGAAGACC TCGTCAAGGA CTTCAATGCG ACGAAAGGCA AGGATCTCGG CATCGTGGTG
ACCAGCTCCA GCCAAGGCGG CGTCAACGAC CTGGCCTCCG CCGTCACCGA TTCCGCGCAG
GAGCTTGTGG GTTCGGAGGC GATGCCCGAC GCGTTCCTGT CGTATTCCGA TACGGCGTCG
GTCATAGACG GGTTCGGCAT GGTGGCCGAT CTGTCGGGCT ACCTTACTGA AGAGGAGAAG
GCCGGATTCG TGGAGGGCTT TCTGGAAGAG GGCGATCTGA ACGGCAACGG CAGCCTCAAA
GTGTTCCCGG TGGGCAAGTC CACCGAGACG CTGCAGATCA ACATGACCGA TTTCCAGACG
TTCGCCGATG CCACCGGCAC TTCGCTCGAC GAGATGAGCA CCATTGAGGG CATCGTGAGA
GTCGCGGAGC GCTACTACGA GTGGACCGAC GCGCAGACGC CGGGCGTCAT GGGTGACGGC
CGTCCGTTCT TCGGACGCGA TGCTATGGCG AACTACCTGA TCACAGGTTC AAAACAGCTG
GGTCACGAGA TTTTTGAAAT TGAGAACGGC GTGTGCGCGC TGAACTTCGA CCGCGCTACG
ATGAAGACGC TCTGGGACAA CTACTACGTG CCCATGGTGC AGGGCTGGTT CTCGGCGGAG
GGCAAGTTCC GTTCGGATGC GGTGAAGACG GGCGATCTCA TCTGCTATGT GGGCTCATCG
TCCTCGGTGG TGTACTTCCC GCAAACAGTG ACGGTGGACG ATGCCACGAG CTATCCTATC
CAGTTGGACG CGTTGCCCAA CCCGTCTTTC GAGCACGGCA AGCCGTGCTC GCCGCAGCAG
GGCGCCGGGT TCGTGGTGAC GAAGTCCGAC GAGAAAAAGG AAACTGCGTG CGTCGAGTTC
CTCAAGTGGT TCACCGCTAA AGAGCAGAAC ACCGACTTCT CGGTGAGCGC CGGCTACGTG
CCGGTCACCA AGGATGCGCT GACGCTTGAG AACCTCCAGG CGGCGGCCGA GTCGATCGAC
GGCGCTTCGG GCAACTATCT GGTGAACCTG CCTGCCACGC TGGATACCAT CGAGGCCGGT
GTGTACGCGA ACCCGCCGTT CAAAGGCGGC GTTGAGGCGC GCGCCGTCCT TGACCGCGCG
CTGTCGGACA GGGCGGTGGC CGATCGTGCG GCGGTGGTCG AAGCAATGGC GGCGGGAGCA
TCGTCCGAAG AGGCTGTGGC TTCGTATCTC GACGACGCAG GGTTCGACGC TTGGCTCGCC
GACCTGGAGA CCCAGCTCAG GGAAGCGATC GCTTAA
 
Protein sequence
MKKNVIGRAG TAFALVGALA LAGCSGSSAP SEDAVSSVLD PSDPVQVELW TYYNGTQQQA 
FEDLVKDFNA TKGKDLGIVV TSSSQGGVND LASAVTDSAQ ELVGSEAMPD AFLSYSDTAS
VIDGFGMVAD LSGYLTEEEK AGFVEGFLEE GDLNGNGSLK VFPVGKSTET LQINMTDFQT
FADATGTSLD EMSTIEGIVR VAERYYEWTD AQTPGVMGDG RPFFGRDAMA NYLITGSKQL
GHEIFEIENG VCALNFDRAT MKTLWDNYYV PMVQGWFSAE GKFRSDAVKT GDLICYVGSS
SSVVYFPQTV TVDDATSYPI QLDALPNPSF EHGKPCSPQQ GAGFVVTKSD EKKETACVEF
LKWFTAKEQN TDFSVSAGYV PVTKDALTLE NLQAAAESID GASGNYLVNL PATLDTIEAG
VYANPPFKGG VEARAVLDRA LSDRAVADRA AVVEAMAAGA SSEEAVASYL DDAGFDAWLA
DLETQLREAI A