Gene Elen_3065 details


Gene Information

Locus tag: Elen_3065
Symbol:
ID: 8417400
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 3563237
End bp: 3564346
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 67%
IMG OID: 645026045
Product: protein of unknown function DUF917
Protein accession: YP_003183397
Protein GI: 257792791
COG category: [S] Function unknown
COG ID: [COG3535] Uncharacterized conserved protein
TIGRFAM ID:
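
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS length includes one terminal stop codon; the record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3563237
END_BP = 3564346
GENE_LENGTH_BP = 1110
PROTEIN_LENGTH_AA = 369


def span_length(start, end):
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Number of codons minus one, assuming the CDS ends with a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1110
print(expected_protein_length(GENE_LENGTH_BP))  # 369
```

Under those assumptions both computed values agree with the Gene Length and Protein Length fields.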


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAAGGA AAATCGGCAT CAAAGAGATC GAGGACATGG CGCTTGGCGC GACGGTTCTC 
GGCGCCGGCG GGGGCGGCGA CCCGTACGTC GGCAAGCTCA TGGCCATCGA GGCCATCAAG
AAGTACGGCG AGGTGGAGCT CATCTCGCCC GACGAAGTTC CTGACGACGC CGTGGTGTGC
GTGTCCCAGA TGATGGGCGC CCCCACCATC ATGGTGGAGA AGATCTGCAG CGGCCTGGAG
CCCATGGCCA CGTACGACGA GCTGGTGAAG GAGCTGGGCC AGGAGCCGTA CGCCATCTAC
GCGGTGGAAG CCGGCGGCGT GAACTCCACC ATCCCGTTCA TCCTGGCGGC CACGCGCCGC
ATCCCCGTGG TGGACTGCGA TCTCATGGGC CGCGCGTTCC CCGAGCTGCA GATGACCACG
CTGGGCATCA ACGGCGTGAA GGGACAGCCC GCCGTCATGG CCGATGAGAA GGGCAACACG
GTCACGGTGC GCGCGATCGA CGACAAGTGG CTCGAGCGCA TCTCGCGCCA GGCCACGTCG
GTGATGGGCG GTTACACCAT CCTGGCGTCG TATCCGTGCA CGGGGCGCCA GCTCAAGGAC
TACTGCATCC CCGACACGCC TACGCTGTGC GAGGAGATCG GCCGCACGCT GCGCGAGGCG
CGCGAGCAGC ATGCCGACCC CATCGAGGCC GTGCTGAACG TGACGAACGG GTTTCGCCTG
TTCCGCGGCA AAGTGGTGGA CGTCGAGCGC AAGACCGACG GCATGTTCGT GCGCGGTCGC
GCCGTGGTGG ACGGGCTCGA CCAGGACAAG GGCAGCCAGC TTATCATCGA GTTCCAGAAC
GAGAACCTCA TCGCGCTGCG CGACGGCCAG CCGGTGACCA CGTCGCCCGA CCTCATCATG
TCGCTGGACA TGGAGTCCGG CTCGCCCGTG ACTACCGAGG GCCTGAAGTA CGGCGCTCGC
ATTGTGGTGG TGGGCATGCC CTGCGCGCCG CAGTGGCGCA CGCCCGAGGG CCTGGCCGTG
GTAGGGCCGC GCGCGTTCGG CTACGACATC GACTACGTGC CGGTTGAGCA GCGCGTCGCC
GCGATGAACA ACGAGGAGGT GCAGGCGTAA
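
The GC content field of this record can be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method: `gc_content` is a hypothetical helper, and it assumes the sequence is pasted as printed here, in space-separated blocks of ten bases.

```python
def gc_content(seq):
    """Percentage of G and C bases; spaces and line breaks are ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# Example on the first row of the gene sequence only; paste all rows
# to compare against the GC content field of the record.
first_row = "ATGAGAAGGA AAATCGGCAT CAAAGAGATC GAGGACATGG CGCTTGGCGC GACGGTTCTC"
print(round(gc_content(first_row), 1))
```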
 
Protein sequence
MRRKIGIKEI EDMALGATVL GAGGGGDPYV GKLMAIEAIK KYGEVELISP DEVPDDAVVC 
VSQMMGAPTI MVEKICSGLE PMATYDELVK ELGQEPYAIY AVEAGGVNST IPFILAATRR
IPVVDCDLMG RAFPELQMTT LGINGVKGQP AVMADEKGNT VTVRAIDDKW LERISRQATS
VMGGYTILAS YPCTGRQLKD YCIPDTPTLC EEIGRTLREA REQHADPIEA VLNVTNGFRL
FRGKVVDVER KTDGMFVRGR AVVDGLDQDK GSQLIIEFQN ENLIALRDGQ PVTTSPDLIM
SLDMESGSPV TTEGLKYGAR IVVVGMPCAP QWRTPEGLAV VGPRAFGYDI DYVPVEQRVA
AMNNEEVQA
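
The protein sequence can be reproduced from the gene sequence using translation table 11. The sketch below builds the codon table from the standard compact amino-acid string in T, C, A, G base order, which table 11 shares with the standard code for every codon. Table 11 differs only in its alternative start codons, which this sketch does not handle; that has no effect here because the gene starts with ATG. The `translate` helper is illustrative and raises `KeyError` on ambiguous bases.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG (third base varies fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the gene sequence above.
print(translate("ATGAGAAGGA AAATCGGCAT C"))  # MRRKIGI
```

The printed prefix matches the start of the protein sequence above; passing the full gene sequence should yield the full protein if the record is internally consistent.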