Gene Elen_0247 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_0247 
Symbol 
ID8414531 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp336788 
End bp338128 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content62% 
IMG OID645023225 
Productprotein of unknown function DUF21 
Protein accessionYP_003180628 
Protein GI257790022 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATATTT GGATTAGCAT CGTCGTCACG TTCGTGCTGG TGCTGGTGAA CGGCTATTTC 
TCGATGTCGG AGATGGCGTT GGTGAACGCG CGCCACGTAC TGCTGCAGCA CGATGCCGAC
GAGGGCGATA AAAGCGCCCA ACGCGCGCTG GGTCTGGCCG CCGATTCGGG GCAGTTCCTG
GCCACCATCC AGGTGGCCAT CACGCTCGTC GGGTTCTTCG CCTCCGCGGC TGCCGCCACG
AACCTCTCCG ATCCGCTGGC GCAGTGGCTG TCCGGCTTCA ACATCGGGTG GCTTTCCGTT
ATCGCGCCCG GTTTGGCCCC CGTGGTCATC ACGCTCATCG TGTCATACCT CAGCATCGTG
GTGGGCGAGC TGGTGCCGAA GCGCATCGCG CTGGCCGATG CCGAGCGCGT CAGCAAGATG
GTGGCCGGAC CGCTCATGGT GTTCCAGAAA ATCGCTTCGC CTTTGGTGGC GTTGACCTCG
GCGTCCGCGA ACGGGCTGTC GCGCCTGTTC GGCATCAAGA ACGCCGACGA GCGCCAGAAC
GTGTCCGAAG AAGAGATCAA GTACATGGTC ACGGACAACG ACGAGCTGCT CGAGGACGAG
AAGCGCATGA TCCACGACAT CCTCGATTTG GGCGACATGA CCGTGCACGA GATCATGACG
CCGCGCGTGG ACGTGATGTT CGCGGAAGAC ACCGACACGG TGCGCCAGAC GGTGGAGCGC
ATGCGCGGCA CGGGCTACTC GCGTCTGCCG GTGTATCACG AGGACATCGA CCGCATCGTG
GGCATCGTCC ACTTCAAGGA CCTCGTGGCG CCGCTCATGG ACGGCAAGGA GCACGAGCCG
GTGGCCGAGT ACGCCTACGA GGCCATGTTC GTGCCCGAGA CGAAGGATCT GTTCCCGCTG
CTCGCCGAGA TGCAAACGAA TCGTCAACAG ATGGCTATCG TCGTTGACGA GTACGGTGGC
ACCGATGGTT TAATTACCGT TGAGGACATC GTAGAGGAGG TCGTCGGCGA GATCGTGGAC
GAGACGGATC GAGAGAATCC GTTCATCGAG CAGGAAAGCG AGAACGTCTG GGTGGTCGAC
GGGCGATTCC CCGTCGAAGA TGCCGCAGAG CTTGGATGGC CGGTGGAGGA TTCGGCCGAC
TACGAGACCA TCGCGGGCTG GCTCATGAGC ATGCTCGACT CGGTGCCCCA GGTGGGCGAG
GAACTTGCGT TCGACGGATA CCGCTTCAAG ATTCAGGCTA TGCGCCGCCG TCGCATTTCG
ACGGTGCGCG TGGAACGACT GGACGATCCC TCCCCATCAT GCGTGGACGC TGTCGAGGCG
ATCGACCGGG AGGAAGCGTG A
 
Protein sequence
MDIWISIVVT FVLVLVNGYF SMSEMALVNA RHVLLQHDAD EGDKSAQRAL GLAADSGQFL 
ATIQVAITLV GFFASAAAAT NLSDPLAQWL SGFNIGWLSV IAPGLAPVVI TLIVSYLSIV
VGELVPKRIA LADAERVSKM VAGPLMVFQK IASPLVALTS ASANGLSRLF GIKNADERQN
VSEEEIKYMV TDNDELLEDE KRMIHDILDL GDMTVHEIMT PRVDVMFAED TDTVRQTVER
MRGTGYSRLP VYHEDIDRIV GIVHFKDLVA PLMDGKEHEP VAEYAYEAMF VPETKDLFPL
LAEMQTNRQQ MAIVVDEYGG TDGLITVEDI VEEVVGEIVD ETDRENPFIE QESENVWVVD
GRFPVEDAAE LGWPVEDSAD YETIAGWLMS MLDSVPQVGE ELAFDGYRFK IQAMRRRRIS
TVRVERLDDP SPSCVDAVEA IDREEA