Gene Elen_2966 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_2966 
Symbol 
ID8417298 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp3440924 
End bp3442243 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content62% 
IMG OID645025943 
Productprotein of unknown function DUF21 
Protein accessionYP_003183298 
Protein GI257792692 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCATAG CACTCAGTTT GCTGCTCGTT CTGTTTTTCC TGCTCATGAA TGCGTTCTTC 
GTCGCCGCCG AGTTCTCGCT CGTGCGCGTA CGCAAGTCGC AAGTCGAAAT CCTCGTGGAC
GAGGGTCGCA AGGGCGCCAA GTACACCAAG CTCGTCGCCG ACAACGTCAA CGCCTACCTG
TCAGCCTGCC AGCTGGGCAT CACCCTTGCC TCGCTCGCCC TCGGCTGGTT GGGCGAGCCT
GCGGTGTCAG CCCTGTTCGA ACCGCTGTTC AAGGCACTCA ACGTGCCCGA GGCGGCCACG
CACGGCATCT CCATCGTCAT CGGTTTCGTC ATCATCACCG CGTTGCACAT CGTGGTGGGC
GAGCTCATCC CGAAGTCGCT GGCCATCTTC TCCACCGAGC GCTACGCCCT GTTCACGGCC
ACGCCGCTCG TGTGGTTCTA CCGCATCACG TACCCCGTCA TGTGGCTGTT CAACAGCATT
ACGAACGGCG TCATGAAGAT GCTGGGCCAC GACGTGGCCA ACGAACACGA GGTGTACACC
GACGAGGAGA TCAAGCTGCT CATCGACGAG AGCACCGAAA GCGGACTCAT CGACCCCGAG
CAGAACGAAT ACGTGGACAA CATCTTCGAC CTGGGCGACA AAGACGCCGA GGCCATCATG
ACGCCGCGCA CCGATGTGGT GTGCATCGAC CTCGACGATC CGCTGGAGGA GAGTCTCCAG
ACCGTGTTGC AGTACAAGTA CACGCGCTAC CCGGTGTGCC GCGGCAGCAA GGACCGCATC
GTCGGCTTCG TGCACGTGAA GGACCTCTAC ACGATGCCCA AGGACGCGAC GGTCGACGAC
CTGCGCGTTC GCATGATCCA GGCCGTACCC GAAGGCGTGC CCATCGCGAA GTTGCTGCAA
ACGCTGCAGG AGAAGCGCAC GAAAATCGCC GTGGTCATCG ACGAGCATGG CGGCACGGCC
GGCATCGTTA CGATGAGCGA CATCATGGAG CAGATCGTCG GCCGCATCGA CGACGAGTAC
GCGCATGGCG GCTCGGACGA GATCGTGCAG TTGGACGATG GCAGCTACCT CATCGACGGC
TCGCTTCCCA TCGACGAGGT GGGCGAGCTC ATCGGTTTCG AGCCTCTCGA GTCCGAGGAA
TGCGAGACGG CGGGCGGCCT GCTGCTCACC GTGTTCGACC GTATCCCCGA CGAGGGCGAT
TCCGTGACCA TCGAGGACGG CGACGACAGG GCCACGTTCA CCGTAGTCGA CATGGACCGC
CACCGCATCG ACAAGATTCG GGTGGTGCTC GAGCACGCTC CGGAAAGCGA CGAAAGCTAA
 
Protein sequence
MPIALSLLLV LFFLLMNAFF VAAEFSLVRV RKSQVEILVD EGRKGAKYTK LVADNVNAYL 
SACQLGITLA SLALGWLGEP AVSALFEPLF KALNVPEAAT HGISIVIGFV IITALHIVVG
ELIPKSLAIF STERYALFTA TPLVWFYRIT YPVMWLFNSI TNGVMKMLGH DVANEHEVYT
DEEIKLLIDE STESGLIDPE QNEYVDNIFD LGDKDAEAIM TPRTDVVCID LDDPLEESLQ
TVLQYKYTRY PVCRGSKDRI VGFVHVKDLY TMPKDATVDD LRVRMIQAVP EGVPIAKLLQ
TLQEKRTKIA VVIDEHGGTA GIVTMSDIME QIVGRIDDEY AHGGSDEIVQ LDDGSYLIDG
SLPIDEVGEL IGFEPLESEE CETAGGLLLT VFDRIPDEGD SVTIEDGDDR ATFTVVDMDR
HRIDKIRVVL EHAPESDES