Gene Elen_1966 details

Gene Information

Locus tag: Elen_1966
Symbol:
ID: 8416277
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 2303345
End bp: 2304496
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table: 11
GC content: 69%
IMG OID: 645024943
Product: peptidase M42 family protein
Protein accession: YP_003182319
Protein GI: 257791713
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1363] Cellulase M and related proteins
TIGRFAM ID:
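
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. Neither assumption is stated in the record, and the variable names are made up for this example.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2303345
end_bp = 2304496
gene_length_bp = 1152
protein_length_aa = 383

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = span_bp // 3
expected_protein_aa = codons - 1

print("span (bp):", span_bp)
print("span matches Gene Length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("codons minus stop matches Protein Length:", expected_protein_aa == protein_length_aa)
```

Under these assumptions the three fields agree with one another: 1152 bp is 384 codons, and dropping the stop codon leaves 383 residues.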


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAACA AGCAGGTGAA ATTCCTCAAG CAACTGCTCG AGACCCCGTC GGCCACCGGC 
ACCGAGATCG CCGTGGCCGA ACTGGTGCGC GAGCGCCTGG CCGGCACGGC CGACGAGATC
CAGACCGACG TCATGGGCTC GGTCCATGCC CGCCTGTCGG GCACGGGCGT GGCCCCGTCG
CTCATGCTGT CGGCCCACAT GGACGAGATC GGCCTCATGG TCACGTACAT CTCCGACGAG
GGCTTCCTGT CCGTGTCGTC CGTCGGCGGC GTGGACGCGG CCATCCTCCC GGGCATGCGC
GTGGACGTGC ACGCGTCGAA CTCCTTCGAG CCGCTGCGCG GCGTGGTGGG CCGCAAGCCC
ATCCACCTCA TCGAGCCCGA CGAGCGCAAG AACGTCACGC CCATCGACAA GCTGGTCATC
GACCTCGGCA TGCCGGCGAA GCGTGTGAAG AAGCTCGTCA TGGTGGGCGA CGTCATCACC
TTCGGCGTGG GCTTCGAGCG CTTCGGCAAG AACATGGCTG TCTCGCGCGC CTTCGACGAC
AAGGCGGGCG TGTGGGTCGC TGTGCGCGTG CTGGAGACGC TCGCTAAGGA GGGCCGCGCG
CCCGGCGACT TCATCGTGGC CGCCACCGTT CAGGAGGAGA TCGGCACGCG CGGCGCCATT
ACGTCCGCGT ACGGCCTGGA CCCCGATGTG GCCATCGCGT TCGACGTGAC GCACGCCACC
GACTATCCCG GCATCGACAA GACGAAGTAC GGAAAGATCG TCTGCGGCGA GGGTCCCGTT
ATCGCGCGCG GCCCCAACAT CAACCCGGCT GTGTTCGAGC GCCTCGTGGC GGCGGCCGAG
GCCGAGGGTC TGCCGTACCA GATCGAGGCC GAGCCCGGCG TCACCGGCAC CGACGCCCGT
TCCATCCAGA TCTCCCGCGG CGGCGTCCCC ACCGGCCTCG TGTCGGTGCC CCTGCGCTAC
ATGCATACGC CCACCGAGGT GGTCAGCCTC GACGACCTCG ACGCAACGGT GAAGCTGCTC
GCCCGTTTCG CGCGCGACCT GGACGAGGAC GCCTGCTTCG TGCCGGGCAT GGGCGACGCG
GTGACCGAGG GCGACGGCGC TGGCGCCGAG TCCGCTGCGC AGATGCAGTT CGACGAGACG
GGCGTGGAGT AG
 
Protein sequence
MKNKQVKFLK QLLETPSATG TEIAVAELVR ERLAGTADEI QTDVMGSVHA RLSGTGVAPS 
LMLSAHMDEI GLMVTYISDE GFLSVSSVGG VDAAILPGMR VDVHASNSFE PLRGVVGRKP
IHLIEPDERK NVTPIDKLVI DLGMPAKRVK KLVMVGDVIT FGVGFERFGK NMAVSRAFDD
KAGVWVAVRV LETLAKEGRA PGDFIVAATV QEEIGTRGAI TSAYGLDPDV AIAFDVTHAT
DYPGIDKTKY GKIVCGEGPV IARGPNINPA VFERLVAAAE AEGLPYQIEA EPGVTGTDAR
SIQISRGGVP TGLVSVPLRY MHTPTEVVSL DDLDATVKLL ARFARDLDED ACFVPGMGDA
VTEGDGAGAE SAAQMQFDET GVE
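
The gene and protein sequences above are printed in space-separated blocks of ten characters, and the record lists translation table 11. As a minimal sketch (not the database's own pipeline), the helpers below strip that layout, translate a coding sequence, and compute GC content. The function names are hypothetical. The codon-to-amino-acid assignments used here are those of the standard genetic code, which table 11 shares; table 11 differs only in which codons may act as start codons, and that does not matter for this gene because it begins with ATG.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (amino-acid assignments
# shared by the standard code and translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the page layout."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First thirty bases of the gene sequence, pasted with the page's spacing.
print(translate("ATGAAGAACA AGCAGGTGAA ATTCCTCAAG"))  # MKNKQVKFLK
```

Pasting the full gene sequence into `translate` and comparing the result with `clean` applied to the protein sequence is one way to confirm that the two listings correspond; `gc_percent` on the full gene sequence can be compared against the GC content field.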