Gene Elen_0587 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_0587 
Symbol 
ID8414873 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp743389 
End bp745911 
Gene Length2523 bp 
Protein Length840 aa 
Translation table11 
GC content68% 
IMG OID645023560 
Productpeptidase U32 
Protein accessionYP_003180961 
Protein GI257790355 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.684095 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTCATG CCATCGAACT GCTCGCGCCC GCCGGCAACG CCGCGGCGCT TCGTGCCGCC 
GTGCGCGGCG GGGCCGACGC CGTGTATTTG GGCCTCGACT CGTTCAACGC CCGCCGCGGA
GCCGACAACT TCACGCTTGA GACACTGGCC GATGCCTGCG CCTACGCGCA TCTGCGTGGG
GTTCGCGTGT ACGTGACGTT CAACACCGCC GTGCTGCCTT CCGAGGTGGC CCGTGCGCTC
GAAACGGTGC GCCAGGCGTA TCGCGCTGGG GCGGATGCCT TCATCGTGCA AGACATCGGC
ATCGCATCCG AGATCTCCCG CGCGCTCCCC GAAGCGCGAC TGCATATATC GACGCAGATG
AACACGCATA ACGCGGCCGG CATCGAGGCG GCGGCGAGGC TGGGCGCCCA GCGCGTCACG
CTGGCACGCG AGCTGTCCGT CCTCGAGATC GCCCATCTCG CCGAGGTGGC CGACGGCTAC
GGCATGGAGG TGGAGTCGTT CGCTCACGGC GCGCTGTGCG TGTGCTACTC CGGCCAGTGC
TTCATGTCGT CGCTCATCGG CGGCCGTTCG GCGAACCGCG GGCTGTGCGC CCAGGCGTGC
CGTCTGCCCT ACACGCTGCA CAACGTGGCG CTGCGCAAGA ACCTGCCCGC GCCGGGCGAG
CATCTGCTGT CCCCGCAGGA CCTGTGCGCC ATCGACTTGC TGCCCGAACT GGTGGAAGCG
GGCGTCACGT CGTTCAAGAT CGAGGGTCGC ATGAAGTCGC CCGAGTACGT GTTCGCCGTC
ACGCAGACGT ACCGCGCCGT ACTCGATCGC ACGCTGGCCG AGCGCGCTGC GGGAAGCGGC
AAGGACGTGC GGGCCGCCGA AGACGAGCAT CGCACCCTGG CCGAGGCGTT CTCCCGCGGG
TTCACCACGG CCTACCTGGA GAACCAGCGC GGCAACGACA TCATGAGCTA CGGGCGCCCG
AACAACCGCG GCGTGTTCGT GGGTCGCGTG ACGTCGGCGA AGAACGGCGT CGCCACCGTC
GCGGCCGAGC GGCCGCTCGC ACCGGGCGAC GTGTTGGAGT TCTGGACGAA CAAGGGTCAC
TTCGCCTACA CGCTCGACCA GGTGTCTCTC GACAAGCAAG GCAATGTTCG CATGGCGCCC
GACCGTCCGG TGGGGAAGGG CGACCGCGTG TTCCGCGTCC GCAGCGCCGA AGCGGCGTTT
GAAGACGACG TATTCGAGCC GCGCGTCGCG GTGACGGGTC GCGTGGTGCT GCGCATCGGC
CAGCCGCTGC GCGTAGAGTT CTCGCTGGCG CCGAGCGCTG CAGCGCGTCA TCCTGAGCGC
AGCGAGGCGC AGCCGAGCGA AGTCGAGGGA TCCCGTGCGG CGACAGCCCA TACCCCCCCG
ATCGGCGCAG CCGAAGGCGT TGCGATCGAG CCCGCGCGTA CGAAGCCGGT GACGGTCGAC
GACGTGCGTG CGCATGTCGA TCGCCTGGGT CAAACTCCGT TTCTGCTAGA ATCGCTCGAG
GTCGAACTCG ACGAGGGTGT GGGTATCGGC TTCTCGCAGC TGCACCGCGT GCGCGCCGCC
GCGCTCGACG ACCTTGCGCA GCAGCTGCTC GCGCCCACGC GCAACCGTGC GTTGCCGCGT
GTGCGGGAGC GCACGGCGCT TGCGCCCGCG CGCCCGCGCG GCGTGCGCAT CGCGGCGTGG
GCCACGAATC CTGCCTGCGC ACGCGCGGCG AAGCGGGCAG GCGCTGACAT CATCTACGTG
CCTGCGCTCA ACTACAAGCG CGGCGAGGCC GTGGTGGCGG GCCAACGTTC TGCCACTGCA
GAGCAAGCGG GCTATCCGAA GCAGGCCGTC GTGGCGCTGC CAACTGTTGA ACACGACCAG
GTTCCGGGCA CGCGCGAGGC GGCCATCGAC TTCGATCCTT GGCGCTACGT CAAGCCAGGC
AAGCCTGTGC TCGTGGAGAA TCTGGCCGGC CTCGTGCGCG CCGCCGAGCT GGGATGCGAG
GTCGAAGTAG GGCCGCACAT CCCCATCACG AATCCGCTAT CGCTCGCAGC GGCCGCCGAG
CTGGGAGCGC GCCGGGTGTG GCTGTCGCCC GAGCTCACGC TGGGCCAGAT CGCCGACATC
GCCGAAGATG CACCGGTTGA GCTAGGCCTT ACCATCATCG GCGCCCAAGA GCTCATGGTC
ACCGAGCACT GCCTGCTCAT GAGCCAAGGC CCCTGCGACG AGAACTGCGC CGAGTGTCCG
CGCCGCAAGA GCCCGCATTT TCTGCGCGAT CGCAAGGACT ACGAGTTCCC GGTCGTCACC
GACGCGCTCG GCCGCAGTCA CTTGTTCAAC GGTGTTCAGC TCGATGTGGC CCAGACCCTG
CCCGACCTCA TCCATGCGGG CGTCACGTCG TTCCTGGTGG ATACCACGCT CATGAACGTC
GAGGAGACCA CGAAGGCCGT GCAGCGCGCC GTCCGAGCGC GCAACGTGGC CCACGCCGAC
GGCAACGCCA TCGCCAAAAC CCCCGGCACC ACCAGCGGTC ATCTATTCAG GGGCGTTTCC
TAG
 
Protein sequence
MSHAIELLAP AGNAAALRAA VRGGADAVYL GLDSFNARRG ADNFTLETLA DACAYAHLRG 
VRVYVTFNTA VLPSEVARAL ETVRQAYRAG ADAFIVQDIG IASEISRALP EARLHISTQM
NTHNAAGIEA AARLGAQRVT LARELSVLEI AHLAEVADGY GMEVESFAHG ALCVCYSGQC
FMSSLIGGRS ANRGLCAQAC RLPYTLHNVA LRKNLPAPGE HLLSPQDLCA IDLLPELVEA
GVTSFKIEGR MKSPEYVFAV TQTYRAVLDR TLAERAAGSG KDVRAAEDEH RTLAEAFSRG
FTTAYLENQR GNDIMSYGRP NNRGVFVGRV TSAKNGVATV AAERPLAPGD VLEFWTNKGH
FAYTLDQVSL DKQGNVRMAP DRPVGKGDRV FRVRSAEAAF EDDVFEPRVA VTGRVVLRIG
QPLRVEFSLA PSAAARHPER SEAQPSEVEG SRAATAHTPP IGAAEGVAIE PARTKPVTVD
DVRAHVDRLG QTPFLLESLE VELDEGVGIG FSQLHRVRAA ALDDLAQQLL APTRNRALPR
VRERTALAPA RPRGVRIAAW ATNPACARAA KRAGADIIYV PALNYKRGEA VVAGQRSATA
EQAGYPKQAV VALPTVEHDQ VPGTREAAID FDPWRYVKPG KPVLVENLAG LVRAAELGCE
VEVGPHIPIT NPLSLAAAAE LGARRVWLSP ELTLGQIADI AEDAPVELGL TIIGAQELMV
TEHCLLMSQG PCDENCAECP RRKSPHFLRD RKDYEFPVVT DALGRSHLFN GVQLDVAQTL
PDLIHAGVTS FLVDTTLMNV EETTKAVQRA VRARNVAHAD GNAIAKTPGT TSGHLFRGVS