Gene Elen_2232 details


Gene Information

Locus tag: Elen_2232
Symbol:
ID: 8416555
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 2621109
End bp: 2622365
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 70%
IMG OID: 645025218
Product: peptidase U32
Protein accession: YP_003182582
Protein GI: 257791976
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0826] Collagenase and related proteases
TIGRFAM ID:
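
The start, end, gene length, and protein length fields in this record agree with each other, which a little arithmetic confirms. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon (the record states neither explicitly; the variable names are illustrative only):

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2621109
end_bp = 2622365
gene_length_bp = 1257
protein_length_aa = 418

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
codons = span_bp // 3
expected_aa = codons - 1

print(span_bp, span_bp % 3, expected_aa)  # 1257 0 418
```

Under those assumptions the span equals the listed gene length, divides evenly into codons, and gives the listed protein length.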


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.138304
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.83326
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGAACTC AATCGACAAG GACTCCCGAG CTCCTCGCGC CCGCGGGCGG GCTTGCGCAG 
CTGGAGGCCG CGCTGCGCTT CGGTGCGGAC GCCGTGTACC TGGCCGCCGA TCGTTTCGGG
CTGCGGCAGC GCGCGGCGAA CTTCGCGCTG TACGACGTTC CCGCTGCGGC AGCTCGCGCG
CACGATGCGG GCGCGAAGGC GTACGCGACG CTGAACGCCC TCATGGACGC CGACGACCTC
AAGGCGCTTC CCGCGTACCT CGAAGCGCTG GCCGCGGCCG GCGTCGACGC GTTCATCGTG
AGCGACCTGG GCGCGCTGCG CCTGGCACAG CGGCACGCGC CGAACGTCGA GCTGCACGTG
AGCACCCAGG CCTCGGTATG CAACGCCGAG GCGGCGCGCG TATGGCACGA GCTGGGCGCG
AGCCGCGTGG TGTGCGCGCG AGAGATGAGC GTGGAGGACA TCGCGCGACT GCGCGCCGGC
GCCCCGCGCG AGCTGGAGCT GGAGGCGTTC GTGCACGGCG CCATGTGCAT GGCCGTGTCG
GGCCGCTGCC TGATCAGCGC CGCGCTCACC GGCCGCTCCG GCAACAAGGG CCATTGCACC
CAGCCGTGCC GGTGGAGCTA CGCGCTGGTG GAGGAGCAGC GTCCCGGCGA GTTCTTTCCC
GTGGAGGAGG ACGTGCGCGG AACCTATGTC ATGAACGCGC AGGACCTCAA CATGCTGGCG
CACCTCGACG ACTTGGCCGC GGCCGGCATC GACTCGTTCA AGATCGAAGG CCGCAACAAG
AAGGCGTTCT ACGTGGCTTC GGTGGTGCGC GCTTACCGGC TGGCCCTGGA CGGCGTTCCC
TCCTCCGAGC TGGCCGACGA GCTGCTGGCC GTGTCGCATC GCCCGTACGG CACGGGCTTC
TACTACGGCG ACGCCAGGCA ATCGCCCGAC GTGGACGGCT ACACCGCCGA ATGCCGGCAT
GCCGCCACGG TGGAAGCGTG CGAACCGGCC GGCGAAGGCG CGTTCCGCGT GATCGCGCGG
TGCTACAACC GCTTCTGCGA AGGCGACGAG CTGGAGGCGC TGTCGCCGGG TCCGCACGTC
CCTCGCGTGC GCGTGCGTAA CCTCGCCTGG CTCCCCGAGC CCGACGGGGA CGACGCGCAG
CCAAAGCGGG TGCCGGTTGC CGTGGCGAAC CGCTCGGCCG AGCGCTATGC GTTCGAAACG
GGGGAGGAGC TGGCTCCCGG CGACTTTCTG CGCATGCGTA TCAACGTTGA GCGATAG
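
The gene sequence is printed in blocks of ten bases, so it has to be joined before it can be measured. The sketch below shows one possible way to strip the spacing and compute a GC fraction in Python; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the record does not say how its own GC content figure was derived:

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, used here as a short example input.
first_line = "ATGCGAACTC AATCGACAAG GACTCCCGAG CTCCTCGCGC CCGCGGGCGG GCTTGCGCAG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence through the same two functions would give a value to compare against the GC content field.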
 
Protein sequence
MRTQSTRTPE LLAPAGGLAQ LEAALRFGAD AVYLAADRFG LRQRAANFAL YDVPAAAARA 
HDAGAKAYAT LNALMDADDL KALPAYLEAL AAAGVDAFIV SDLGALRLAQ RHAPNVELHV
STQASVCNAE AARVWHELGA SRVVCAREMS VEDIARLRAG APRELELEAF VHGAMCMAVS
GRCLISAALT GRSGNKGHCT QPCRWSYALV EEQRPGEFFP VEEDVRGTYV MNAQDLNMLA
HLDDLAAAGI DSFKIEGRNK KAFYVASVVR AYRLALDGVP SSELADELLA VSHRPYGTGF
YYGDARQSPD VDGYTAECRH AATVEACEPA GEGAFRVIAR CYNRFCEGDE LEALSPGPHV
PRVRVRNLAW LPEPDGDDAQ PKRVPVAVAN RSAERYAFET GEELAPGDFL RMRINVER
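
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. As a rough illustration, the Python sketch below builds a codon table and translates block-formatted DNA up to the first stop codon. It is an assumption-laden sketch, not the database's own method: it uses the standard codon assignments, which translation table 11 shares for elongation, and it does not handle alternative start codons (this gene begins with ATG, so none is needed here). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same assignments for elongation (it differs in permitted start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate block-formatted DNA, stopping at the first stop codon."""
    seq = "".join(blocked_dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGCGAACTC AATCGACAAG GACTCCCGAG"))  # MRTQSTRTPE
```

The output matches the first block of the protein sequence, and translating the last line of the gene sequence gives its final residues followed by the TAG stop codon.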