Gene Elen_2476 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_2476 
Symbol 
ID8416800 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2898758 
End bp2900218 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content65% 
IMG OID645025458 
Productpeptidase S1 and S6 chymotrypsin/Hap 
Protein accessionYP_003182821 
Protein GI257792215 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGATA CGCATGGCAG CACCCCCGGC AACGGCCCCG TGCCGCCGAC GCAGGGCGAC 
GAGCATGCCC AGCATGCCGC CCCCATGGGG CAGCAGCCTT ATTATCAGCA GCAACAGCAG
TACTATCAGC AGCCTCAGCA CCCGCAGCAT CCGGCAGGAG GAGTGACCGT CGAGAAGAAG
TCGCACGGCG GACGCACGTT CCTGCTCGCC TTCTGCGGTG CCGCCGTCGC CTGCGCCATC
GGTTTGGGCG GCTTCGGCAT CTGGCAAGCC ACTGCGGGCG GTAACGATTC CGGATCCTCT
TCGTCTGCGA CGCAGCTGGG TTCGCAAAAC TCCGGCAGCA TCAACGCTAC CGACGCCGAG
TCCGACCAGA CGCTGGCAGA GGCCGTCGCG CAGAAGGCGC TTCCCTCCGT GGCCGCCATC
GATGTGTACA CGAACCAGTC CAATGCGGGC GGCATGTACG GTTTTGGGGC CGGCAACGGA
TCTGAAGCCG GCACGCTGAC GAAGTCCTCG CTGGGAAGCG GCGTCGTGCT CACCGCCGAC
GGCTACATCA TCACGAACAA CCACGTCGTA GAAGGCGGCA GCGCGTACAA GGTCACCATC
GCGGGCGAGA CCTACGACGC CGAGGTCGTG GGCAGCGATC CCAGCTCCGA CGTCGCGGTC
ATCAAGGCCA AGGATGCCAG CGGCCTTACG CCCATCGAGA TCGGCGACTC CGACAAGCTC
GTCATCGGCG AGTGGGTCAT GACCATCGGC AGCCCGTTCG GCCTCGAACA GTCCGTTGCC
ACCGGTATCG TGTCGGCCAC GAGCCGTTCT CAGATTGTGA ACGCCTCCAC CGACCAGTAC
GGCAACAGCA CGGGCGAATC CACCATCTAC CCGAATATGA TCCAGACTGA CGCCGCTATC
AACCCCGGTA ACTCCGGCGG CGCGCTCGTC GACGCGGACG GCAAGCTCAT CGGCATCAAC
ACGCTGATCA CGTCGTACTC CGGCAACTAC TCTGGCGTCG GCTTCGCCAT CCCGGTGAAC
TACGCGGTGA ACCTCGCCCA GCAGATCATC GACGGCAAGA CCCCGACCCA TGCGCAGCTC
GGCGTGTCCC TCTCCACCGT GAACGCGCAG AACGCCAAGC GCTACGGCCT GTCCGTTGAC
GAAGGCGCCT ACGTGGCGGC CGTCAGCGAA GGCTCCGGCG CGGCCGAAGC CGGCTTGCAG
GAGGGCGACA TCGTCACGAA GTTCGACGGC AAGGACGTCG CATCCGCCAG CGACCTCATG
CTGGACGTGC GCTCCAAGAA CCCGGGCGAC AAGGTGACGC TCGACGTGAA CCGCAACGGC
GAGACCAAGC AAGTCGAGGT CACGCTCGGC TCCGATGAAA GCTCCCAGAG CGCGTCGACC
CAGCAGAACA GCGCGCAGGA GTCTATGCTC GAGCGCCTGT TCGGCGGCGG CTCCGGCAGC
TCCCAGCAGG ACGCTGCCTA G
 
Protein sequence
MTDTHGSTPG NGPVPPTQGD EHAQHAAPMG QQPYYQQQQQ YYQQPQHPQH PAGGVTVEKK 
SHGGRTFLLA FCGAAVACAI GLGGFGIWQA TAGGNDSGSS SSATQLGSQN SGSINATDAE
SDQTLAEAVA QKALPSVAAI DVYTNQSNAG GMYGFGAGNG SEAGTLTKSS LGSGVVLTAD
GYIITNNHVV EGGSAYKVTI AGETYDAEVV GSDPSSDVAV IKAKDASGLT PIEIGDSDKL
VIGEWVMTIG SPFGLEQSVA TGIVSATSRS QIVNASTDQY GNSTGESTIY PNMIQTDAAI
NPGNSGGALV DADGKLIGIN TLITSYSGNY SGVGFAIPVN YAVNLAQQII DGKTPTHAQL
GVSLSTVNAQ NAKRYGLSVD EGAYVAAVSE GSGAAEAGLQ EGDIVTKFDG KDVASASDLM
LDVRSKNPGD KVTLDVNRNG ETKQVEVTLG SDESSQSAST QQNSAQESML ERLFGGGSGS
SQQDAA