Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2476 |
Symbol | |
ID | 8416800 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2898758 |
End bp | 2900218 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 645025458 |
Product | peptidase S1 and S6 chymotrypsin/Hap |
Protein accession | YP_003182821 |
Protein GI | 257792215 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGATA CGCATGGCAG CACCCCCGGC AACGGCCCCG TGCCGCCGAC GCAGGGCGAC GAGCATGCCC AGCATGCCGC CCCCATGGGG CAGCAGCCTT ATTATCAGCA GCAACAGCAG TACTATCAGC AGCCTCAGCA CCCGCAGCAT CCGGCAGGAG GAGTGACCGT CGAGAAGAAG TCGCACGGCG GACGCACGTT CCTGCTCGCC TTCTGCGGTG CCGCCGTCGC CTGCGCCATC GGTTTGGGCG GCTTCGGCAT CTGGCAAGCC ACTGCGGGCG GTAACGATTC CGGATCCTCT TCGTCTGCGA CGCAGCTGGG TTCGCAAAAC TCCGGCAGCA TCAACGCTAC CGACGCCGAG TCCGACCAGA CGCTGGCAGA GGCCGTCGCG CAGAAGGCGC TTCCCTCCGT GGCCGCCATC GATGTGTACA CGAACCAGTC CAATGCGGGC GGCATGTACG GTTTTGGGGC CGGCAACGGA TCTGAAGCCG GCACGCTGAC GAAGTCCTCG CTGGGAAGCG GCGTCGTGCT CACCGCCGAC GGCTACATCA TCACGAACAA CCACGTCGTA GAAGGCGGCA GCGCGTACAA GGTCACCATC GCGGGCGAGA CCTACGACGC CGAGGTCGTG GGCAGCGATC CCAGCTCCGA CGTCGCGGTC ATCAAGGCCA AGGATGCCAG CGGCCTTACG CCCATCGAGA TCGGCGACTC CGACAAGCTC GTCATCGGCG AGTGGGTCAT GACCATCGGC AGCCCGTTCG GCCTCGAACA GTCCGTTGCC ACCGGTATCG TGTCGGCCAC GAGCCGTTCT CAGATTGTGA ACGCCTCCAC CGACCAGTAC GGCAACAGCA CGGGCGAATC CACCATCTAC CCGAATATGA TCCAGACTGA CGCCGCTATC AACCCCGGTA ACTCCGGCGG CGCGCTCGTC GACGCGGACG GCAAGCTCAT CGGCATCAAC ACGCTGATCA CGTCGTACTC CGGCAACTAC TCTGGCGTCG GCTTCGCCAT CCCGGTGAAC TACGCGGTGA ACCTCGCCCA GCAGATCATC GACGGCAAGA CCCCGACCCA TGCGCAGCTC GGCGTGTCCC TCTCCACCGT GAACGCGCAG AACGCCAAGC GCTACGGCCT GTCCGTTGAC GAAGGCGCCT ACGTGGCGGC CGTCAGCGAA GGCTCCGGCG CGGCCGAAGC CGGCTTGCAG GAGGGCGACA TCGTCACGAA GTTCGACGGC AAGGACGTCG CATCCGCCAG CGACCTCATG CTGGACGTGC GCTCCAAGAA CCCGGGCGAC AAGGTGACGC TCGACGTGAA CCGCAACGGC GAGACCAAGC AAGTCGAGGT CACGCTCGGC TCCGATGAAA GCTCCCAGAG CGCGTCGACC CAGCAGAACA GCGCGCAGGA GTCTATGCTC GAGCGCCTGT TCGGCGGCGG CTCCGGCAGC TCCCAGCAGG ACGCTGCCTA G
|
Protein sequence | MTDTHGSTPG NGPVPPTQGD EHAQHAAPMG QQPYYQQQQQ YYQQPQHPQH PAGGVTVEKK SHGGRTFLLA FCGAAVACAI GLGGFGIWQA TAGGNDSGSS SSATQLGSQN SGSINATDAE SDQTLAEAVA QKALPSVAAI DVYTNQSNAG GMYGFGAGNG SEAGTLTKSS LGSGVVLTAD GYIITNNHVV EGGSAYKVTI AGETYDAEVV GSDPSSDVAV IKAKDASGLT PIEIGDSDKL VIGEWVMTIG SPFGLEQSVA TGIVSATSRS QIVNASTDQY GNSTGESTIY PNMIQTDAAI NPGNSGGALV DADGKLIGIN TLITSYSGNY SGVGFAIPVN YAVNLAQQII DGKTPTHAQL GVSLSTVNAQ NAKRYGLSVD EGAYVAAVSE GSGAAEAGLQ EGDIVTKFDG KDVASASDLM LDVRSKNPGD KVTLDVNRNG ETKQVEVTLG SDESSQSAST QQNSAQESML ERLFGGGSGS SQQDAA
|
| |