Gene Information |
Locus tag | Emin_0653 |
Symbol | |
ID | 6263767 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Elusimicrobium minutum Pei191 |
Kingdom | Bacteria |
Replicon accession | NC_010644 |
Strand | + |
Start bp | 719638 |
End bp | 720720 |
Gene Length | 1083 bp |
Protein Length | 360 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 642611124 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_001875545 |
Protein GI | 187251063 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
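The coordinate and length fields in the gene table are linked by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a stop codon not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical and are not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, has_stop=True):
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, when has_stop is True,
    that the final codon is a stop codon that is not translated.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop else codons


# Values taken from the record above (Start bp, End bp).
length_bp = gene_length(719638, 720720)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1083 360
```

Under these assumptions the computed values agree with the Gene Length (1083 bp) and Protein Length (360 aa) fields.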
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAT TTTTATTTTT AATTTCTTTA TTCGCGCTGC CTAATATACT ATTTGCTTTA GATGAGGAAT ATGCAAGCTT GGGCAATAAT CCTCCCGTGC CGGCCAGGGC GGTTGCTAAC CCGCAGGCGG GCGGGCCCCA TAAAAGAGGC TCGCTGGGGC CGACTTCAAC TGTTAATAAA AGAAACCCTT TTGATTTAAA CAGCGATATT TTGGAAAACT ATAAAAATTG GGGAAAAACC CGCATGAATT TGGAAGCCGC CCATAAATTG GGTATAACCG GTGCGGGCGT TACCGTTATG GTTATAGATA GCGGCGTATC TCCTCACAAG GAATTTAAAA CCGGGGCCAT AAGCACTTTG GATTTTACCT CAAGCGGTCC TTATGATACT TTTGGACATT CAACAGGGGT TATAGGCATA ATAATAGCCA AAGGCGAAGA TATGCTCGGC GTAGCGCCTG ACGCTAAAAT TTACTCGGCC AAAGCAAACC CGGGGCAGGG GTTGATATCT TCCGGACCTG TCGTAAATGC TATTAACTGG GCTGTTGAGC ATAATAAAAC ATCGCAAGAT AAAATAAGCG TTATAAATTT AAGTTACGGC GTAAGCGGCT GGCAGCAAGA CCTTGCCGAC GCCATAAAAA ACGCTTACAA AGCCGGCATA ATTATCGTTG CGCCAAGCGG CAATGAAGGT TTTCATAAAG TTCTTTTTCC GGCTAGTATG GATGAGGTTA TAGCCGTTTC CGGCATAACC GCGCATGACG GCGCTTACGG CAAAAGTTCT TACGGCGCGC AGGTTGATTT TACCGCGCCG GCTTCCGCCG TTTACACAAC AGGTTTAAAC AATTCTTATA TTTGGGCGGA CGGAACCTCT GTCGCCGCGC CTTATGTGGC GGGTATGGCC GCTTTGGCTA TCGAAGGATA CAGGCTTGCT AACGAAGGTA AGGATCCTTC GCCCGCGCAG GTAAAGGAAA TTTTAGCCGC GGCCTCGTCG CTTGCCAGCG GGCCGCATAA ACTTAAACAA GGTTACGGTG TTATAGATGC GGGTAAAGTG GCTGCGAGGT TTGTTCCCGC AGGTAAAAAA TAA
Protein sequence | MKKFLFLISL FALPNILFAL DEEYASLGNN PPVPARAVAN PQAGGPHKRG SLGPTSTVNK RNPFDLNSDI LENYKNWGKT RMNLEAAHKL GITGAGVTVM VIDSGVSPHK EFKTGAISTL DFTSSGPYDT FGHSTGVIGI IIAKGEDMLG VAPDAKIYSA KANPGQGLIS SGPVVNAINW AVEHNKTSQD KISVINLSYG VSGWQQDLAD AIKNAYKAGI IIVAPSGNEG FHKVLFPASM DEVIAVSGIT AHDGAYGKSS YGAQVDFTAP ASAVYTTGLN NSYIWADGTS VAAPYVAGMA ALAIEGYRLA NEGKDPSPAQ VKEILAAASS LASGPHKLKQ GYGVIDAGKV AARFVPAGKK
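Both sequences are displayed in space-separated blocks of 10 residues, so the spacing has to be removed before the text is used with other tools. The sketch below shows one way to do that and to run a basic sanity check on a coding sequence. The helper names `join_blocks` and `looks_like_cds` are hypothetical, and the check assumes an ATG start codon (as in this record) and the three stop codons of translation table 11.

```python
def join_blocks(grouped):
    """Remove display whitespace from a sequence shown in blocks."""
    return "".join(grouped.split())


def looks_like_cds(seq, stops=("TAA", "TAG", "TGA")):
    """Rough check: whole codons, ATG start, and a terminal stop codon.

    Table 11 also permits alternative start codons; this sketch only
    accepts ATG, which is an assumption made for simplicity.
    """
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in stops


# First three blocks of the gene sequence field above.
head = join_blocks("ATGAAAAAAT TTTTATTTTT AATTTCTTTA")
print(len(head))  # 30

# Toy example, not taken from the record.
print(looks_like_cds(join_blocks("ATGAAA TAA")))  # True
```

Applied to the full Gene sequence field, the same check would be expected to pass, since the record starts with ATG, ends with TAA, and has a listed length of 1083 bp.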