Gene Information |
Locus tag | Cphy_0352 |
Symbol | |
ID | 5742198 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | + |
Start bp | 443491 |
End bp | 444552 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 641291442 |
Product | glycoprotease family metalloendopeptidase |
Protein accession | YP_001557478 |
Protein GI | 160878510 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
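
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length; neither convention is stated in the record itself, and the variable names are illustrative.

```python
# Values copied from the Gene Information table.
start_bp = 443491
end_bp = 444552

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, which encodes no residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1062
print(protein_length)  # 353
```

Under these assumptions the computed values agree with the listed Gene Length (1062 bp) and Protein Length (353 aa).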
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.239729 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGGGTT ATATGGAAGA TTATCGAAAA GATGTAGTGG AAAAAGAAGA TGTATATATT TTGGCAATAG AGTCATCCTG TGATGAGACT GCTGCTTCTG TAGTGAAAAA CGGTAGAGAA GTTTTATCCA ATGTTATATC AACTCAAATA GATTTACATA CCTTGTATGG TGGTGTAGTT CCAGAGATAG CGTCAAGAAA GCATATGGAA AAAATAAATC AAGTAATTGA GGAAGCATTA AGCCAAGCAC AGATGGGATT AACAGATATG AATGCAGTGG CGGTGACTTA TGGACCTGGT CTAGTTGGCG CTTTATTGGT AGGCGTTGCG GAAGCAAAAG CAATTAGTTA CGCTGCAAAA CTACCTTTAA TCGGGGTACA CCATATAGAA GGACATGTTA GCGCAAACTA TATTGAGAAT AAAGAACTAG AACCTCCATT TTTATGCTTA ATTGTATCAG GAGGACATAC ACATCTTGTG CTTGTTAAGG AGTATGGAAC TTACGAAATT ATCGGAAGAA CTCACGATGA TGCGGCTGGT GAGGCATATG ATAAGGTAGC CAGAGCAATT GGGCTTGGTT ATCCAGGAGG TCCTAAGATT GATAAACTTG CAAAGGAAGG CAAAAAGAAC GCAATTGTTT TTCCAAGAGC AAGTATAGAA GGCTGTCCAT TTGATTTTAG CTTTAGTGGT GTTAAATCTG CTGTATTAAA TTATATCAAT TCTAGTACAA TGAAAGGGGA AGCAATTAAC CGTGCGGATA TAGCAGCTTC CTTTCAGGAG GCAGTAGTTG ATGTTTTAAT CTCCCATACA ATGGCAGCAG CGAAAGAGTA TGGTATGAAG AAGATTGCAA TAGCAGGGGG TGTTGCATCC AATAGTGCAC TGCGATCTGC TATGGAGGAA GCTTGTAAAA AGAATCAATT CGAGTTTTAT CATCCAACGC CTCTGTTTTG TACTGATAAT GCAGCAATGA TAGGTGCAGC GGCTTATTAT GAATATCTAA AAGGAAATTT TAGCGGATTA GATTTAAATG CAGTACCTAA CTTAAAGATA GGTCAACGTT AG
|
Protein sequence | MEGYMEDYRK DVVEKEDVYI LAIESSCDET AASVVKNGRE VLSNVISTQI DLHTLYGGVV PEIASRKHME KINQVIEEAL SQAQMGLTDM NAVAVTYGPG LVGALLVGVA EAKAISYAAK LPLIGVHHIE GHVSANYIEN KELEPPFLCL IVSGGHTHLV LVKEYGTYEI IGRTHDDAAG EAYDKVARAI GLGYPGGPKI DKLAKEGKKN AIVFPRASIE GCPFDFSFSG VKSAVLNYIN SSTMKGEAIN RADIAASFQE AVVDVLISHT MAAAKEYGMK KIAIAGGVAS NSALRSAMEE ACKKNQFEFY HPTPLFCTDN AAMIGAAAYY EYLKGNFSGL DLNAVPNLKI GQR
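
The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short script. This is a minimal sketch, not the database's own method: the helper names `clean`, `gc_content`, and `translate` are hypothetical, and the codon table is built from the amino acid assignments of NCBI translation table 11. Table 11 also allows alternative start codons, which this sketch ignores because the gene starts with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acid assignments for translation table 11, with codons ordered
# TTT, TTC, TTA, TTG, TCT, ... (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 12 bases of the gene sequence listed above.
first_30 = "ATGGAGGGTT ATATGGAAGA TTATCGAAAA"
last_12 = "GGTCAACGTT AG"

print(translate(first_30))  # MEGYMEDYRK
print(translate(last_12))   # GQR
```

The two printed fragments match the start and end of the listed protein sequence. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field.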