Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpin_3372 |
Symbol | |
ID | 8359538 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chitinophaga pinensis DSM 2588 |
Kingdom | Bacteria |
Replicon accession | NC_013132 |
Strand | + |
Start bp | 4134654 |
End bp | 4135664 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 644965545 |
Product | metalloendopeptidase, glycoprotease family |
Protein accession | YP_003123040 |
Protein GI | 256422387 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.115677 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.185322 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGTAA AAATACTGGC CATAGAGTCA TCGTGCGACG AGACAAGTGC TTCCGTACTG GCCGATGGAA AGATCCTGTC TAATTTTATT GCCAATCAAA CTATTCATGA GCAATACGGT GGCGTAGTGC CTGAACTGGC CTCCCGTGCT CACCAGGAAA ATATTGTCCC TGTCGTGGAC CAGGCCCTGA AAGTGGCGGG TGTACGTAAG GAGGAACTGA ACGCCATTGC CTTTACCCAG GCGCCGGGTC TGATCGGTTC CCTGCTGGTA GGTAGTTGTT TTGCTAAATC CATGGCGCTG GCCCTGGACG TTCCGCTGAT AGCTGTACAC CACATGCAGG CGCACGTACT GGCTAATTTC ATTGGTGAGG ATAAGCCTTC TTTCCCTTTC CTCTGTCTGA CAGTGTCTGG TGGTCATACC CAGATCGTAC GTTGTGACAG TCCTTTGCAG ATGAAGGTAA TCGGTGAAAC ATTAGACGAT GCCGCTGGTG AAGCGTTTGA TAAAAGTGCC AAATTACTGG GCCTGCCATA TCCTGGTGGT CCGCTGATAG ATAAATACGC CCGTGAGGGC AATCCGGACA GGTTCAAATT CCCTGAACCA CAGATCCCGG GACTGAACTT CAGCTTTAGC GGTCTGAAGA CCTCCATTCT CTACTTCCTC CAGGAACAGC AACAGAAAGA TCCGCAGTTC GCCGAAAATA ACATGGCGGA TATCTGTGCT TCTATCCAGC ATCGTATCGT CAGTATCCTG ATGAACAAAC TGGTAAAAGC ATCAAAGGAA ACGGGTATCA AAGAGATCGG TATAGCAGGT GGCGTGAGCG CTAATTCCGG TTTACGTAAC GCTTTACAAC AGTATGGTGA AAAGTATGGC TGGAAAACCT ACATACCGAA ATTTGAATAC TGTACAGATA ATGCTGCGAT GATTGCCATG ACCGCCTGGT ATAAATATCA GGCAGGGGAG TTTGTAGGAC TGGATGCCGT TCCAGGCGCC AGAGCAGGTT TTGAACATTA G
|
Protein sequence | MSVKILAIES SCDETSASVL ADGKILSNFI ANQTIHEQYG GVVPELASRA HQENIVPVVD QALKVAGVRK EELNAIAFTQ APGLIGSLLV GSCFAKSMAL ALDVPLIAVH HMQAHVLANF IGEDKPSFPF LCLTVSGGHT QIVRCDSPLQ MKVIGETLDD AAGEAFDKSA KLLGLPYPGG PLIDKYAREG NPDRFKFPEP QIPGLNFSFS GLKTSILYFL QEQQQKDPQF AENNMADICA SIQHRIVSIL MNKLVKASKE TGIKEIGIAG GVSANSGLRN ALQQYGEKYG WKTYIPKFEY CTDNAAMIAM TAWYKYQAGE FVGLDAVPGA RAGFEH
|
| |