Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccur_10820 |
Symbol | |
ID | 8375289 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cryptobacterium curtum DSM 15641 |
Kingdom | Bacteria |
Replicon accession | NC_013170 |
Strand | - |
Start bp | 1228557 |
End bp | 1229972 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 644994004 |
Product | collagenase-like protease |
Protein accession | YP_003151455 |
Protein GI | 256827496 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 152 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTCGA CCGCAAATAA TTTAGCTATC TCGTCAGCGG TTGATAAACG TGAATGTGAG AAGTGCGAAC TACTTGCTCC AGCTGGTGGG TGGGAACAGC TGCATTACGC GATTCGGTTT GGCGCCGATG CAGTGTATTT GGCAGCTGAC CGGTTTGGCA TGCGGGCACG GGCGAATAAC TTTTCTCTCG ATGAAATACC GCAGGTTGTT GAGTGCGCCC ACCAGGCGAA TGTTGCGGTT CACGTGACCC TCAATGTGCA GATGTACGAC GGTGACTTGG CGGAACTGGC GCACTATGTT CACTCGCTTG CGTCAGCACA GGTTGATGCG GTTATTGTGG GCGATTTGGG CGCACTGAGG ATAGTACGTC AACAGGCGCC GGATCTCGCA GTGCATGTGA GTACGCAGGC ATCAGTTTCG AATGTTCCCT CGGCGTTAAC CTGGTATGAA CTGGGAGCAC GCCGCATTGT CTGTGCGCGC GAAATGAGCC TTGATGCCAT TGCGCGGCTC CGGACAGAAC TTCCCAGCGA TTTAGAAATA GAAGCCTTTG CGCATGGTGC ACAGTGTATG GCAACATCGG GTAGGTGTTT GATCAGCGAT TATATGACAG GGCGCAGTGG TGTAACAGGT AATTGTGCGC AGCCCTGTCG TTGGAAATAC TCGCTGCAAG AAGAAAAGCG ACCCGGCAAA TTCTTCAGCG TGGAAGAGGA CGACCGGGGT AGCTATCTAT TAAATGCGCA GGATTTGAAT ATGCTGGCTC ACGTGGATGA TATGCGCCAG GCGGGTATTA ATTCCATCAA GATTGAAGGG CGCAACAAAA AGGCGTTCTA TGTGGCGTGC GTCGTTAATG CGTATCGACA GGTACTTGAT GGAGCTGATC CATCTGACTG GGAAGGTGAA CTGGAAACGG TTTCACATCG GCCCTATGGT ACAGGATTCT ATTTTGGTCC AGCACACCAA ACACCTGAAA CAGATGACTA TGTTCGTCCG TATGACTGGG TATTTGAAGT GCTCACCTGT AAGGCAGAAC CAGATGGATC GTGGTGTGCG TGGGGCTTAG CGCGCAATCG CTTTACTCAT AATGCGCAGC TTGAAGTACT CTCTCCCGGT CAACCGGTGC GGACATTTCA TGCCGAGGAC ATCCACTGGG TGCCGCGTCT TGGGTGTTCT GAAGTTGATG CTGCGCGTGC CGCTGGCTTG TCTGACCCAC TGACCAATCA CCTTGATGCG AACGTAGCTG CTTCGGCACA TACTTTTGCC GAGCGCTTGA TTGCTTCAGG GTTACTTGAT TTGGCCCGCC CTGCGCGAGC GCAGGTTGAA GAAGCCAACC GCATCATGGA TGTCTATACC ATGCGAGTGC CGTTTCCGCT TTGTGCTCAC GACATGGTGC GCGCGCCGCG CAGTGAAACT ATGTAA
|
Protein sequence | MSSTANNLAI SSAVDKRECE KCELLAPAGG WEQLHYAIRF GADAVYLAAD RFGMRARANN FSLDEIPQVV ECAHQANVAV HVTLNVQMYD GDLAELAHYV HSLASAQVDA VIVGDLGALR IVRQQAPDLA VHVSTQASVS NVPSALTWYE LGARRIVCAR EMSLDAIARL RTELPSDLEI EAFAHGAQCM ATSGRCLISD YMTGRSGVTG NCAQPCRWKY SLQEEKRPGK FFSVEEDDRG SYLLNAQDLN MLAHVDDMRQ AGINSIKIEG RNKKAFYVAC VVNAYRQVLD GADPSDWEGE LETVSHRPYG TGFYFGPAHQ TPETDDYVRP YDWVFEVLTC KAEPDGSWCA WGLARNRFTH NAQLEVLSPG QPVRTFHAED IHWVPRLGCS EVDAARAAGL SDPLTNHLDA NVAASAHTFA ERLIASGLLD LARPARAQVE EANRIMDVYT MRVPFPLCAH DMVRAPRSET M
|
| |