Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_0616 |
Symbol | |
ID | 8331945 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 718584 |
End bp | 720947 |
Gene Length | 2364 bp |
Protein Length | 787 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644953768 |
Product | Microbial collagenase |
Protein accession | YP_003111393 |
Protein GI | 256389829 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00460764 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.253983 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCCACC GAACACGCCT GCCCGGAATC GCCGCCGCGG TGCTCGCCGC GGTGGCGGTC GCCGGGATGA CCGGGGTGGG AGCGGCCTCC GCCGCCCCGC GTCCGGCCGC CGCCTCGGTC GCCCCGACCG TGAGCCACGC GAACACGAAC ACGAACACGA ACAAGAACGC CACGCCGAAG GGTTCCGCCG ACCCGGCCAT CGCCGGGAAG TCCGTGGTCG CGAAGACCGG AAAGCCGGTC GGCGACCTCG CGCCGCGCGC ACCGATCGGC GGCAGCGCCG ACGGAACCAA GACCGGCGTT GCGCAGACCG GCGCCACGCA AACCAGCGCC GCAAAGCAGT CCCCGGCCGC CAAGACCCCC GCCAAAACCC CCGCAGCCCA GTCCTGCACC CCCGCGGACT TCGCCTCCCG CTCCGGCAGC ACCCTCAGCG CCTTCGTCAA GGCATCGACC ACCGACTGCA TCAATACCCT GTTCTCGGTC ACCGGCTCCA ACGCTTCGGC GATCTTCAGC GAAGCCAAGA TGGAGACCAT CGCAGGGTCC TACAAGACCG CCGCCGCCAG CTACCCGGGC ACCGACGCGA CGAGCGTCCA GCAGCTCGTG CTGTTCCTGC GCGCCGGGTA CTACGTGCAG TACAACAGCA ACGGCTCGAT CCCGGCCTAC GGCCAGGCGC TCGCCACGCT GGTCGAGGGC GGCCTCGACG CCTTCTTCGC CGGCTCGCAC GCCCTGGACG TCAACGACAA CAACGGCCCG GTCCTCGGTG AGACGATCAT CCTCACCGAC AGCGCCGACG AACAGGCCCG CTACCTCAGC ACCTACCAGC GGATCCTGAA CGCCTACACC AGTTCCTACA ACGCCTACTC GACGATGCTG AACGCGGTGA ACGACGTCTA CACGCCGCTG TTCCGAGGCC ACCAGTTCCC GGCGTTCGTC ACGGCCGTCA CCGCGAACCC GAGCATCATC GACACCCTGA ACAGCTTCGC GCTGAACCAC AAGAACCTGC TGGGAGGGGA CAACTCCTAC CTGGACTCCA ACGCCGGGCT GGAGATGAGC CGGTTCGTTC AGCACACGGC GCTGCAGGCC AAGGTGCGTC CGCTGATGAA GGGTCTGCTC GCGGCCTCGT CGATGACCGG ACCGACGGCG CCGCTGTGGG TCGCGGTCGC CGGCAACGCC GACTACTACG ACCAGGCGAA CTGCGCCTAC TACAACGTGT GCGATCTGGC CGCCAAGCTC ACCGCCGCCG TGCTCACCAC GACGACGCAC TGCGACGCCA GCCACACCGT GCTGTCCCAG GCGCTGACCG CCTCCGACAC CAGCGCCGTG TGCGCGAGCA TCCTGGGCCA GTACTCGTAC TTCCACACCG TGGTGCACGA CAGCGGCCCG ATTCCCGGGC AGTACGACCA GAACTTCGTG CTGACCGTGT TCGCCTCGCC CACGGACTAC CAGACCTACG CCGGACCGAT CTACGGCGTG GACACGGACA ACGGCGGCAT CACCCTGACC GGGGATCCGA CCGATCCGTC CAACATCGTC CGCTCGATCA TGTACCAGTG GGACACCGAC AACGGCTTCG TGGCGCGCGT GTGGAACCTG AACCACGAGT TCACCCACGC GCTGGACGCC GAGTACGACA CCAAGGGCGA CTTCACCGCC GAGATCGTGG TCCCGGACAT CTGGTGGATC GAGGGCGTCG CGGAGTACGT CTCGTACAGC TACCGCGATG TCACCGACAC CGAGGCGGTG AGCGAGGCGG CGACCCACCG GTACGCGCTG AGCACCCTGT GGCAGAGCTC GTATGACAAC AGCGACGAGA CACGCACCTA CCCCTGGGGC TACCTCGCCG TCCGCTACAT GATGGAGCGG CACCCCGCCG ACATCGCCAC ACTCCTGGCG AAGTTCCGCG TCGGCGATTA CCAGGGCGCG TACGCCTTCT ACGGCACGAC CATCGGCACG GCGTACGACG CCGACTTCAA TTCCTGGCTC GACCAGTGCG CGGCCGGCGC CTGCCAGGCC GGCGGCGGAA CCACGCCGCC GCCCCAGAAC TGCTCCGACC CCGACACCCG GGCGATGGAC CAGAACTGCT CGCGCACCGG CGAGTCGGCG GCGGCCGGCG CGATCGACTA CTTCTACATC GACATTCCCG CCGGGACGTC GTCGCTGACC ATCACCACCA CCGGTGGCAG CGGCACGGCG TACCTGCTGT ACAACCCCTC GACGTGGGCG ACCCCCACCG CGTACACGCA GGGCTCGTTG AACAACGGCA CGACACAGAG CCTGACGATC ACCGATCCGC CATCCGGCTA CCGGTACATC AGCCTGTACG GGCAGACCGC CTTCAGCGGG GTGACCATCA CCACGTCCTA CTGA
|
Protein sequence | MRHRTRLPGI AAAVLAAVAV AGMTGVGAAS AAPRPAAASV APTVSHANTN TNTNKNATPK GSADPAIAGK SVVAKTGKPV GDLAPRAPIG GSADGTKTGV AQTGATQTSA AKQSPAAKTP AKTPAAQSCT PADFASRSGS TLSAFVKAST TDCINTLFSV TGSNASAIFS EAKMETIAGS YKTAAASYPG TDATSVQQLV LFLRAGYYVQ YNSNGSIPAY GQALATLVEG GLDAFFAGSH ALDVNDNNGP VLGETIILTD SADEQARYLS TYQRILNAYT SSYNAYSTML NAVNDVYTPL FRGHQFPAFV TAVTANPSII DTLNSFALNH KNLLGGDNSY LDSNAGLEMS RFVQHTALQA KVRPLMKGLL AASSMTGPTA PLWVAVAGNA DYYDQANCAY YNVCDLAAKL TAAVLTTTTH CDASHTVLSQ ALTASDTSAV CASILGQYSY FHTVVHDSGP IPGQYDQNFV LTVFASPTDY QTYAGPIYGV DTDNGGITLT GDPTDPSNIV RSIMYQWDTD NGFVARVWNL NHEFTHALDA EYDTKGDFTA EIVVPDIWWI EGVAEYVSYS YRDVTDTEAV SEAATHRYAL STLWQSSYDN SDETRTYPWG YLAVRYMMER HPADIATLLA KFRVGDYQGA YAFYGTTIGT AYDADFNSWL DQCAAGACQA GGGTTPPPQN CSDPDTRAMD QNCSRTGESA AAGAIDYFYI DIPAGTSSLT ITTTGGSGTA YLLYNPSTWA TPTAYTQGSL NNGTTQSLTI TDPPSGYRYI SLYGQTAFSG VTITTSY
|
| |