Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_7152 |
Symbol | |
ID | 8338520 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 8319230 |
End bp | 8321797 |
Gene Length | 2568 bp |
Protein Length | 855 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644960233 |
Product | Peptidase M9A collagenase domain protein |
Protein accession | YP_003117822 |
Protein GI | 256396258 |
COG category | [R] General function prediction only |
COG ID | [COG3291] FOG: PKD repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.766617 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.500432 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCCATC GATCTGCCCT GCCCAGAATT CTGGCTCTGG GCCTCGCGGT CTGCACCGCG TCGTTCGGGC TGGCCGCCGG ACCGGCCGGT GCGGTGTCGG GTGCGGCGAG CGCCGCGCCG CCGTCCGTTC CCGCGCCGAG CGGTGCTGCC GTCGCGCCGG CGGGAGCCTC GGAGAGTCAC ACGGCTGCCA AGCCGTCGCC GGCCGGTCCG CCGGTCGCCA TCGGCGCCGC GACGTCGCTG GCGGTGACGA GCCCTGCGGG CCCGGCTGGA CCGGACGCCG CGTCCGCCGT GACCGCGGCA TCTTCGGCGG CGCCTGCGGG CACGACACCC AGCACACCCA GCACGCAGTC CTGCACCGCC GCTGACTTCG GCAGCCGCGG CGGCGCCAAG CTGGCTGCCT ACGTCAAGAC GTCGACCACC GACTGTCTCA ATACCCTGTA TGCGCTCACC GGCTCCGATG CTGCCGCGGT TTTCAAAGAG TCGCAGATGA CCGCCGTGGC GAACGCGTTC GTGAGCACGG CGCGGAACTA CCGGGGCGAC GACTCCAGCG GTATCTGGCA GCTCAGTCTG TTCCTGACCG CCGGGTACTA CGTGCAGTAC AACAACGCTG CCGCGGTCGG GTCGTACGGC TCTGCCTTGG CCGATCCCGT TCAGAACGGT CTGGACGCCT TCTTCTCCGC GAGGCACTCC TCGGACGTCA GCGCTGCCAA CGGAAACGTC CTGGGCAATG TCATTACCCT GAGCGACAGT GCCGACCTGC AGGCGCGCTA CATCAGCGTC TACAAGCGGG TGCTGAACGG CTACAAGAGT TCCTACGACG CCTTCCCAAG CATGGACGCG GCGGTGAACG CGGTGTTCAC GCCGATCTAT CGCGGTCACT TCTTCCCGGC GTATATAGCG GCGGTCACGG CGGACCCGAG CCTCATAGAC GCGCTGAACT CCTTTGCGCT CAACAACACC GCGCTGCTCC CCGGCGCGAA CTACGCGCTG GACACAAACG CCGCTGCCGA AGCCGTCCGC TTCCTCGACA CCCCGGCGCT GCAAGCCAAG GTGCGGCCGC TGGCCGCGCA CCTGCTCGCG ATCTCGCCGC TGCCCGGGCC GAACGGTCCG CTGTGGGTGC GGGTCGCGGT CGTCGTCGAC TACGTCGACG GCGCGGAGTG CGGGACGTTC GGCGTCTGCG ACTACACGGA CACGCTCAAG GCGGCGGTCC TCCCGACGAC CTACCCCTGC GGCACGACGC GCACCATCCT GGCGCAGGCG ATGACCGCGG CGGACCTGAA CGCGGCCTGC ACCAGTCTGC AAGGTGAGGA TGCGTTCTAC CACGGCCTGG TGAAGGACGG CGGCCCGATC GCCGGGCAGT ACGACACGAA CGTGCGCCTC GCGGTCTTCG CCACCAAGTG GGACTACACC GTGTACTCCA CGGCGCTGTT CGGCAACGAC ACCGACAACG GCGGCGAGAC CCTCAGCGGC GACATCACCG ACCCGGCGAA CCAGCCGATC TCGGTGATGT ATGTGAAGTT CCCCGGCGAC GGCTTCCCGG CGAGCGTGTG GAACCTGAAC CACGAATACA CGCACCTGCT GCAGGGCGAG TTCGACATGA AGGGCACCTT CGACCAGGAG ATCTCGGTCC CCGACATCTG GTGGGTCGAG GGTGAGGCGG AGTACGTCTC CTACGCCTAC CGCGGTCTGA ACAACACCCA GGCGATCGGC GAGGCGAGCC AGCACGCGTT CCCGCTGAGC ACGCTGTTCC AGACCACCTA CGACAACACC ACCACCGACC GCACCTACAC CTGGGGTTAC CTGGCGGTGC GGTACATGAT CGAGAAGCAC CCGGCGGACG TCCAGGCGAT GCTGGCGAAG TTCCGAGTCG GCGACTGGGC CGGCGGCTAC GCGGTCTACA ACGCGATCGG CACCAAGTAC GACGCTGACT TCGACGCCTG GCTCGACGTC TGCGCGGCCG GCGCGTGCCT GGTCCCCGGT GCTCCGACCG CGGCGTTCAC CATGGCTCCC GACGGTCTGT CCGTCCACTT CTCCGACGTC TCCACCGACA CCGGCAGCCC GCTGAACTTC GAGCGTTGGA GCTACGGCGA CAACACCATC TCCACGACCG ACGGCCCGAA CCCGACCCAC ACCTTCCCCG CCGCCGGGAC CTACACCATC GCCTTGACCG TCGATGACGC GAAGGGGTTG AGTTCCACCT ACGCGCAGAA CGTCACGGTG ACCGGTCCGT CCGGCCCGCC GCCGTGCCCC TCGGCCAACC CGCAGGCGAT GGACCGCAAC TGCTCCCGCG CCGACCAGTC GGAGACTGCG GGCGACTACG ACAGCCTGTG GATCTACCTG CCCGCCGGAC AGGTGACCCT GCATGTCGCC ACCACCGGCG GCAGCGGCAA CGCAGACCTC TACTACGACC CCGACACCTG GGCCACGAAG CAGACCCACA CCGCGAAGTC GACCGGCGGC GGCAACGACC AGAGCATCAC CGTGACGAAC AAGACGGCGG GGTACCGGTA CATCAGCTTG TACGCGGTCA CGTCGTTCAG CGGAGTCAGC GTCTCGACGC AGTACTGA
|
Protein sequence | MRHRSALPRI LALGLAVCTA SFGLAAGPAG AVSGAASAAP PSVPAPSGAA VAPAGASESH TAAKPSPAGP PVAIGAATSL AVTSPAGPAG PDAASAVTAA SSAAPAGTTP STPSTQSCTA ADFGSRGGAK LAAYVKTSTT DCLNTLYALT GSDAAAVFKE SQMTAVANAF VSTARNYRGD DSSGIWQLSL FLTAGYYVQY NNAAAVGSYG SALADPVQNG LDAFFSARHS SDVSAANGNV LGNVITLSDS ADLQARYISV YKRVLNGYKS SYDAFPSMDA AVNAVFTPIY RGHFFPAYIA AVTADPSLID ALNSFALNNT ALLPGANYAL DTNAAAEAVR FLDTPALQAK VRPLAAHLLA ISPLPGPNGP LWVRVAVVVD YVDGAECGTF GVCDYTDTLK AAVLPTTYPC GTTRTILAQA MTAADLNAAC TSLQGEDAFY HGLVKDGGPI AGQYDTNVRL AVFATKWDYT VYSTALFGND TDNGGETLSG DITDPANQPI SVMYVKFPGD GFPASVWNLN HEYTHLLQGE FDMKGTFDQE ISVPDIWWVE GEAEYVSYAY RGLNNTQAIG EASQHAFPLS TLFQTTYDNT TTDRTYTWGY LAVRYMIEKH PADVQAMLAK FRVGDWAGGY AVYNAIGTKY DADFDAWLDV CAAGACLVPG APTAAFTMAP DGLSVHFSDV STDTGSPLNF ERWSYGDNTI STTDGPNPTH TFPAAGTYTI ALTVDDAKGL SSTYAQNVTV TGPSGPPPCP SANPQAMDRN CSRADQSETA GDYDSLWIYL PAGQVTLHVA TTGGSGNADL YYDPDTWATK QTHTAKSTGG GNDQSITVTN KTAGYRYISL YAVTSFSGVS VSTQY
|
| |