Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_3631 |
Symbol | |
ID | 8334984 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 4062883 |
End bp | 4064028 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 644956772 |
Product | serine protease |
Protein accession | YP_003114375 |
Protein GI | 256392811 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4934] Predicted protease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.523915 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.916295 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACAGCTC GTATCAGGCT TCTGTTCTCC ATGCTCGTCG CGCTTCTGCT GGCCATGGCT GTGCCGGCGT TCGCCGATGC CAGTTCCTCG GTCGCGGCGA CCACGTACGC GCATCCGATG TTCGTTCCCG GGCACGCCGC GCCCGGGCTG CACCCCGACC TCGTGCCGTC CGGTTACGGG CCGGCTGATC TGCAGTCCGC CTACAAGCTG CCTTCGGGCA CCAACGGCGC CGGCCAGACG GTGGCGATCG TCGACGCCAA CAACGACCCG ACCGCCGAGG CCGACCTCGG CGTGTACCGG GCGCAGTACG GCTTGCCGGC GTGCACGACG GCGAACGGCT GCTTCAGGAA GGTGAACCAG ACCGGCGGCA CGAGCTATCC GCCGACCGAC GCCGGGTGGG CGACCGAGAT ATCCCTGGAC CTGGACATGG TCTCGGCGGT CTGCCCCAAG TGCCACATCC TGCTGGTCGA GGCCACGTCG GCCTCGTACG CCAACCTGGG CCAGGCGGTC AACGAGGCGG CGGCGCTGCA CGCCACCACG ATCTCCAACA GCTACGGCGG CGGCGACCTG TCCGACAGCT CGGCTCCGTA CTACAACCAC CCCGGCATCA TGATCACGGC CAGCTCCGGT GACGCCGGCT ACGGCGTGGA GTTCCCGGCG TCCTCGCGCT ACGTCACCGC GGTCGGCGGC ACCTCGCTGA CCCGGGCCTC CAACGCGCGC GGCTGGAACG AGACCGCCTG GAGCGGCGCC GGCTCCGGCT GCTCGGCCTA CAACCCGGCA CTGAGCGGCC AGGCCAGCTA CGGCACCGGC TGCGCCCGCC GCGCCGTGGC CGACGTGTCC GCCGTGGCCG ACCCGGCGAC CGGCGTCGCG GTCTACGACT CGACCCCCTA CGGCGGCCGC AGCGGCTGGC AGGTCTACGG CGGCACCTCG GTGGCCTCCC CGATCATCGC CTCCGTGTAC GCCCTGGCCG GCAACGCCGC CAGCATCAAC AACAACTACC CCTACACCCA CTACTCCGCG AGCACCTTCT TCGACATCAC GTCCGGCTCC AACGGCTCGT GCTCCCCGAC CCAGCTGTGC CACGCCCGCG TGGGCTGGGA CGGCCCGACC GGCCTGGGCA CCCCCAACGG CGTCGGCGGG TTCTGA
|
Protein sequence | MTARIRLLFS MLVALLLAMA VPAFADASSS VAATTYAHPM FVPGHAAPGL HPDLVPSGYG PADLQSAYKL PSGTNGAGQT VAIVDANNDP TAEADLGVYR AQYGLPACTT ANGCFRKVNQ TGGTSYPPTD AGWATEISLD LDMVSAVCPK CHILLVEATS ASYANLGQAV NEAAALHATT ISNSYGGGDL SDSSAPYYNH PGIMITASSG DAGYGVEFPA SSRYVTAVGG TSLTRASNAR GWNETAWSGA GSGCSAYNPA LSGQASYGTG CARRAVADVS AVADPATGVA VYDSTPYGGR SGWQVYGGTS VASPIIASVY ALAGNAASIN NNYPYTHYSA STFFDITSGS NGSCSPTQLC HARVGWDGPT GLGTPNGVGG F
|
| |