Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_2227 |
Symbol | |
ID | 8333576 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 2527821 |
End bp | 2528759 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 644955381 |
Product | peptidase S1 and S6 chymotrypsin/Hap |
Protein accession | YP_003112987 |
Protein GI | 256391423 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.247792 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0000126621 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGACAGAGG ACATATACGC CGGGCTGGAT GACTACTCCC GGACCGTGGT CTCCGTCGCC GACACGCTGC GGCCCTCGGT GCTCAGCGTG CGGGTCCGGG GGCGGACCGG GGAGGGTTCC GGTTCGGCCG TCGCCTTCAC CGGCGATGGA TTCCTGGCTA CGAGCGCGCA TGTCGTGGAA GGCGTGAGTG CCGGGCTGGC CGTCACTTCC GACGGCGACG AGTACGGATT CGACGTCGTG GGCCGCGACC GGCTCTCCGA CCTGGCGGTA CTCCGCATGC CGTCGGGCCG GGTGCAGGCC GCGACACTTG GCGACGCGGA CACGCTACGG GTCGGTCAGC TGGTCGTGGC CGTCGGCAAC CCGCTGGGGC TGGCCGGTTC GGTGACCGCG GGCGTGGTCA GCGCGTTGGG GAGATCGCTG CCGACGCGGA CGGCCCGCGG TCCGTCGGTG ATCGACGATG TGATCCAGAC CGATGCGGCG CTCAACCCTG GAAATTCTGG GGGAGCGCTC GCCGACCATC GCGGCCGGGT CGTCGGGGTG AACACGGCGG TGGCCGGGTT CGGACTGGGT CTGGCGGTGC CGGTGAACGC CACCACCCGT CTGATCCTGT CCGAGCTGAT CAGCGCTGGG AAGGTGCGCC GCGCCTGGCT CGGCGTTGGC GGATCGACCG CGCCGCTGCC TTCCGAGCTT GGCGGCAGGC TGGGACGCCG GACCGGGTTC GGTGTCGCTC AGGTGGTGCC GGGAAGCCCT GCGGCCCAGG CCGGGGTGCT GGTCGGCGAC GTGCTGCTCA GCGTCGAGGG CCGACCGGTC GACGATGCCC AGGACCTTCA GCGGATCATG CTGGCACGGC GCGCCGGGGC GCAGGTCGCG ATGACCCTGC ACCGGCGCGG CGCGATGGTG GACGTGACGG TCGTGCTGGC CGAGTTGTCA GGAGTCTGA
|
Protein sequence | MTEDIYAGLD DYSRTVVSVA DTLRPSVLSV RVRGRTGEGS GSAVAFTGDG FLATSAHVVE GVSAGLAVTS DGDEYGFDVV GRDRLSDLAV LRMPSGRVQA ATLGDADTLR VGQLVVAVGN PLGLAGSVTA GVVSALGRSL PTRTARGPSV IDDVIQTDAA LNPGNSGGAL ADHRGRVVGV NTAVAGFGLG LAVPVNATTR LILSELISAG KVRRAWLGVG GSTAPLPSEL GGRLGRRTGF GVAQVVPGSP AAQAGVLVGD VLLSVEGRPV DDAQDLQRIM LARRAGAQVA MTLHRRGAMV DVTVVLAELS GV
|
| |