Gene Information |
Locus tag | Caul_4054 |
Symbol | |
ID | 5901516 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4386833 |
End bp | 4387927 |
Gene Length | 1095 bp |
Protein Length | 364 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641564575 |
Product | peptidase S1 and S6 chymotrypsin/Hap |
Protein accession | YP_001685677 |
Protein GI | 167648014 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
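The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the numbers in this record) and that the final codon is a stop codon. The function name `cds_lengths` is hypothetical and used only for illustration.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range, so add 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein_len = gene_len // 3 - 1   # the last codon is the stop codon
    return gene_len, protein_len


# Coordinates taken from the record for Caul_4054.
print(cds_lengths(4386833, 4387927))  # (1095, 364)
```

The result matches the reported Gene Length (1095 bp) and Protein Length (364 aa).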
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.779263 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGAAAG CGCGCAAGTA CGAGCTGTTC GCCAAGGCCC GCGCGCCCAG GGCTTCGCGG CGGCGCGAGA TCCTGGCCTA TCCCGACGTC GAGACCGAGG GCGCCACCTT CGTCTTCGAT CCCGGCGAGA TCCCCCAGGC CGGCGGACCG CCGGCCTCCG ACGACGCCCT GCTGGACGCC TATTCCAACG CCGTGGTCCG GGCCGTCGAG GCGGTGGGAC CCTGCGTCGT GCGCATCCAT CCGATGACCG AGGACCCGCG CATCCAGGGC GTTGGCTCGG GCTTCGCCAT CGCCGCGGGC GGCCTGATCC TGACCAACAG CCATGTGGTC CAGGGCGCCG CTCGCTTCGT GGTGATCACC GCCGAGGGCC GCAGCCTGAC GGCCCGCTGC GTGGGCGACG ATCCCGACAC CGACCTGGCC TTGCTGCGGC TGGACCACCG GCTCGACCTG CCCGTCGCCC GGCTGGGCAA TTCCAAGGCC CTGCGGCGCG GCCAGCTGGT GATCGCCATC GGCGCCCCGC TGGGCTTCGA GGCGACGGTG ACCACCGGCG TGGTCTCCGC CCTGGGCCGC TCGCTGCGCG GCGAGCGCGG CCGGCTGATC GAGGACCTGA TCCAGACCGA CGCCGCCCTC AATCCCGGCA ACAGCGGCGG GCCGCTGGTC GCCTCGTCCG GCGAGGTGGT CGGGATCGCC ACCGCGGTGA TCGCCGGCTA CCAGGGCCTG TGCTTCGCCG TGGCCTCCAA CACCGCCACC TTCGTGATCG CCGAGCTGAT CGCCCACGGC CATGTGCGGC GCGGCTCGAT CGGACTGGTC GCCCAGCAGG CGCCGATCCC GCCGGGCCTG GCGCGGGCGA CCGGCGTCAC CCAGGGCTAT GCGGTGTTCG TCGCCCAGGT CGACCCCGGC GGCCCGGCCG CCAGGGCCGG GGTGCGCGAG GGCGACCTGC TGATCAGCGC CGGCGGGGTT CCCCTGACCG GCCTGGACGA CCTGCTGCGG GCGCTCGACC ACCACAGCAT CGACAAGCCC TGCGTCTTCC TGCTGATCCG CGAGGGCAAG CTGATGACGG TAAGCATCAC GCCCCGAGCG CGCAAGCGCG GGTAA
|
Protein sequence | MLKARKYELF AKARAPRASR RREILAYPDV ETEGATFVFD PGEIPQAGGP PASDDALLDA YSNAVVRAVE AVGPCVVRIH PMTEDPRIQG VGSGFAIAAG GLILTNSHVV QGAARFVVIT AEGRSLTARC VGDDPDTDLA LLRLDHRLDL PVARLGNSKA LRRGQLVIAI GAPLGFEATV TTGVVSALGR SLRGERGRLI EDLIQTDAAL NPGNSGGPLV ASSGEVVGIA TAVIAGYQGL CFAVASNTAT FVIAELIAHG HVRRGSIGLV AQQAPIPPGL ARATGVTQGY AVFVAQVDPG GPAARAGVRE GDLLISAGGV PLTGLDDLLR ALDHHSIDKP CVFLLIREGK LMTVSITPRA RKRG
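The relationship between the gene sequence, the protein sequence, and the GC content field can be checked with a short script. This is a sketch, not the database's own method: it builds the codon table from the standard NCBI amino-acid string (translation table 11 assigns the same amino acids as the standard table and differs only in its set of start codons), and it does not handle alternative start codons, which is harmless here because the gene begins with ATG. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. Only the first 30 bases are used below to keep the example short; pasting the full gene sequence in place of `prefix` works the same way.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... matches the NCBI table layout.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the blocks-of-ten spacing used in the record and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate complete codons in frame 1; stop codons appear as '*'."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the gene sequence from the record.
prefix = "ATGTTGAAAG CGCGCAAGTA CGAGCTGTTC"
print(translate(prefix))    # MLKARKYELF
print(gc_fraction(prefix))  # 0.5
```

The translated prefix agrees with the first block of the protein sequence above. The 0.5 figure applies to this 30-base prefix only; the record reports 73% for the whole gene.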