Gene Information |
Locus tag | Caul_3399 |
Symbol | |
ID | 5900854 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 3671375 |
End bp | 3672463 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641563905 |
Product | peptidase M48 Ste24p |
Protein accession | YP_001685024 |
Protein GI | 167647361 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
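The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 3671375
end_bp = 3672463
reported_gene_length = 1089    # bp
reported_protein_length = 362  # aa

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons (3 bp each).
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = codons - 1

print(gene_length == reported_gene_length)        # True
print(remainder == 0)                             # True
print(protein_length == reported_protein_length)  # True
```

Under these assumptions the reported gene length (1089 bp) and protein length (362 aa) agree with the start and end positions.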
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.132097 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCAGGCGC TTCTCTATGA CGGTCTCACC GCCCGGCCCT GGCCGGCGCG GGTGAGCCTG GACGGCGACC GCGTGGTCGC CGTCAGCGAA GCGCTCGACG CGGCGGGCGC GCCCTGTGCT CGCCTAGACT GGCCCCTGGC CAAGGTCCGC GAGCAGGAGC GTGACGAGAC CACCTATCGC CTGTCACTGG GCGGGGGCGA CGCACGGCTG ATCGTCGATA TCCAGGCCTG GCAGGTCCTG ACCGGTCGCA CGCGCCGTCA CGTCGTCCGG CGTCGACGGG GCGGGGAGTG GCGGGTGATC GGCGGTCTGG CGGTGGCCGG CCTCTCGGTC ATGGCCTTCG TCTTCATCGT CGTGCCGCTG GCCGCCGAGC CGCTGGCGCG CTGGACGCCG CCGGGCCTGG AGGCGCGGTT CGGGCGCAGC ATGCAGGCCC AGGTCCATCT CGCCATGCCG GTTTGCGAGG GCGATCCCCA GGGCGCGGCG GCGCTGGCGA AGTTGGGGCA AGCCCTGTCG GAGGGCGCCG ACACGCGCTT CCCGATCCGG GTGCAAGCGG TGCGCGCCCC GTTCGTCAAC GCCTTCGCCC TGCCCGGCGG GACGGTGATG GTCACCGACG AACTGATCGC CCAGGCCCAT TCGCCCGACG AACTGGCCGG GGTGATCGCC CACGAGATCG CCCATGTCGA GCGTCGCCAC GTCATGCAGG CGGCCTGGCG CTCGGCGGGC GCGGGCCTGG TGCTGGACGC CGTGGTCGGC GGCGGCAGCG GCGCGGGGCA GCAGGCGATC CTGCTGGCCA GCAGCCTGTC CAACCAGCGC TTCAGCCGCA AGCTGGAGGC CGAGGCCGAT ACGCGGGGCA TGCAGCTGCT GGCGCGGCGC GACATCTCCA GCCAGGGCAT GGCCGACTTC TTCGACCGGA TGGCCGACCG CAAGGCCGAC GCCCGGGTGC GTCAGGCCGC CGAATGGTTT TCGTCGCATC CCGACATGGT CGCGCGCGCG GCGCTGGCCA AGGCCGCCGC CCGGCCTGGA CGGCCGGCCC TGTCGGACGC CGATTGGCGG CGGGTCAAGG CCGTGTGCAA GGCTGGTCGC AAGCGCTAG
|
Protein sequence | MQALLYDGLT ARPWPARVSL DGDRVVAVSE ALDAAGAPCA RLDWPLAKVR EQERDETTYR LSLGGGDARL IVDIQAWQVL TGRTRRHVVR RRRGGEWRVI GGLAVAGLSV MAFVFIVVPL AAEPLARWTP PGLEARFGRS MQAQVHLAMP VCEGDPQGAA ALAKLGQALS EGADTRFPIR VQAVRAPFVN AFALPGGTVM VTDELIAQAH SPDELAGVIA HEIAHVERRH VMQAAWRSAG AGLVLDAVVG GGSGAGQQAI LLASSLSNQR FSRKLEAEAD TRGMQLLARR DISSQGMADF FDRMADRKAD ARVRQAAEWF SSHPDMVARA ALAKAAARPG RPALSDADWR RVKAVCKAGR KR
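The gene and protein sequences above are related through translation table 11. The sketch below shows one way to strip the 10-character block spacing, compute a GC fraction, and translate a fragment. It is a hedged illustration, not the database's own pipeline: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, the codon table is built by hand from the standard amino-acid assignments that table 11 uses for elongation, and `START_CODONS` lists only a subset of the initiation codons that table 11 permits.

```python
# Standard codon-to-amino-acid assignments, with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: only a subset of the table 11 initiation codons is listed here.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str, is_cds_start: bool = True) -> str:
    """Translate a nucleotide fragment, stopping at the first stop codon.

    If is_cds_start is True and the first codon is a recognized start codon,
    it is translated as methionine (M) regardless of its usual meaning.
    """
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        codon = s[pos:pos + 3]
        if pos == 0 and is_cds_start and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 21 bases of the gene sequence above (GTG start codon).
print(translate("GTGCAGGCGC TTCTCTATGA C"))        # MQALLYD
# Last 9 bases of the gene sequence above (ends in the TAG stop codon).
print(translate("AAGCGCTAG", is_cds_start=False))  # KR
# GC fraction of the first 10-base block only, not of the whole gene.
print(gc_fraction("GTGCAGGCGC"))                   # 0.8
```

The two translated fragments match the beginning (MQALLYD...) and end (...KR) of the protein sequence in the record, which illustrates why the GTG start codon appears as M in the protein.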