Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1649 |
Symbol | |
ID | 5899104 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 1728972 |
End bp | 1730396 |
Gene Length | 1425 bp |
Protein Length | 474 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641562138 |
Product | protease Do |
Protein accession | YP_001683276 |
Protein GI | 167645613 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0827513 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGATCCT ATCGCGTCGT GCTTCCCGCC CTGGCCCTGC TGGCCGCCTG CTCGCCGCAA GGGCCGTCCC AGGCCCAGTC GATCCCGGAC CTGGCCCAAC CCACCCGCCG CGCGCCCAGC GACCCGATGT CGATGAAGGC CTCGTTCGCG CCGGTGGTGA AGAAGACCGC CCCGGCCATC GTCAACGTCG CCAGCCGCCG CGTGGTGCGA CAGCAGGCGC GCGATCCGTT CTGGGACTTC TTCATGGGCG GCGGCGGCGG CGCGCCGCGC GACCAGGTGC AAGGGTCTCT TGGTTCTGGC GCCATCGTCC GCGCCGACGG GGTGATCATC ACCAACCACC ACAATATCGA GGGCATGAGC GACGTCACCG TCCAGCTGGC CGACCGCCGG GAGTTCCCGG CCACCGTGCT GCTCGACGAT CCACGCTCCG ACCTGGCGGT GCTGAAGATC GACACCAAGG GCGAGCGCCT GCCGGTGATC GCCATCGACG ACCAGGAGCA GCTGGAGGTC GGCGATCTGG TTCTGGCCCT GGGCAATCCG TTCGGCGTCG GCCAGACGGT GACCAACGGC ATCGTCTCGG CCCTGGCCCG CACCGATGTC GGCGCCGCGG AGTTTGGCAG CTACATCCAG ACCGACGCCT CGATCAATCC GGGCAATTCC GGCGGTCCCC TGGTCGACAT GGACGGCGAC CTGATCGGCA TCAACACCTT CATCATCTCG CGCTCGGGCT CGTCGAGCGG GGTCGGCTTC GCGATCCCGG CGGCCGTCGT GCGCCAAGTG GTGAGCACGG CCCTCGGCGG GGCTCACAGC GTGGTCCGCC CCTGGCTGGG CGTGAAGGGC CAGCCGGTGA CCGGCGACAT CGCCAAGAGC CTGGGCCTTG CCGCGCCGCG CGGCGTGGTG ATCTCCGACG TCTATCCCGG CGGTTCGGCG CAGCGGGCCG GCATCCGTGA AGGAGACGTG ATCTTGACCA TCGACGGCCA GGCGGTGAAC GACGAGGGCG GCGGCGCCTT CGCCATCGGC ACCCACAAGG TCGGCGACCG GGTCACGGTG CTGATCAACC GCGGCGGCAG GGAGCAGACC CTGACCCTGC GCGCCGAGGC CGCGCCGGAG AGCCCGGTCC GTGACGAGCG GGTGCTCAAG GGCCGCAACC CGTTCGACGG CGCCACGGTG GTCAACCTGT CGCCGGCCGT GGCCCAGGAC CTGGGCGTCG ACGCCTTCGC CGGACGGGGG GTGCTGGTCA CCAAGATCGG CCAGGGCTTC GCCCTGAACG CCGGCCTGCG CCCGGGCGAC TTCATCCGCG AGATCAACGG CAAGGCCATC AACACCACCG CCGAACTGGC GGCGGCCGCC AACGCCGGCG CCTCGGTCTG GACCGTGACC ATCGAGCGGG GCGGCCAAAG GATCACGGCG AGGCTGCGGG CTTAA
|
Protein sequence | MRSYRVVLPA LALLAACSPQ GPSQAQSIPD LAQPTRRAPS DPMSMKASFA PVVKKTAPAI VNVASRRVVR QQARDPFWDF FMGGGGGAPR DQVQGSLGSG AIVRADGVII TNHHNIEGMS DVTVQLADRR EFPATVLLDD PRSDLAVLKI DTKGERLPVI AIDDQEQLEV GDLVLALGNP FGVGQTVTNG IVSALARTDV GAAEFGSYIQ TDASINPGNS GGPLVDMDGD LIGINTFIIS RSGSSSGVGF AIPAAVVRQV VSTALGGAHS VVRPWLGVKG QPVTGDIAKS LGLAAPRGVV ISDVYPGGSA QRAGIREGDV ILTIDGQAVN DEGGGAFAIG THKVGDRVTV LINRGGREQT LTLRAEAAPE SPVRDERVLK GRNPFDGATV VNLSPAVAQD LGVDAFAGRG VLVTKIGQGF ALNAGLRPGD FIREINGKAI NTTAELAAAA NAGASVWTVT IERGGQRITA RLRA
|
| |