Gene Caul_1649 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1649 
Symbol 
ID5899104 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1728972 
End bp1730396 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content71% 
IMG OID641562138 
Productprotease Do 
Protein accessionYP_001683276 
Protein GI167645613 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0827513 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGATCCT ATCGCGTCGT GCTTCCCGCC CTGGCCCTGC TGGCCGCCTG CTCGCCGCAA 
GGGCCGTCCC AGGCCCAGTC GATCCCGGAC CTGGCCCAAC CCACCCGCCG CGCGCCCAGC
GACCCGATGT CGATGAAGGC CTCGTTCGCG CCGGTGGTGA AGAAGACCGC CCCGGCCATC
GTCAACGTCG CCAGCCGCCG CGTGGTGCGA CAGCAGGCGC GCGATCCGTT CTGGGACTTC
TTCATGGGCG GCGGCGGCGG CGCGCCGCGC GACCAGGTGC AAGGGTCTCT TGGTTCTGGC
GCCATCGTCC GCGCCGACGG GGTGATCATC ACCAACCACC ACAATATCGA GGGCATGAGC
GACGTCACCG TCCAGCTGGC CGACCGCCGG GAGTTCCCGG CCACCGTGCT GCTCGACGAT
CCACGCTCCG ACCTGGCGGT GCTGAAGATC GACACCAAGG GCGAGCGCCT GCCGGTGATC
GCCATCGACG ACCAGGAGCA GCTGGAGGTC GGCGATCTGG TTCTGGCCCT GGGCAATCCG
TTCGGCGTCG GCCAGACGGT GACCAACGGC ATCGTCTCGG CCCTGGCCCG CACCGATGTC
GGCGCCGCGG AGTTTGGCAG CTACATCCAG ACCGACGCCT CGATCAATCC GGGCAATTCC
GGCGGTCCCC TGGTCGACAT GGACGGCGAC CTGATCGGCA TCAACACCTT CATCATCTCG
CGCTCGGGCT CGTCGAGCGG GGTCGGCTTC GCGATCCCGG CGGCCGTCGT GCGCCAAGTG
GTGAGCACGG CCCTCGGCGG GGCTCACAGC GTGGTCCGCC CCTGGCTGGG CGTGAAGGGC
CAGCCGGTGA CCGGCGACAT CGCCAAGAGC CTGGGCCTTG CCGCGCCGCG CGGCGTGGTG
ATCTCCGACG TCTATCCCGG CGGTTCGGCG CAGCGGGCCG GCATCCGTGA AGGAGACGTG
ATCTTGACCA TCGACGGCCA GGCGGTGAAC GACGAGGGCG GCGGCGCCTT CGCCATCGGC
ACCCACAAGG TCGGCGACCG GGTCACGGTG CTGATCAACC GCGGCGGCAG GGAGCAGACC
CTGACCCTGC GCGCCGAGGC CGCGCCGGAG AGCCCGGTCC GTGACGAGCG GGTGCTCAAG
GGCCGCAACC CGTTCGACGG CGCCACGGTG GTCAACCTGT CGCCGGCCGT GGCCCAGGAC
CTGGGCGTCG ACGCCTTCGC CGGACGGGGG GTGCTGGTCA CCAAGATCGG CCAGGGCTTC
GCCCTGAACG CCGGCCTGCG CCCGGGCGAC TTCATCCGCG AGATCAACGG CAAGGCCATC
AACACCACCG CCGAACTGGC GGCGGCCGCC AACGCCGGCG CCTCGGTCTG GACCGTGACC
ATCGAGCGGG GCGGCCAAAG GATCACGGCG AGGCTGCGGG CTTAA
 
Protein sequence
MRSYRVVLPA LALLAACSPQ GPSQAQSIPD LAQPTRRAPS DPMSMKASFA PVVKKTAPAI 
VNVASRRVVR QQARDPFWDF FMGGGGGAPR DQVQGSLGSG AIVRADGVII TNHHNIEGMS
DVTVQLADRR EFPATVLLDD PRSDLAVLKI DTKGERLPVI AIDDQEQLEV GDLVLALGNP
FGVGQTVTNG IVSALARTDV GAAEFGSYIQ TDASINPGNS GGPLVDMDGD LIGINTFIIS
RSGSSSGVGF AIPAAVVRQV VSTALGGAHS VVRPWLGVKG QPVTGDIAKS LGLAAPRGVV
ISDVYPGGSA QRAGIREGDV ILTIDGQAVN DEGGGAFAIG THKVGDRVTV LINRGGREQT
LTLRAEAAPE SPVRDERVLK GRNPFDGATV VNLSPAVAQD LGVDAFAGRG VLVTKIGQGF
ALNAGLRPGD FIREINGKAI NTTAELAAAA NAGASVWTVT IERGGQRITA RLRA