Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3889 |
Symbol | |
ID | 5901351 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 4208454 |
End bp | 4210028 |
Gene Length | 1575 bp |
Protein Length | 524 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641564410 |
Product | protease Do |
Protein accession | YP_001685512 |
Protein GI | 167647849 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.42007 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.578189 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGCTA GGAAGTCTGG TTTCATTATT GGTGCGGTGG CCGGGGCGGG CGTGGCCTGC GCGGCCCTGG CCGGGGTCGG CATGCGCATG GGTCCTGCGG GCGCCGCTGA GCCGGCTCAG GTGATCCGCA CCTCGGGCGC CGCCGGCGCT CCCGCTCCGA TGTTCGCGCC GCCGCCCGGC GCGCCGCTGT CGTTCGCCGA CATCTTCGAA CAGGTTTCGC CGGCCGTGGT CCAGATCGAC GTGACCTCCA AGGCCGCCGC CGTTCCGAAG TTGCGCATCC CGGGCCTGGA AGGCTTCGAC ATCGTGCCGA AGGGCCAGAA GGGCGAGGAC GGCGAAGACA CCCCTGGTCC CAAGCAGCAG TCCTCGGGTT CGGGCTTCTT CATCTCGGCC GACGGCTACC TGGTGACCAA CAACCACGTC GTGGCCGACG CCGACGCCGA TGGCATCAAT GTCGTGCTCA AGGACGGCCG CGAGTTGAAG GCCACCATCG TCGGCCGCGA CGAGGGCACG GACCTGGCGG TCCTCAAGGT CGTCGATCCC AAGGCCAAGG GCGCGGCCTT CCCGTATGTG AACTTCGAGA ACCAGGCCAA GCCGCGGGTC GGCGACTGGG TGATCACCAT CGGCAACCCG TTCGGCCTGG GCGGCACCGC CACGGCCGGC ATCATTTCGG CCTATAACCG CGACCTGGGC GACAGCTCGT CGACCTTCGT CGACTACATC CAGATCGACG CGCCGATCAA CCGGGGCAAT TCGGGCGGTC CGTCGTTCGA CATCTATGGC CGGGTGATCG GGGTCAACAC CGCGATCTTC TCGCCGACCG GCGGCTCGGT GGGCATCGGC TTCGCCATCC CCGCCGACGT GGCCGAGGCC ACCGCCAAGC AGCTGATCGC CGGCGGCAAG GTCGTGCGCG GCTATATCGG CGCCCAGATC CAGCCCTTCA CCAGCGAGAT GGCCGAAGCC CAGGGTCTGG CCGACGTCAA GGGCGCCATC GTCGCCGATC TGGTGCCCGG CGGTCCGGCC CAGAAGGGCG GCTTGATGCC CGAGGACGTG ATCACCGCCG TCAACGGCGT GAATATCAAG AGCGGCTCGG AACTGACCCG CGAAGTGGCC AAGGGCCGTC CTGGCGACAC CCTGAAGCTG TCGGTGCTGC GCGGCGGCAA GCCGCGCATG GTCGAGATCA AGTCGGGCGT TCGTCCGACC GAGAAGGAGC TGGCGCTCAA CGACGACGAC AGCGACGAGG GCGGCGCCGA CGCCCCGGCC AAGCCGCAGA CTCAGAAGTC GGAGGCTCTG GGCATGACCC TTGTCCCGAT CGACGAGGCG GCCCGCCGGA CCTACAGCAT CGATCCGGCC GTCAAGGGCG TGCTGATCGA CAGCGTGAAG GCCAATTCCG ACGCCGGCGA GAAGGGCCTG CGCAAGGGCG ACGTGCTGAC CAGCGTCAAC AGCGAGCCCG TCGCCAACGC CGCTCAGGTG AACTCGGCGG TTGAGGCGGC CAAGAAGCTG CAACGGCCCA GCGTCAACCT GCGGATCATC CGCGCCGGTC GTCCGACGAT CGTGCCGCTG AAGATCACCC CGTAA
|
Protein sequence | MAARKSGFII GAVAGAGVAC AALAGVGMRM GPAGAAEPAQ VIRTSGAAGA PAPMFAPPPG APLSFADIFE QVSPAVVQID VTSKAAAVPK LRIPGLEGFD IVPKGQKGED GEDTPGPKQQ SSGSGFFISA DGYLVTNNHV VADADADGIN VVLKDGRELK ATIVGRDEGT DLAVLKVVDP KAKGAAFPYV NFENQAKPRV GDWVITIGNP FGLGGTATAG IISAYNRDLG DSSSTFVDYI QIDAPINRGN SGGPSFDIYG RVIGVNTAIF SPTGGSVGIG FAIPADVAEA TAKQLIAGGK VVRGYIGAQI QPFTSEMAEA QGLADVKGAI VADLVPGGPA QKGGLMPEDV ITAVNGVNIK SGSELTREVA KGRPGDTLKL SVLRGGKPRM VEIKSGVRPT EKELALNDDD SDEGGADAPA KPQTQKSEAL GMTLVPIDEA ARRTYSIDPA VKGVLIDSVK ANSDAGEKGL RKGDVLTSVN SEPVANAAQV NSAVEAAKKL QRPSVNLRII RAGRPTIVPL KITP
|
| |