Gene Information |
Locus tag | Cwoe_3922 |
Symbol | |
ID | 8734379 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | - |
Start bp | 4163601 |
End bp | 4164872 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 646504546 |
Product | peptidase S1 and S6 chymotrypsin/Hap |
Protein accession | YP_003395714 |
Protein GI | 284045374 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
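
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the source record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names (`span_length`, `protein_length`, `gc_percent`) are hypothetical helpers introduced for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4163601
END_BP = 4164872
GENE_LENGTH_BP = 1272
PROTEIN_LENGTH_AA = 423


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # Both checks pass for the values listed in this record.
    assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

`gc_percent` can be applied to the Gene sequence field in the Sequence section, spaces included, to compare against the reported GC content; the sequence is not repeated here.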
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.155869 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACTCAT CCAAGGCCAT CGTCACCCTC GCCGCGGCGG CCGTGCTCGG CGCCGCGGGC GGCGCTGCGG TCGTGGGCGT CGCCGGCGAC GGCGGCGGCT CGACCAAGAC GGTGCTCGAG CCGGCTGCGG CACAGCCGCA GCCCGTCAGC GTCGCGCAGA GAAACGGCGA CGCACTCACC CCCAAGCAGG TCTACTCGCT CGCGAGAGAC TCGGTCGTGT TCATCACCTC CGACGTCACC GAGCAGGGTC AGTCCGGTCA GGCGACCGGC TCCGGCTTCG TCATCTCCAA GGACGGCTAC ATCGTCACCA ACGCGCACGT CGTCAACGGC GCGTCGAAGG TCACCGTGAA GATCGGTGAC GGGCAGACGC AGGACGCCGA GATCGTCGGC AAGGACGAAT CGACCGACAT CGCACTGCTG AAGGTGAGCG GCAGCGACGA CCTCAAGCCG CTGCAGTTCG CCGACTCCGA CAAGATCTCC GTCGGCGACC CGATGTACGC GATCGGCAAC CCGTTCGGCC TCGACCGCAC GCTCACGACC GGCGTCGTCT CCGCGCTCCA GCGGCAGATC ACGGCGCCCA ACGGCTTCTC GATCGACGGC GTGATCCAGA CCGACGCGCC GATCAACCCT GGCAACTCGG GCGGCCCGCT GCTCGACGCC CACGGCGAGG TGGTCGGCGT CAACTCGCAG ATCCTCAACG GCGGCGGCAG CTCCAGCGAG GGCAACGTCG GCATCGGCTT CGCGGCACCG TCGAACACGG TCAAGAACGT CGTCGAGCAG CTGCGGCAGA ACGGCTCGGT CGAGCACGCC TACCTCGGCG TCCAGATGGG CGACGCGGCG AGCGGCGGCG GCGCGCAGGT CGGCGCCGTG ACCCCGGACG GCCCGGCCGC CGCGGGCGGT GTCCAGCAGG GCGACGTGAT CACCAGCTTC GACGGCAAGA CCGTCACCGA CGCCGCCTCG CTGTCGAGCA TGGTCAACGC CAAGCAGGTC GGCGACAAAG TCGAGCTGGA GGTTCGCCGC GGCGACGGCG AGCAGACGCT CAGCGTGACG CTCGCCGCGC AGCCCGCCTC GGCGAGCAGC GCGCAGCAGC AGAGCCAGGT CGATCCGCAG CAGCAGGTCG ATCCCAACCA GCAGGTGGAC CCGCAGCAGC AGGTGGACCC GCAGCAGCAG GTGGATCCGA ACCAGCAGGT CGATCCGCAG CAGCAGGTCG ATCCCAACGG CGGCCAGCAG CAGATCGACC CGCGCGACCT GCTCGAGCAG CTCATGCCCT GA
|
Protein sequence | MNSSKAIVTL AAAAVLGAAG GAAVVGVAGD GGGSTKTVLE PAAAQPQPVS VAQRNGDALT PKQVYSLARD SVVFITSDVT EQGQSGQATG SGFVISKDGY IVTNAHVVNG ASKVTVKIGD GQTQDAEIVG KDESTDIALL KVSGSDDLKP LQFADSDKIS VGDPMYAIGN PFGLDRTLTT GVVSALQRQI TAPNGFSIDG VIQTDAPINP GNSGGPLLDA HGEVVGVNSQ ILNGGGSSSE GNVGIGFAAP SNTVKNVVEQ LRQNGSVEHA YLGVQMGDAA SGGGAQVGAV TPDGPAAAGG VQQGDVITSF DGKTVTDAAS LSSMVNAKQV GDKVELEVRR GDGEQTLSVT LAAQPASASS AQQQSQVDPQ QQVDPNQQVD PQQQVDPQQQ VDPNQQVDPQ QQVDPNGGQQ QIDPRDLLEQ LMP