Gene Information |
Locus tag | Swit_5274 |
Symbol | |
ID | 5195864 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingomonas wittichii RW1 |
Kingdom | Bacteria |
Replicon accession | NC_009507 |
Strand | - |
Start bp | 169911 |
End bp | 171416 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640579201 |
Product | protease Do |
Protein accession | YP_001260149 |
Protein GI | 148550710 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
| |
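The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the unspliced CDS ends in a single stop codon. The function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1      # inclusive span
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1    # drop the stop codon
    return gene_length, protein_length


# Values copied from the record: Start bp 169911, End bp 171416.
gene_len, protein_len = cds_lengths(169911, 171416)
print(gene_len, protein_len)  # 1506 501
```

The result agrees with the Gene Length (1506 bp) and Protein Length (501 aa) fields in the record.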
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGAAATC CTGACAGGAT CTTTGCCTCC CTATTAAGCA CCGCCGCCGC CGTGGGTCTT GTCATCGGCC AGCCATCCCC CGCTGCCGCG CAATCGCCGC CCGCCGCATC GCCAGTCGTG CCACGCCCAG GCGCGCCGCA AAGCTTTGCC GATCTCACCG AAAGGCTGGC GCCCGCCGTC GTCAACATCT CGACCCGCCA GCGCGTGCAG ATGCCGAGCT TCAGCCCCTT TGCCGGCACG CCCTTCGAGC GCTTCTTCGG AAGTCCCACG GGACCGCGCA CCCGCGAGGC GCAGTCGCTC GGCTCAGGCT TCATCCTGTC CGCCGACGGC TACATTGTCA CCAACAACCA CGTCATCACC GCCGACGGGC AGGGCAAGGT CGAGACGATC ACGGTCACCT TGCACAATGG CGAGGAATAT CCGGCGACCC TCGTCGGTAG CGATCCCGCC TCGGACCTGG CGGTCCTCAA GATTACGTTG CGCAAACCGC TGCCCTTCGT GACCTTCGGC GATTCCACGC GAGTCAGGGT TGGCGACTGG GTGCTCGCGA TCGGCAATCC TTTCGGCCTG GGCGGCACGG TCACAGCCGG TATCGTCTCG GCGGTCTACC GCAATACCGG TACAGGGCGC GCCTATGACC GCTATCTGCA AACCGACGCC TCGATCAACC GCGGCAATTC GGGCGGACCG ATGTTCGACA GCAGCGGGCG GGTCATCGGC ATCAACAATG CGATCTTCTC GCCCACGGGC GGAAATGTCG GGATCGGCTT TGCCATCCCC GCGGAGATCG CCGCGCCGAT CGTCGAGAAG CTCAAGGCCG GCAAGGCCAT CGAACGCGGT TATCTGGGCG TGACGATCCA GCCGATGACC GAAGACCTCG CGTCATCGCT CGGCGTTCCG CGAGACCGCG GCGAGTTCGT CCAGAGCGTG GAGCCCGGCG GGCCCGCGGC ACAGGCTGGC ATCCGCGCCG GCGACGTCAT CCTGCGGGTC GACGGCAAGG AGGTGACTCC AAGCCAAAGC CTGTCGTTCC TCGTTGCGAG CGTCGAACCT GGCCGCAAGG TCGCAGTCGA ACTCATGCGC GGCAACCAGC GCATGACCGT GACTGCCACG CCGGTGCTTC GTCCCAGCGA GGACAAGCTC GCCCGGCAGG GTTTTGGCCG CGATGACCGT CGTTTCGACA ATTTCGACAA GGATCGCGCG TCCCCCAGCG AAAAGACACT CGGGCTTGCC GTCGAACCGC TGACTCCAGG CATCGCGCGT CAACTCGGTG CGAGCGACGT CTCCCAGGGT CTCGTCATCA GCAGCGTCGA GGGCAATTCC GATGCGGCGC GCAAAGGCCT CAGCCGCGGC GACATCATCC TGTCGGCCAA CAACCGGCCG GTGGCGAGCG CTGCTGATCT CGAGGCCGCG ATCCGGCAGG CCAGGACGGC GGGACGATCG GCTATCCTGC TACGGGTCAA GGGGCGCGGC GAAGCGCCGG CCTATGTGCC GGTACGGCTG CGATAG
|
Protein sequence | MRNPDRIFAS LLSTAAAVGL VIGQPSPAAA QSPPAASPVV PRPGAPQSFA DLTERLAPAV VNISTRQRVQ MPSFSPFAGT PFERFFGSPT GPRTREAQSL GSGFILSADG YIVTNNHVIT ADGQGKVETI TVTLHNGEEY PATLVGSDPA SDLAVLKITL RKPLPFVTFG DSTRVRVGDW VLAIGNPFGL GGTVTAGIVS AVYRNTGTGR AYDRYLQTDA SINRGNSGGP MFDSSGRVIG INNAIFSPTG GNVGIGFAIP AEIAAPIVEK LKAGKAIERG YLGVTIQPMT EDLASSLGVP RDRGEFVQSV EPGGPAAQAG IRAGDVILRV DGKEVTPSQS LSFLVASVEP GRKVAVELMR GNQRMTVTAT PVLRPSEDKL ARQGFGRDDR RFDNFDKDRA SPSEKTLGLA VEPLTPGIAR QLGASDVSQG LVISSVEGNS DAARKGLSRG DIILSANNRP VASAADLEAA IRQARTAGRS AILLRVKGRG EAPAYVPVRL R
|
| |
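The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the sequence is given 5' to 3' on the coding strand (it begins with GTG and ends with TAG, which is consistent with that) and uses translation table 11, which shares its amino acid assignments with the standard code and differs in the start codons it allows. All function names here are hypothetical. Only the first 30 bases of the record are used, so the output is just the first ten residues.

```python
# Codon table built from the conventional TCAG ordering; the amino acid
# assignments are the same for the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons permitted by table 11; an initiator is read as Met.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            aa = "M"  # alternative start codons still initiate with Met
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in the record.
gene_prefix = "GTGCGAAATC CTGACAGGAT CTTTGCCTCC"
print(translate_cds(gene_prefix))  # MRNPDRIFAS
```

The printed residues match the first block of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.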