Gene Swit_2354 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSwit_2354 
Symbol 
ID5199938 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingomonas wittichii RW1 
KingdomBacteria 
Replicon accessionNC_009511 
Strand
Start bp2616981 
End bp2618537 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content70% 
IMG OID640581900 
Productprotease Do 
Protein accessionYP_001262851 
Protein GI148555269 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGTTATG CCTATGGGAT CACCGCGGCC TTGCTGGCGG GCGGTGCGGC CGCAACCCTG 
ACCTTGCAGC AACCGGTCGG CGCCCAGGTG GCGCAGAACG CGCCGGGCTC GATCAACGCG
ACCGCACCCC GACCGGGCGC GCCGATGAGC TTCGCCGATC TCGCGGCCAA GCTGCAGCCG
GCGGTCGTCA ACATCTCCAC CACCCAGAAG ATCCAGGTGC GCGGCGGCGG CAACGCCTTC
TCCGGCACCC CGTTCGAGGA ACTGTTCCGG CGCTTCGGCG GCGGCCAGGG CGACGACGGC
AAGCCGATCA CGCGCGAGGC GACCTCGCTC GGCTCGGGCT TCATCATCTC GCCCGACGGC
TATGTCGTCA CCAACAACCA CGTCATCTCG GCCTCGCCCG AGGGCGGCAG CGGCGCGGTG
GTCAGCTCGA TCACCGTCAC CCTGCCCGAC CGCAAGGAAT ATAAGGCGAC GATCGTCGGC
CGCGACCAGA CGTCGGACCT GGCGCTGCTC AAGATCGACG CGAAGAACCT GCCCTTCGTC
CAGTTCGGCG ATTCGACCCG TACCCGGGTC GGCGACTGGG TGGTGGCGAT CGGCAATCCG
TTCGGCCTCG GTGGCACGGT GACGGCGGGC ATCGTCTCGG CGCTGCACCG CTCGATCGGC
ATCAACGGCC CCTATGACCG CTACATCCAG ACCGACGCCT CGATCAACCA GGGCAATTCG
GGCGGCCCGA TGTTCGATCT GCAGGGCAAC GTCATCGGCA TCAACACGGC GATCTTCTCG
CCGACCGGCG GCAATGTCGG CATCGGCTTC GCGATCCCGG CCGAGGAGGC CAAGCCGATC
ATCGACCAGC TCCGCACCGG CCAGCGGGTG CGGCGCGGCT ATCTGGGCGT CGGCATCCAG
CCGATGACCG AGGACATCGC CAGCAGCCTG GGCCTGCCCA AGGACCGCGG CGAGATCGTC
GCCCGGGTCG AGCCGGGCGA GGCGGCGGCG CGCGCGGGCA TCCGCCAGGG CGACGTCATC
GTCCGCGTCG ACAATCAGGA GATCACCCCC GACAACACGC TGTCCTACAT CGTCGGCAAG
GCCGCGGTGG GCGCGCGCCT GCCGATCGAG CTGATCCGCG AGGGCCAGCG CAAGACGGTG
ACGGTGACGC TGGGCGAACG CCCGCCCGAG GACCAGCTCG CCAGCGCCGG CAACCTCGAC
GAGGACCAGG GCGACGACGC CCCGGGCGCG GCGCAGAGCG CGCCCGACCA GTCGACCCGC
ACGGCGATCG GCCTCGGCCT GCAGACGCTG ACGCCCGACA TCGCCCGGCG CCTGGGCGTC
TCGTCGACGC TGCGCGGCGT GGTGATCAAC TATGTCGATC CGTCGAGCGA TGCCGCGGCC
AACGGCTTCC AGCCGCGCGA CATCATCCTG CAGATCAACA ATGTGCCGGT GGCGACGGTC
CAGGCGGCGG CGGCGAAGAT CACCGAGGCG CAGAAGGCCA AGCGCCCGAC TGTGCTCTTG
TTCGTCCAGC GCGGCAACAA TCCGCCGCGC TATGTGGGCG TGCAGATCCG CAACTGA
 
Protein sequence
MRYAYGITAA LLAGGAAATL TLQQPVGAQV AQNAPGSINA TAPRPGAPMS FADLAAKLQP 
AVVNISTTQK IQVRGGGNAF SGTPFEELFR RFGGGQGDDG KPITREATSL GSGFIISPDG
YVVTNNHVIS ASPEGGSGAV VSSITVTLPD RKEYKATIVG RDQTSDLALL KIDAKNLPFV
QFGDSTRTRV GDWVVAIGNP FGLGGTVTAG IVSALHRSIG INGPYDRYIQ TDASINQGNS
GGPMFDLQGN VIGINTAIFS PTGGNVGIGF AIPAEEAKPI IDQLRTGQRV RRGYLGVGIQ
PMTEDIASSL GLPKDRGEIV ARVEPGEAAA RAGIRQGDVI VRVDNQEITP DNTLSYIVGK
AAVGARLPIE LIREGQRKTV TVTLGERPPE DQLASAGNLD EDQGDDAPGA AQSAPDQSTR
TAIGLGLQTL TPDIARRLGV SSTLRGVVIN YVDPSSDAAA NGFQPRDIIL QINNVPVATV
QAAAAKITEA QKAKRPTVLL FVQRGNNPPR YVGVQIRN