Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A0099 |
Symbol | degS |
ID | 5137288 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | - |
Start bp | 89188 |
End bp | 90246 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640531559 |
Product | protease DegS |
Protein accession | YP_001216064 |
Protein GI | 147675045 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02038] periplasmic serine pepetdase DegS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00000000230294 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGAAAT TTTGGGTTCG CTCAATCAGC CTTGGGTTGT TGGCTGCGAT TGCCATTATT ATGGTGACAC CCTCACTACG CGCCAAATTA ATGCCCGTTG TCGAACAACC ACGCAACATC GGCGCTCTAC AAATCTCATT TAATGAAGCG GTACGCAAAG CCGCCCCTGC CGTCGTCAAT ATTTATAACC GTAAATACAG CGAAAATGAT CGCCGTAAAC TCTCGATTCA AGGTTTAGGA TCCGGTGTCA TTGTCAGCGA AAAAGGCTAC ATCATCACCA ACTACCACGT CGTCGCGCAG GCCGATCAAA TTGTCGTTGC TCTACAAGAT GGGCGAGCCG CAGCAGCACA ATTGGTGGGA AAAGATCGCC GTACCGATAT TGCCGTATTA CGCGTAGAAG GCACGGGTTT ACCAGTGATT CCACTCAATC CAGATTACCA TCCTAAAGTG GGGGACGTGG TGTTGGCGAT TGGTAACCCT TACAACTTAG GGCAAACCAC GACTTTCGGA ATTATCTCGG CTACCGGACG TTCATCCATC AGCGCTGATG GTCGCCAAGC CTTTATTCAA ACTGATGCCG CAATCAATGA CGGCAACTCA GGTGGTGCAT TGGTCAATAC CCAAGGTGAA CTGGTCGGCA TCAATACCGC CTCTTTTCAA CAAGCCACCG ATCTCGAAAC TTACGGGATT TCGTTTGCGA TTCCCTACTC TTTGGCCAGT AAAATTATGA CCAAAATCAT TGCTGATGGC CGCGTGATCC GCGGTTATAT TGGCGTCGAC GGTCAAGATA TTAACTCGAT GACATCACGT TTGCTGGGGA ATGAGCATGT CGGTGGGATC ATTATTTTAG GGGTTGACCC GAATGGACCC GCAGCCCGAG CAGGCTTTCT GGAGCAAGAT ATTTTGCTGA AAATCGACGG TAAAAAAATT AATGGCCGCC AGAATGTCAC AGATACCGTC ACCGATCTTC GCCCCGGCAC TGTGGTGGAT TTCACCCTAC TGCGTAAGGG TGAAGAGATT GTACTCCCAG TTACGATTGG TGAAGACACT CGTGATTAG
|
Protein sequence | MLKFWVRSIS LGLLAAIAII MVTPSLRAKL MPVVEQPRNI GALQISFNEA VRKAAPAVVN IYNRKYSEND RRKLSIQGLG SGVIVSEKGY IITNYHVVAQ ADQIVVALQD GRAAAAQLVG KDRRTDIAVL RVEGTGLPVI PLNPDYHPKV GDVVLAIGNP YNLGQTTTFG IISATGRSSI SADGRQAFIQ TDAAINDGNS GGALVNTQGE LVGINTASFQ QATDLETYGI SFAIPYSLAS KIMTKIIADG RVIRGYIGVD GQDINSMTSR LLGNEHVGGI IILGVDPNGP AARAGFLEQD ILLKIDGKKI NGRQNVTDTV TDLRPGTVVD FTLLRKGEEI VLPVTIGEDT RD
|
| |