Gene VC0395_A0099 details


Gene Information

Locus tag: VC0395_A0099
Symbol: degS
ID: 5137288
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio cholerae O395
Kingdom: Bacteria
Replicon accession: NC_009457
Strand:
Start bp: 89188
End bp: 90246
Gene Length: 1059 bp
Protein Length: 352 aa
Translation table: 11
GC content: 49%
IMG OID: 640531559
Product: protease DegS
Protein accession: YP_001216064
Protein GI: 147675045
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02038] periplasmic serine peptidase DegS
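
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the listed gene sequence ends in TAG). The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(89188, 90246)
print(length)                  # 1059
print(protein_length(length))  # 352
```

Under these assumptions, both values agree with the Gene Length and Protein Length fields listed above.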


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000000230294
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGAAAT TTTGGGTTCG CTCAATCAGC CTTGGGTTGT TGGCTGCGAT TGCCATTATT 
ATGGTGACAC CCTCACTACG CGCCAAATTA ATGCCCGTTG TCGAACAACC ACGCAACATC
GGCGCTCTAC AAATCTCATT TAATGAAGCG GTACGCAAAG CCGCCCCTGC CGTCGTCAAT
ATTTATAACC GTAAATACAG CGAAAATGAT CGCCGTAAAC TCTCGATTCA AGGTTTAGGA
TCCGGTGTCA TTGTCAGCGA AAAAGGCTAC ATCATCACCA ACTACCACGT CGTCGCGCAG
GCCGATCAAA TTGTCGTTGC TCTACAAGAT GGGCGAGCCG CAGCAGCACA ATTGGTGGGA
AAAGATCGCC GTACCGATAT TGCCGTATTA CGCGTAGAAG GCACGGGTTT ACCAGTGATT
CCACTCAATC CAGATTACCA TCCTAAAGTG GGGGACGTGG TGTTGGCGAT TGGTAACCCT
TACAACTTAG GGCAAACCAC GACTTTCGGA ATTATCTCGG CTACCGGACG TTCATCCATC
AGCGCTGATG GTCGCCAAGC CTTTATTCAA ACTGATGCCG CAATCAATGA CGGCAACTCA
GGTGGTGCAT TGGTCAATAC CCAAGGTGAA CTGGTCGGCA TCAATACCGC CTCTTTTCAA
CAAGCCACCG ATCTCGAAAC TTACGGGATT TCGTTTGCGA TTCCCTACTC TTTGGCCAGT
AAAATTATGA CCAAAATCAT TGCTGATGGC CGCGTGATCC GCGGTTATAT TGGCGTCGAC
GGTCAAGATA TTAACTCGAT GACATCACGT TTGCTGGGGA ATGAGCATGT CGGTGGGATC
ATTATTTTAG GGGTTGACCC GAATGGACCC GCAGCCCGAG CAGGCTTTCT GGAGCAAGAT
ATTTTGCTGA AAATCGACGG TAAAAAAATT AATGGCCGCC AGAATGTCAC AGATACCGTC
ACCGATCTTC GCCCCGGCAC TGTGGTGGAT TTCACCCTAC TGCGTAAGGG TGAAGAGATT
GTACTCCCAG TTACGATTGG TGAAGACACT CGTGATTAG
 
Protein sequence
MLKFWVRSIS LGLLAAIAII MVTPSLRAKL MPVVEQPRNI GALQISFNEA VRKAAPAVVN 
IYNRKYSEND RRKLSIQGLG SGVIVSEKGY IITNYHVVAQ ADQIVVALQD GRAAAAQLVG
KDRRTDIAVL RVEGTGLPVI PLNPDYHPKV GDVVLAIGNP YNLGQTTTFG IISATGRSSI
SADGRQAFIQ TDAAINDGNS GGALVNTQGE LVGINTASFQ QATDLETYGI SFAIPYSLAS
KIMTKIIADG RVIRGYIGVD GQDINSMTSR LLGNEHVGGI IILGVDPNGP AARAGFLEQD
ILLKIDGKKI NGRQNVTDTV TDLRPGTVVD FTLLRKGEEI VLPVTIGEDT RD
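
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the method used to produce this record: it assumes the space-separated blocks of ten bases are purely a display convention, and it uses the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code (table 11 differs in its permitted start codons; this gene starts with ATG, so no special handling is needed here). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Amino acid assignments for NCBI translation table 11,
# with codons enumerated in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases and last 9 bases of the gene sequence listed above.
print(translate("ATGCTGAAAT TTTGGGTTCG C"))  # MLKFWVR
print(translate("CGTGATTAG"))                # RD
```

The two printed fragments match the beginning (MLKFWVR) and the end (RD) of the protein sequence above. Passing the full gene sequence to `gc_content` would give a value to compare against the GC content field.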