Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A1579 |
Symbol | sppA |
ID | 5137261 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | + |
Start bp | 1700161 |
End bp | 1702011 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640533036 |
Product | protease IV |
Protein accession | YP_001217520 |
Protein GI | 147673256 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0616] Periplasmic serine proteases (ClpP class) |
TIGRFAM ID | [TIGR00705] signal peptide peptidase SppA, 67K type [TIGR00706] signal peptide peptidase SppA, 36K type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00000000832547 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATCAC TATTTCGTTT TGTTGGGCTG ATTTTGAAAG GGATTTGGAA AGCGATCACC TTCATCAGGC TGGCACTGAC TAACCTTATT TTCTTACTCA GTATTGGCAT TATTTACTTT ATTTATGTCC ACGCTGACGC CCCGCTACCC ACCATGGATA AATCCTCGGC ACTGGTGCTC AATCTGTCTG GCCCGATTGT TGAGCAGAGT ACGCACATCA ACCCGATGGA CTCCTTTACG GGCTCCGTGT TTGGTGAAGA GCTCCCCCGC GAAAACGTGC TGTTTGATAT TGTTGAAACC TTACGCCATG CGAAAAATGA CAACAATGTC ACCGGGCTTG TGTTGGCTTT AGGAGACATG CCAGAAACCA ACTTGACTAA GCTGCGTTAT ATCGCCAAAG CGATCAACGA ATTCAAAGCC TCTGGTAAGC CTGTGTTTGC GGTTGGGGAT TTTTATAATC AGAGCCAATA TTACTTGGCG AGTTATGCGG ACAAAATCTA CCTCGCTCCA GATGGTGCCG TACTGCTAAA AGGTTACAGC GCCTACTCCA TGTACTACAA AACTCTGCTT GAGAAATTGG ATGTCACCAC TCACGTCTTT CGTGTCGGCA CCTACAAATC GGCAATTGAG CCTTTTGTGC GTGATGATAT GTCTGATGCG GCTCGTGAAT CCGCTTCTCG CTGGCTCACT CAGTTGTGGA GCGCGTACGT CGATGATGTC GCGGCCAATC GCCAAATTGA GATCAAAACC CTTACTCCAA GCATGGAGCA GTTTGTCGCT CAGTTGAAAG AAGTGAATGG TGACTTAGCT GCACTGTCCA AAAAAGTCGG TCTCGTCGAT GAACTGGCGA CTCGTCAGCA AGTTCGCCAA ACGTTAGCGG AAACTTTTGG TAGCGATGGA AAAGACAGTT ATAACGCCAT CGGTTACTAC GAATACAAAA CCACCATTAA GCCAACGACA CTGACCGATG CCAACGATAT TGCCGTTGTC GTCGCGAGCG GTGCGATTAT GGATGGCTCA CAGCCACGTG GTACCGTGGG TGGCGATACC GTAGCTGGAT TACTGCGCGA AGCTCGTAAC GACAGCAATG TAAAAGCGGT CGTACTGCGT GTCGATAGCC CAGGGGGTAG CGCGTTTGCT TCCGAAGTGA TCCGCAATGA AATTGAAGCT CTGAAAGCGG CGGGGAAACC TGTGGTGGTG TCGATGTCAA GCCTTGCCGC TTCCGGTGGT TACTGGATTT CGATGAGCGC AGATAAGATT GTCGCCCAAC CGACCACACT GACAGGTTCA ATCGGTATTT TCAGCGTGAT CACTACCTTC GAGAAAGGAC TGAACAACCT TGGTATTTAT ACTGATGGTG TTGGTACAAC GCCTTTTTCA GGACAAGGCC TGACCACGGG CCTCACCCAA GGTGCAAAAG ATGCGATTCA ACTGGGTATT GAACACGGTT ATCAGCGCTT TATTTCTTTG GTTGCAGAGA AACGTGGCCT GACCTTAAAA GCAGTGGATG AACTGGCTCA GGGCCGAGTC TGGACTGCAC AAGATGCACA AACCCTCGGT TTAGTCGATC AGTTGGGTGA TTTTGATGAT GCCGTACATT TGGCAGCGGA TCTGGCACAG TTGGATCAAT ATAACCTGTA CTGGGTTGAA GAGCCACTCA CTCCAGCCCA GCAATTTTTA CAAGATCTGC TCGGACAAGT ACGTGTCAGC TTAGGTTTGG ATGTCTCCAC TCTCTTGCCA AAATCACTGC AACCCTTGGC AGTAGAGTGG CAACAACAAA CGTCGCTGCT CAATCAATTG AATGATCCTA AAGGGCAATA TGCGTTCTGT CTGCCCTGCC AAGTAGAATA G
|
Protein sequence | MKSLFRFVGL ILKGIWKAIT FIRLALTNLI FLLSIGIIYF IYVHADAPLP TMDKSSALVL NLSGPIVEQS THINPMDSFT GSVFGEELPR ENVLFDIVET LRHAKNDNNV TGLVLALGDM PETNLTKLRY IAKAINEFKA SGKPVFAVGD FYNQSQYYLA SYADKIYLAP DGAVLLKGYS AYSMYYKTLL EKLDVTTHVF RVGTYKSAIE PFVRDDMSDA ARESASRWLT QLWSAYVDDV AANRQIEIKT LTPSMEQFVA QLKEVNGDLA ALSKKVGLVD ELATRQQVRQ TLAETFGSDG KDSYNAIGYY EYKTTIKPTT LTDANDIAVV VASGAIMDGS QPRGTVGGDT VAGLLREARN DSNVKAVVLR VDSPGGSAFA SEVIRNEIEA LKAAGKPVVV SMSSLAASGG YWISMSADKI VAQPTTLTGS IGIFSVITTF EKGLNNLGIY TDGVGTTPFS GQGLTTGLTQ GAKDAIQLGI EHGYQRFISL VAEKRGLTLK AVDELAQGRV WTAQDAQTLG LVDQLGDFDD AVHLAADLAQ LDQYNLYWVE EPLTPAQQFL QDLLGQVRVS LGLDVSTLLP KSLQPLAVEW QQQTSLLNQL NDPKGQYAFC LPCQVE
|
| |