Gene VC0395_A1579 details


Gene Information

Locus tag: VC0395_A1579
Symbol: sppA
ID: 5137261
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio cholerae O395
Kingdom: Bacteria
Replicon accession: NC_009457
Strand:
Start bp: 1700161
End bp: 1702011
Gene Length: 1851 bp
Protein Length: 616 aa
Translation table: 11
GC content: 49%
IMG OID: 640533036
Product: protease IV
Protein accession: YP_001217520
Protein GI: 147673256
COG category: [O] Posttranslational modification, protein turnover, chaperones
              [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0616] Periplasmic serine proteases (ClpP class)
TIGRFAM ID: [TIGR00705] signal peptide peptidase SppA, 67K type
            [TIGR00706] signal peptide peptidase SppA, 36K type
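
The start, end, gene length and protein length fields above are mutually consistent, and the relationship can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the record does not say so, but the listed length of 1851 bp fits that reading) and that the CDS ends in a single stop codon. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 1700161
END_BP = 1702011


def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length = gene_length_bp(START_BP, END_BP)
print(length)                     # 1851
print(protein_length_aa(length))  # 616
```

Both results match the Gene Length and Protein Length fields of the record.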


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000000832547
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATCAC TATTTCGTTT TGTTGGGCTG ATTTTGAAAG GGATTTGGAA AGCGATCACC 
TTCATCAGGC TGGCACTGAC TAACCTTATT TTCTTACTCA GTATTGGCAT TATTTACTTT
ATTTATGTCC ACGCTGACGC CCCGCTACCC ACCATGGATA AATCCTCGGC ACTGGTGCTC
AATCTGTCTG GCCCGATTGT TGAGCAGAGT ACGCACATCA ACCCGATGGA CTCCTTTACG
GGCTCCGTGT TTGGTGAAGA GCTCCCCCGC GAAAACGTGC TGTTTGATAT TGTTGAAACC
TTACGCCATG CGAAAAATGA CAACAATGTC ACCGGGCTTG TGTTGGCTTT AGGAGACATG
CCAGAAACCA ACTTGACTAA GCTGCGTTAT ATCGCCAAAG CGATCAACGA ATTCAAAGCC
TCTGGTAAGC CTGTGTTTGC GGTTGGGGAT TTTTATAATC AGAGCCAATA TTACTTGGCG
AGTTATGCGG ACAAAATCTA CCTCGCTCCA GATGGTGCCG TACTGCTAAA AGGTTACAGC
GCCTACTCCA TGTACTACAA AACTCTGCTT GAGAAATTGG ATGTCACCAC TCACGTCTTT
CGTGTCGGCA CCTACAAATC GGCAATTGAG CCTTTTGTGC GTGATGATAT GTCTGATGCG
GCTCGTGAAT CCGCTTCTCG CTGGCTCACT CAGTTGTGGA GCGCGTACGT CGATGATGTC
GCGGCCAATC GCCAAATTGA GATCAAAACC CTTACTCCAA GCATGGAGCA GTTTGTCGCT
CAGTTGAAAG AAGTGAATGG TGACTTAGCT GCACTGTCCA AAAAAGTCGG TCTCGTCGAT
GAACTGGCGA CTCGTCAGCA AGTTCGCCAA ACGTTAGCGG AAACTTTTGG TAGCGATGGA
AAAGACAGTT ATAACGCCAT CGGTTACTAC GAATACAAAA CCACCATTAA GCCAACGACA
CTGACCGATG CCAACGATAT TGCCGTTGTC GTCGCGAGCG GTGCGATTAT GGATGGCTCA
CAGCCACGTG GTACCGTGGG TGGCGATACC GTAGCTGGAT TACTGCGCGA AGCTCGTAAC
GACAGCAATG TAAAAGCGGT CGTACTGCGT GTCGATAGCC CAGGGGGTAG CGCGTTTGCT
TCCGAAGTGA TCCGCAATGA AATTGAAGCT CTGAAAGCGG CGGGGAAACC TGTGGTGGTG
TCGATGTCAA GCCTTGCCGC TTCCGGTGGT TACTGGATTT CGATGAGCGC AGATAAGATT
GTCGCCCAAC CGACCACACT GACAGGTTCA ATCGGTATTT TCAGCGTGAT CACTACCTTC
GAGAAAGGAC TGAACAACCT TGGTATTTAT ACTGATGGTG TTGGTACAAC GCCTTTTTCA
GGACAAGGCC TGACCACGGG CCTCACCCAA GGTGCAAAAG ATGCGATTCA ACTGGGTATT
GAACACGGTT ATCAGCGCTT TATTTCTTTG GTTGCAGAGA AACGTGGCCT GACCTTAAAA
GCAGTGGATG AACTGGCTCA GGGCCGAGTC TGGACTGCAC AAGATGCACA AACCCTCGGT
TTAGTCGATC AGTTGGGTGA TTTTGATGAT GCCGTACATT TGGCAGCGGA TCTGGCACAG
TTGGATCAAT ATAACCTGTA CTGGGTTGAA GAGCCACTCA CTCCAGCCCA GCAATTTTTA
CAAGATCTGC TCGGACAAGT ACGTGTCAGC TTAGGTTTGG ATGTCTCCAC TCTCTTGCCA
AAATCACTGC AACCCTTGGC AGTAGAGTGG CAACAACAAA CGTCGCTGCT CAATCAATTG
AATGATCCTA AAGGGCAATA TGCGTTCTGT CTGCCCTGCC AAGTAGAATA G
 
Protein sequence
MKSLFRFVGL ILKGIWKAIT FIRLALTNLI FLLSIGIIYF IYVHADAPLP TMDKSSALVL 
NLSGPIVEQS THINPMDSFT GSVFGEELPR ENVLFDIVET LRHAKNDNNV TGLVLALGDM
PETNLTKLRY IAKAINEFKA SGKPVFAVGD FYNQSQYYLA SYADKIYLAP DGAVLLKGYS
AYSMYYKTLL EKLDVTTHVF RVGTYKSAIE PFVRDDMSDA ARESASRWLT QLWSAYVDDV
AANRQIEIKT LTPSMEQFVA QLKEVNGDLA ALSKKVGLVD ELATRQQVRQ TLAETFGSDG
KDSYNAIGYY EYKTTIKPTT LTDANDIAVV VASGAIMDGS QPRGTVGGDT VAGLLREARN
DSNVKAVVLR VDSPGGSAFA SEVIRNEIEA LKAAGKPVVV SMSSLAASGG YWISMSADKI
VAQPTTLTGS IGIFSVITTF EKGLNNLGIY TDGVGTTPFS GQGLTTGLTQ GAKDAIQLGI
EHGYQRFISL VAEKRGLTLK AVDELAQGRV WTAQDAQTLG LVDQLGDFDD AVHLAADLAQ
LDQYNLYWVE EPLTPAQQFL QDLLGQVRVS LGLDVSTLLP KSLQPLAVEW QQQTSLLNQL
NDPKGQYAFC LPCQVE
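
The gene and protein sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be stripped before any analysis. The following is a minimal sketch, not the tool that produced this record: it removes the block formatting, computes a GC fraction, and translates a fragment with the standard codon assignments. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which start codons it permits, so a plain lookup is enough for this CDS, which starts with ATG. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical.

```python
# Standard codon assignments, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_block = "ATGAAATCAC TATTTCGTTT TGTTGGGCTG ATTTTGAAAG GGATTTGGAA AGCGATCACC"
last_block = "AATGATCCTA AAGGGCAATA TGCGTTCTGT CTGCCCTGCC AAGTAGAATA G"

print(translate(first_block))  # MKSLFRFVGLILKGIWKAIT
print(translate(last_block))   # NDPKGQYAFCLPCQVE
```

The two fragments translate to the first 20 and last 16 residues of the protein sequence listed above. The last line can be translated on its own because it begins at base 1801, which is in frame. Passing the full gene sequence to `gc_fraction` should give a value close to the 49% stated in the record.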