Gene VIBHAR_03044 details


Gene Information

Locus tag: VIBHAR_03044
Symbol:
ID: 5553971
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio harveyi ATCC BAA-1116
Kingdom: Bacteria
Replicon accession: NC_009783
Strand:
Start bp: 3067391
End bp: 3069241
Gene Length: 1851 bp
Protein Length: 616 aa
Translation table: 11
GC content: 48%
IMG OID: 640908526
Product: protease
Protein accession: YP_001446221
Protein GI: 156975314
COG category: [O] Posttranslational modification, protein turnover, chaperones
              [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0616] Periplasmic serine proteases (ClpP class)
TIGRFAM ID: [TIGR00705] signal peptide peptidase SppA, 67K type
            [TIGR00706] signal peptide peptidase SppA, 36K type
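
The Start bp, End bp, Gene Length and Protein Length values above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the record. It assumes that the coordinates are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The variable names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 3067391
end_bp = 3069241

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # prints 1851
print(protein_length)  # prints 616
```

Under these assumptions the computed values agree with the 1851 bp and 616 aa listed in the table.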


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAC TCTTCAAATT TATTGGTCTG ATATTTAAGG GGATATGGAA AGCCATCACC 
TTTGTCAGAC TTGCTCTGGC AAACTTGATC TTCCTGGTGA TGATTGCCGT CTTCTACTTT
GCCTTTACCT ACACTGGCGA AAGTCAGCCA ACCGTGGAAA AAGAATCCGC GCTGGTAATG
AACCTTTCTG GGCCTATCGT TGAGCAGCGC CGTTACGTAA ACCCGATGGA TTCGATAGCG
GGCTCTGTCC TCGGTAACGA GATGCCGAAA GAGAACGTAT TGTTCGACAT CGTCGATACC
ATTCGCTACG CGAAAGACGA TCCTAAGGTA TCTGGCCTAG TTCTGGCTCT GCGCGATATG
CCTGAGACTA ACCTGACCAA ACTTCGTTAT ATCGCAAAAG CGCTGAATGA ATTTAAAACG
TCTGGCAAAC CTGTTTATGC AGTCGGCGAT TTCTACAATC AAAGCCAATA TTACCTAGCG
AGCTATGCAG ATAAAGTCTT CCTCGCTCCA GATGGCGGTG TGTTGATTAA GGGCTACAGC
TCTTACTCGA TGTACTACAA AACCTTGTTA GAGAAGTTAG ACGTATCGAC GCACGTGTTC
CGCGTGGGCA CTTACAAGTC TGCGATTGAG CCATTTATTC GTGATGACAT GTCTGATGCG
GCAAAAGAGT CGGCGACGCG CTGGATCACT CAGCTTTGGA GTGCCTTTGT TGATGACGTA
GCAACCAACC GTAATATCGA TGCCAAAGCA CTGAATCCAA CCATGGATGA ATTGCTTGCT
GAGATGAAAT CAGTCGATGG TGACCTTGCG CAACTGGCCG TTAAAATGGG CCTAGTCGAT
GAGCTAGCGA CACGTCAAGA CGTACGCAAA CTGTTCGCAA AAGAGTTCGG CAGTGACGGT
AAAGACAGCT ACAACGCAAT TGGCTACTAC GATTACCTAG CGACCATGCG CCCTAACCTA
GCGCCGTCTG AAAATGACAT TGCTGTTGTC GTAGCAAGCG GCGCCATTAT GGATGGTCAA
CAACCGCGCG GTACCGTTGG CGGTGATACG GTGGCAAGCC TACTTCGTCA AGCTCGCAAC
GACGACAAAG TTAAAGCGGT TGTCCTGCGT GTCGACAGCC CAGGAGGCAG CGCTTTTGCC
TCTGAAGTGA TTCGTAACGA AGTAGAAGCG TTAAAAGAAG CAGGTAAACC AGTTGTGGTT
TCGATGTCTA GCTTAGCGGC TTCCGGCGGT TACTGGATCT CAATGAGTGC AGATAAGATT
GTTGCTCAGC CAACCACGCT AACTGGCTCG ATTGGCATCT TCAGTGTTAT CACGACATTT
GAAAAAGGCT TTACCAAACT TGGTATCAGT ACCGATGGTG TTGGCACATC ACCATTCTCT
GGTGATGGCA TCACAACGGG TCTTTCAGAT GGTGCATCTC AAGCATTCCA GCTCGGCATC
GAACATGGCT ACAAGCGCTT TATCTCTCTG GTTGGCTCAA ACCGAGATAT GTCACTAGAC
GAAGTAGATA AAGTCGCTCA AGGTCGAGTT TGGACAGGTC AAGATGCAAT GTCATTCGGC
CTTGTAGACC AAATGGGTGA CTTTGATGAC GCAGTGAAAC TGGCAGCTAA ACTGGCTGAT
GTAGAAAACT ATGAGCTCTA CTGGGTTGAA GAACCATTAT CACCGACAGA ACAGTTCGTC
CAAGAGTTCA TGAACCAAGT GAAAGTCTCT TTGGGCATCG ATGCGACAAG CTTCTTGCCA
AAGAGCTTAC AGCCTGTAAC TCAACAGCTT GAGCAAGATG CAAGCATGAT GCAGAGCTTC
AACGATCCAA AAGGCCAATA CGCCTTCTGT TTGAACTGCC AAGTGCAATA G
 
Protein sequence
MKKLFKFIGL IFKGIWKAIT FVRLALANLI FLVMIAVFYF AFTYTGESQP TVEKESALVM 
NLSGPIVEQR RYVNPMDSIA GSVLGNEMPK ENVLFDIVDT IRYAKDDPKV SGLVLALRDM
PETNLTKLRY IAKALNEFKT SGKPVYAVGD FYNQSQYYLA SYADKVFLAP DGGVLIKGYS
SYSMYYKTLL EKLDVSTHVF RVGTYKSAIE PFIRDDMSDA AKESATRWIT QLWSAFVDDV
ATNRNIDAKA LNPTMDELLA EMKSVDGDLA QLAVKMGLVD ELATRQDVRK LFAKEFGSDG
KDSYNAIGYY DYLATMRPNL APSENDIAVV VASGAIMDGQ QPRGTVGGDT VASLLRQARN
DDKVKAVVLR VDSPGGSAFA SEVIRNEVEA LKEAGKPVVV SMSSLAASGG YWISMSADKI
VAQPTTLTGS IGIFSVITTF EKGFTKLGIS TDGVGTSPFS GDGITTGLSD GASQAFQLGI
EHGYKRFISL VGSNRDMSLD EVDKVAQGRV WTGQDAMSFG LVDQMGDFDD AVKLAAKLAD
VENYELYWVE EPLSPTEQFV QEFMNQVKVS LGIDATSFLP KSLQPVTQQL EQDASMMQSF
NDPKGQYAFC LNCQVQ
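
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first block of the gene. The sketch below is a minimal example, not the method used to produce this record. It builds the codon table from the standard NCBI amino acid string, which translation table 11 shares for internal codons (table 11 differs from the standard table in its set of permitted start codons, which does not matter here because this gene starts with ATG). The names `CODON_TABLE`, `translate` and `first_blocks` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G, with the third base varying fastest.
# Assumption: table 11 assigns the same amino acids as the standard table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the sequence into blocks.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGAAAAAAC TCTTCAAATT TATTGGTCTG"
print(translate(first_blocks))  # prints MKKLFKFIGL
```

The result matches the first block of the protein sequence. Passing the full gene sequence to the same function should reproduce the whole protein, provided the assumptions above hold.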