Gene VC0395_A0089 details

Gene Information

Locus tag: VC0395_A0089
Symbol:
ID: 5135732
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio cholerae O395
Kingdom: Bacteria
Replicon accession: NC_009457
Strand:
Start bp: 77899
End bp: 80757
Gene Length: 2859 bp
Protein Length: 952 aa
Translation table: 11
GC content: 49%
IMG OID: 640531549
Product: insulinase family protease/insulinase family protease
Protein accession: YP_001216054
Protein GI: 147673676
COG category: [R] General function prediction only
COG ID: [COG0612] Predicted Zn-dependent peptidases
TIGRFAM ID:
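
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence includes its terminal stop codon (the gene sequence in this record ends in TAG). The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(77899, 80757)
print(length)                  # 2859
print(protein_length(length))  # 952
```

Both values agree with the Gene Length and Protein Length fields of the record.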


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGATGAGAA AAGTCGCGTT GTTTGGATTT TCTCTTTTGT TATTAGCGGG ATGCAGTAGT 
TCTGACTCAT CGTTGCCGCT GTTTTCTTCT TTGCCGAAAG GCGTCACCTT GGTCGAAGAG
GTGAAAGCCG AACCGGGTAA AGTCATGATC CCGTACTCGA AGTATCGTTT GGATAATGGA
CTGACCGTGA TCCTTTCACC GGATGACTCC GATCCCTTGG TGCATGTTGA TGTAACCTAT
CACGTGGGTT CCGCTCGTGA GGAGATTGGT AAATCGGGCT TTGCTCATTT CTTTGAGCAC
ATGATGTTCC AAGGCTCTAA ACATGTCGGC GATCAGCAGC ATTTTCGTTT GATCACTGAA
GCGGGCGGCT CGCTCAATGG CACCACCAAC CGCGACCGTA CCAACTATTT TGAAACGGTT
CCGGCCAATC AATTAGAAAA AATGTTGTGG CTTGAGGCGG ATCGAATGGG CTTTTTGCTC
GATGCGGTTT CGCAGCGCAA GTTTGAGATC CAGCGTGATA CCGTGAAAAA TGAGCGCGCT
CAGAATTATG ACAACCGCCC TTATGGCTTG ATGTGGGAAA AAATGGGTGA AGCGCTTTAT
CCCGAAGGGC ACCCTTATTC TTGGCAAACG ATTGGTTATG TCAGTGATTT AGACCGAGTT
GATGTCAATG ATTTAAAAGC CTTTTTCCTA CGTTGGTATG GCCCAAATAA TGCGGTGTTA
ACCATAGGTG GCGATCTGGA TGTTAAACAA ACCTTAGATT GGGTTCAAAA ATACTTTGGC
TCGATTCCGA AAGGCCCAGA TGTGGTAGAT GCACCGAAGC AGCCAGCACG GTTGAGTGAA
GATCGTTTTA TTACGTTGGA AGATCGTGTG CAGCAGCCGA TGCTGCTCAT AGGGTGGCCA
ACGCAATATT GGGGCGCAGA GGATCAAGTC GCACTGGATG CTCTAGCCAG TGCCCTAGGC
AGTGGCAACA ACAGCCTGCT CTATCAAGAG CTGGTGAAAA CTCAAAAGGC GGTTGATGCG
GGCGCATTCC AAGACTGTGC TGAACTCGCC TGTACCTTTT ATGTGTATGC CATGGCTCCC
TCGGGAGCCA AAGGTAAGCT CGCCCCACTG TATCAAGAAA CCCTGCAAGT GCTGGAAAAG
TTTAAGCAGC AGGGTGTCTC TGCTTCGCGT CTAGAACAGA TCATCGGCTC AGAAGAGGCC
AGTGCTGTAT TTGCGCTAGA GAGTGTCAAA GGTAAAGTCA GCCAATTGGC GGCCAACCAA
ACCTTTTTTG ATCAGCCAGA TCGCATTGAA AGTCAGCTAG AGAAAATCCG CGCAGTGACA
CCAGAGTCGG TGAAGCAGGT GTTCACTCGC TATCTGGATG GCCAGCCGAA AGTGACACTT
AGTGTAGTGG CAAAAGGCAA AACGGACTTT GCGGTACGTC CGGCAACCTT TATCACTCCG
GAACGTCAAC TGCCGGAATA TCAAAAAATT GGTGATGAGC AGCTTGCCTA TCGCGAAGTC
AAAGACAGCT TTGATCGCTC GCAGATGCCG CAAGTGGCAC AAGCGGTTCA GCCACGTTTA
CCTAAACTGT ACGATGTCTA TTTTGACAAC GGCGTACAAC TGCTTGGCAC GCAAACTCGT
GAAACTCCGA CAGTACTGAT TGAAATTCAA TTACCTGCGG GCGAGCGTCA AGTCGCGATG
GGGAAAGAAG GCTTGGCTAA CTTAACCGCC AGCCTGCTGC AAGAAGGCAG CCAAAACCGT
AGCGCCGAAG CCATTCAAGC GCAACTGGAT AAGCTCGGCT CAAGCATTCA AGTCGTGGCG
GGAGCCTATT CAACCAGCAT TGTGGTATCG AGTTTGAAGA AAAACTTGCC GGAGACGTTG
CAAATCAGCC AAGAGATGCT GCTCAAACCT GCTTTCAAAC AGAGTGATTT TGCACGTTTA
CAGCAGCAAA TGTTGCAAGG CGTGGTTTAC CAACACCAGC AACCGAGCTG GTTAGCATCG
CAAGCCACCC GCCAAGTATT ATGGGGGGAG AGTTTGTTTG CCCGCTCCGG TGATGGTACG
CAAGCTTCTA TCTCTGCCTT GACCTTGAAG GACGTGAAGC AATTCTACCG TCAACATTAC
ACCCCGCATG GTGCACAAAT TGCGGTAGTG GGGGATATCA GTGCGCGAGA AATTCGTCAG
CAGTTACAGT TTATTGCCGA TTGGAAAGGC GAAGCGGCCC CGCTGATTAA CCCACAAGTA
GTGCCAACGT TAACTAAGCA GAAAATCTAT TTAGTGGATA AGCCGGGAGC GCCGCAAAGT
ATCATCCGTA TGGTGCGTAA AGGGCTTCCT TTTGATGCCA CAGGTGAGCT GTACTTAACT
CAATTGGCGA ATTTTAACTT AGCAGGTAAC TTCAATAGCC GGATAAACCA AAATCTGCGT
GAAGACAAAG GATATACCTA CGGCGCAGGA AGCTATTTTG CCAGTAACCG TGAGATTGGA
GCCATTGTAT TTAACGCTCC AGTACGTGCG GATGTGACCG TTGAAGCGAT TCAAGAAATG
ATCAAAGAGA TGCATCATTT CAGCCAAGCG GGGATGAGCG AGGAAGAGAT GAAATTTTTA
CGTCTCGCTG TCGGTCAACA AGATGCGCTG ATGTATGAAA CACCCGCGCA GAAAGCTCAA
CTTGTCTCCA GTATTCTGAC GTACAGTTTA GATCGTGATT ATCTGCAGCA ACGTAATGAG
ATAGTGAAAA GCGTTGATCG CTCAACACTC AATGAGCTGG CGGCCAAATG GTTCAATCCT
GAGGATTACC AAATTATTGT GGTGGGCGAT GCCAAGCGAC TCAAGCCGCA GTTGGAAAAG
TTAGGCATTC CGCTAGAAGA GCTTGAAATC ATCCGTTAG
 
Protein sequence
MMRKVALFGF SLLLLAGCSS SDSSLPLFSS LPKGVTLVEE VKAEPGKVMI PYSKYRLDNG 
LTVILSPDDS DPLVHVDVTY HVGSAREEIG KSGFAHFFEH MMFQGSKHVG DQQHFRLITE
AGGSLNGTTN RDRTNYFETV PANQLEKMLW LEADRMGFLL DAVSQRKFEI QRDTVKNERA
QNYDNRPYGL MWEKMGEALY PEGHPYSWQT IGYVSDLDRV DVNDLKAFFL RWYGPNNAVL
TIGGDLDVKQ TLDWVQKYFG SIPKGPDVVD APKQPARLSE DRFITLEDRV QQPMLLIGWP
TQYWGAEDQV ALDALASALG SGNNSLLYQE LVKTQKAVDA GAFQDCAELA CTFYVYAMAP
SGAKGKLAPL YQETLQVLEK FKQQGVSASR LEQIIGSEEA SAVFALESVK GKVSQLAANQ
TFFDQPDRIE SQLEKIRAVT PESVKQVFTR YLDGQPKVTL SVVAKGKTDF AVRPATFITP
ERQLPEYQKI GDEQLAYREV KDSFDRSQMP QVAQAVQPRL PKLYDVYFDN GVQLLGTQTR
ETPTVLIEIQ LPAGERQVAM GKEGLANLTA SLLQEGSQNR SAEAIQAQLD KLGSSIQVVA
GAYSTSIVVS SLKKNLPETL QISQEMLLKP AFKQSDFARL QQQMLQGVVY QHQQPSWLAS
QATRQVLWGE SLFARSGDGT QASISALTLK DVKQFYRQHY TPHGAQIAVV GDISAREIRQ
QLQFIADWKG EAAPLINPQV VPTLTKQKIY LVDKPGAPQS IIRMVRKGLP FDATGELYLT
QLANFNLAGN FNSRINQNLR EDKGYTYGAG SYFASNREIG AIVFNAPVRA DVTVEAIQEM
IKEMHHFSQA GMSEEEMKFL RLAVGQQDAL MYETPAQKAQ LVSSILTYSL DRDYLQQRNE
IVKSVDRSTL NELAAKWFNP EDYQIIVVGD AKRLKPQLEK LGIPLEELEI IR
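
The protein sequence begins with M even though the gene sequence begins with GTG, which as an internal codon encodes valine. This is consistent with translation table 11 (the bacterial, archaeal and plant plastid code), where GTG is one of the permitted initiation codons and is read as methionine at the start of a CDS. The following is a minimal sketch of that rule, not the method used to produce this record: the `translate` function and the constant names are hypothetical, and the codon table is built from the compact amino-acid string used in NCBI's genetic code listings.

```python
from itertools import product

# Codon order T, C, A, G at each position, as in NCBI genetic code listings.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons of translation table 11.
TABLE11_STARTS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(cds: str) -> str:
    """Translate a CDS, reading a table-11 start codon as M and stopping at a stop codon."""
    seq = "".join(cds.split()).upper()  # drop the spaces between 10-base blocks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in TABLE11_STARTS:
            residues.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("GTGATGAGAA AAGTCGCGTT GTTTGGATTT"))  # MMRKVALFGF
```

The output matches the first ten residues of the protein sequence listed in this record.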