Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A0089 |
Symbol | |
ID | 5135732 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | + |
Start bp | 77899 |
End bp | 80757 |
Gene Length | 2859 bp |
Protein Length | 952 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640531549 |
Product | insulinase family protease/insulinase family protease |
Protein accession | YP_001216054 |
Protein GI | 147673676 |
COG category | [R] General function prediction only |
COG ID | [COG0612] Predicted Zn-dependent peptidases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGATGAGAA AAGTCGCGTT GTTTGGATTT TCTCTTTTGT TATTAGCGGG ATGCAGTAGT TCTGACTCAT CGTTGCCGCT GTTTTCTTCT TTGCCGAAAG GCGTCACCTT GGTCGAAGAG GTGAAAGCCG AACCGGGTAA AGTCATGATC CCGTACTCGA AGTATCGTTT GGATAATGGA CTGACCGTGA TCCTTTCACC GGATGACTCC GATCCCTTGG TGCATGTTGA TGTAACCTAT CACGTGGGTT CCGCTCGTGA GGAGATTGGT AAATCGGGCT TTGCTCATTT CTTTGAGCAC ATGATGTTCC AAGGCTCTAA ACATGTCGGC GATCAGCAGC ATTTTCGTTT GATCACTGAA GCGGGCGGCT CGCTCAATGG CACCACCAAC CGCGACCGTA CCAACTATTT TGAAACGGTT CCGGCCAATC AATTAGAAAA AATGTTGTGG CTTGAGGCGG ATCGAATGGG CTTTTTGCTC GATGCGGTTT CGCAGCGCAA GTTTGAGATC CAGCGTGATA CCGTGAAAAA TGAGCGCGCT CAGAATTATG ACAACCGCCC TTATGGCTTG ATGTGGGAAA AAATGGGTGA AGCGCTTTAT CCCGAAGGGC ACCCTTATTC TTGGCAAACG ATTGGTTATG TCAGTGATTT AGACCGAGTT GATGTCAATG ATTTAAAAGC CTTTTTCCTA CGTTGGTATG GCCCAAATAA TGCGGTGTTA ACCATAGGTG GCGATCTGGA TGTTAAACAA ACCTTAGATT GGGTTCAAAA ATACTTTGGC TCGATTCCGA AAGGCCCAGA TGTGGTAGAT GCACCGAAGC AGCCAGCACG GTTGAGTGAA GATCGTTTTA TTACGTTGGA AGATCGTGTG CAGCAGCCGA TGCTGCTCAT AGGGTGGCCA ACGCAATATT GGGGCGCAGA GGATCAAGTC GCACTGGATG CTCTAGCCAG TGCCCTAGGC AGTGGCAACA ACAGCCTGCT CTATCAAGAG CTGGTGAAAA CTCAAAAGGC GGTTGATGCG GGCGCATTCC AAGACTGTGC TGAACTCGCC TGTACCTTTT ATGTGTATGC CATGGCTCCC TCGGGAGCCA AAGGTAAGCT CGCCCCACTG TATCAAGAAA CCCTGCAAGT GCTGGAAAAG TTTAAGCAGC AGGGTGTCTC TGCTTCGCGT CTAGAACAGA TCATCGGCTC AGAAGAGGCC AGTGCTGTAT TTGCGCTAGA GAGTGTCAAA GGTAAAGTCA GCCAATTGGC GGCCAACCAA ACCTTTTTTG ATCAGCCAGA TCGCATTGAA AGTCAGCTAG AGAAAATCCG CGCAGTGACA CCAGAGTCGG TGAAGCAGGT GTTCACTCGC TATCTGGATG GCCAGCCGAA AGTGACACTT AGTGTAGTGG CAAAAGGCAA AACGGACTTT GCGGTACGTC CGGCAACCTT TATCACTCCG GAACGTCAAC TGCCGGAATA TCAAAAAATT GGTGATGAGC AGCTTGCCTA TCGCGAAGTC AAAGACAGCT TTGATCGCTC GCAGATGCCG CAAGTGGCAC AAGCGGTTCA GCCACGTTTA CCTAAACTGT ACGATGTCTA TTTTGACAAC GGCGTACAAC TGCTTGGCAC GCAAACTCGT GAAACTCCGA CAGTACTGAT TGAAATTCAA TTACCTGCGG GCGAGCGTCA AGTCGCGATG GGGAAAGAAG GCTTGGCTAA CTTAACCGCC AGCCTGCTGC AAGAAGGCAG CCAAAACCGT AGCGCCGAAG CCATTCAAGC GCAACTGGAT AAGCTCGGCT CAAGCATTCA AGTCGTGGCG GGAGCCTATT CAACCAGCAT TGTGGTATCG AGTTTGAAGA AAAACTTGCC GGAGACGTTG CAAATCAGCC AAGAGATGCT GCTCAAACCT GCTTTCAAAC AGAGTGATTT TGCACGTTTA CAGCAGCAAA TGTTGCAAGG CGTGGTTTAC CAACACCAGC AACCGAGCTG GTTAGCATCG CAAGCCACCC GCCAAGTATT ATGGGGGGAG AGTTTGTTTG CCCGCTCCGG TGATGGTACG CAAGCTTCTA TCTCTGCCTT GACCTTGAAG GACGTGAAGC AATTCTACCG TCAACATTAC ACCCCGCATG GTGCACAAAT TGCGGTAGTG GGGGATATCA GTGCGCGAGA AATTCGTCAG CAGTTACAGT TTATTGCCGA TTGGAAAGGC GAAGCGGCCC CGCTGATTAA CCCACAAGTA GTGCCAACGT TAACTAAGCA GAAAATCTAT TTAGTGGATA AGCCGGGAGC GCCGCAAAGT ATCATCCGTA TGGTGCGTAA AGGGCTTCCT TTTGATGCCA CAGGTGAGCT GTACTTAACT CAATTGGCGA ATTTTAACTT AGCAGGTAAC TTCAATAGCC GGATAAACCA AAATCTGCGT GAAGACAAAG GATATACCTA CGGCGCAGGA AGCTATTTTG CCAGTAACCG TGAGATTGGA GCCATTGTAT TTAACGCTCC AGTACGTGCG GATGTGACCG TTGAAGCGAT TCAAGAAATG ATCAAAGAGA TGCATCATTT CAGCCAAGCG GGGATGAGCG AGGAAGAGAT GAAATTTTTA CGTCTCGCTG TCGGTCAACA AGATGCGCTG ATGTATGAAA CACCCGCGCA GAAAGCTCAA CTTGTCTCCA GTATTCTGAC GTACAGTTTA GATCGTGATT ATCTGCAGCA ACGTAATGAG ATAGTGAAAA GCGTTGATCG CTCAACACTC AATGAGCTGG CGGCCAAATG GTTCAATCCT GAGGATTACC AAATTATTGT GGTGGGCGAT GCCAAGCGAC TCAAGCCGCA GTTGGAAAAG TTAGGCATTC CGCTAGAAGA GCTTGAAATC ATCCGTTAG
|
Protein sequence | MMRKVALFGF SLLLLAGCSS SDSSLPLFSS LPKGVTLVEE VKAEPGKVMI PYSKYRLDNG LTVILSPDDS DPLVHVDVTY HVGSAREEIG KSGFAHFFEH MMFQGSKHVG DQQHFRLITE AGGSLNGTTN RDRTNYFETV PANQLEKMLW LEADRMGFLL DAVSQRKFEI QRDTVKNERA QNYDNRPYGL MWEKMGEALY PEGHPYSWQT IGYVSDLDRV DVNDLKAFFL RWYGPNNAVL TIGGDLDVKQ TLDWVQKYFG SIPKGPDVVD APKQPARLSE DRFITLEDRV QQPMLLIGWP TQYWGAEDQV ALDALASALG SGNNSLLYQE LVKTQKAVDA GAFQDCAELA CTFYVYAMAP SGAKGKLAPL YQETLQVLEK FKQQGVSASR LEQIIGSEEA SAVFALESVK GKVSQLAANQ TFFDQPDRIE SQLEKIRAVT PESVKQVFTR YLDGQPKVTL SVVAKGKTDF AVRPATFITP ERQLPEYQKI GDEQLAYREV KDSFDRSQMP QVAQAVQPRL PKLYDVYFDN GVQLLGTQTR ETPTVLIEIQ LPAGERQVAM GKEGLANLTA SLLQEGSQNR SAEAIQAQLD KLGSSIQVVA GAYSTSIVVS SLKKNLPETL QISQEMLLKP AFKQSDFARL QQQMLQGVVY QHQQPSWLAS QATRQVLWGE SLFARSGDGT QASISALTLK DVKQFYRQHY TPHGAQIAVV GDISAREIRQ QLQFIADWKG EAAPLINPQV VPTLTKQKIY LVDKPGAPQS IIRMVRKGLP FDATGELYLT QLANFNLAGN FNSRINQNLR EDKGYTYGAG SYFASNREIG AIVFNAPVRA DVTVEAIQEM IKEMHHFSQA GMSEEEMKFL RLAVGQQDAL MYETPAQKAQ LVSSILTYSL DRDYLQQRNE IVKSVDRSTL NELAAKWFNP EDYQIIVVGD AKRLKPQLEK LGIPLEELEI IR
|
| |