Gene Information |
Locus tag | VC0395_A1660 |
Symbol | |
ID | 5136473 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | + |
Start bp | 1780465 |
End bp | 1783284 |
Gene Length | 2820 bp |
Protein Length | 939 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640533116 |
Product | peptidase insulinase family protein |
Protein accession | YP_001217598 |
Protein GI | 147675351 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1025] Secreted/periplasmic Zn-dependent peptidases, insulinase-like |
TIGRFAM ID | |
|
|
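The coordinate, length, and GC fields in the table above are related by simple arithmetic. The sketch below reproduces that arithmetic under two assumptions that the record does not state explicitly: that Start bp and End bp are 1-based inclusive coordinates, and that the last codon of the CDS is a stop codon that is not counted in Protein Length. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1780465
END_BP = 1783284


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop (assumption)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percent G+C, ignoring the whitespace used to group bases in blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(base in "GC" for base in bases)
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))  # prints: 2820 939
```

With these assumptions the computed values agree with the Gene Length (2820 bp) and Protein Length (939 aa) fields. `gc_percent` can be applied to the Gene sequence field to compare against the reported GC content; the record appears to round that value to a whole percent.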
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGTGTTCT GTGACTCGTC TTTGACCCTA CCAATCCGGA GAAATGCTGT GCATATCAGC CCCAATGATA CTCATCAGTA CCGATATATT ACGCTGAGCA ATGGCTTACG GACGTTACTC ATCCAAAGCC CTGATGTTCA AAAGTGCGCC GCGGCGTTAG CGGTCAACGT CGGGCACTTT GACGATCCTA TCGAACGCCA AGGTTTGGCT CACTATCTCG AACACATGCT GTTTTTGGGC ACAGAAAAAT ACCCGAAAGT TGGCGATTTT CAAACTTTTA TCAGTCAGCA TGGCGGTTCT AATAATGCAT GGACTGGCAC AGAACATACC TGTTTCTTCT TTGATGTTTT GCCGAACGCT TTTGCTAAAG CGCTTGACCG TTTCAGTCAG TTTTTTATCG CACCGCTGTT TAATGCCGAG GCTTTGGATA AAGAGCGTCA AGCCGTCGAC TCAGAATACA AACTGAAAAT TAAAGATGAG TCACGCCGTC TCTACCAAGT ACAAAAAGAA ACCATTAATC CGCAGCACCC GTTTAGCAAG TTCTCAGTCG GCAATCAGCA CACCTTAGGC GATCGCGAAA ACAGCTCTAT TCGAGATGAA ATTATTGAGT TTTATCAGTC CCATTATTCA GCCAAGTTGA TGACGTTGTC CCTGATTGGC TCGCAAAGTT TTGACGAACT CGAAGCATGG GCTGAGCGTT ATTTTGCCGC GATCCCAAAC CCGCAACGCG ATATCAAACC GCTCCCTCCT TTTGTTGATC GCGAACATAC GGGAATTTTG ATTCAGATTG AGCCTCTCAA AGAGATTCGT AAGCTGATCC TCGCTTTTCC TATGCCCAGC ACTGAAAGCT ACTATCAGAA AAAACCGCTC TCTTATTTTG CGCATTTAAT TGGCTATGAG GGTGAAGGCA GCTTGCTGGA AGCGTTGAAA GAGAAAGGTT GGATCACTAC TCTTTCTGCT GGCGGAGGGG TAAGTGGCAG TAACTACCGC GAGTTTGCCG TCAGTTGTGT GCTGACTCAG GAAGGATTGG ATCATGTTGA TGAGATTATT CAAAGTCTAT TCCAAACGCT GAATTTGATT GCGACTCAAG GCTTACAGGC GTGGCGTTAT CAAGAAAAAC GGGCTGTACT TGAGTCGGCT TTCCGCTTTC AAGAAACCCA GCGCCCACTG GATATGGTCA GCCATTTGGT GGTCAACATG CAGCATTATG CCCCCGAAGA TACGGCTTAC GGGGACTATA TGATGTCAGG CTACGATGAA GCCTTGTTGC TGCATATTTT AAGTTATCTC ACCCCAGAGA ACCTGCGTGC CACCTTGATT GCAAAAGGGG GTGAGTACGA TAAAAAAGCG CAATGGTACT TTACCCCTTA TTCAGTTCGC CCTTTCACTA CTGAACAACT GCACAGATTC CGCCAGCCGT TGGATTTACC CATCTCCCTG CCTGAACCCA ACCCATTTAT CTGTTATGAC CTCGACCCGT CAGAGGTCAA GGAGAGCCAC ACTCTGCCGC AAGTGTTGCA GGACTTACCC GGATTCAAAC TTTGGCATCA GCAAGACACC GAATTCAGAG TACCTAAAGG CGTGATTTAT GTTGCGATCG ACAGCCCGCA TGCCGTGGCT AACTGTCGTA ATATCGTGAT GACCCGTTTG TGTGTGGAGA TGTTTTTGGA TGCGTTAGCC AAAGAAACAT ACCAAGCCGA AATAGCGGGG ATGGGCTACA ACCTCTATGC CCACCAAGGT GGCGTGACGC TCACGTTGTC AGGTTTTAGC CAAAAATTAC CGCAATTGAT GGAAGTGATT 
TTACGTAAAT TTGCGCAGCG CGATTTCCAG CCGAAGCGCT TTGCCACCAT CAAGCAGCAA ATGACTCGTA ATTGGCGCAA TGCCGCCCAC GATAAACCCA TTTCTCAACT GTTTAATGCG ATGACTGGGT TATTGCAACC CAATAACCCA CCTTATGCCG AGTTACTAGC CGCGATTGAT GATGTACAAG TGGAAGAGTT AGCCCATTTT GTCGACACAA TTCTGTCGCA GTTACACGTC GAAATGTTTG TCTATGGTGA CTGGCCTGCC GCCGAAGCCC ACAAGATGGC GGAAGTGCTC AAAGACGCGC TACGCGTTCA AGGACAAACT TATGAAGAGT CGCTTCGCCC ATTGGTTATG CTCGGAAAGA GTGGAACGTT TCAACGTGAA GTACAGTGCC AACAAGATGA TTCCGCGATT GTGGTGTATT ACCAATCTCA TGAAGTCAGC CCACGCAGTA TTGCGCTTTA CTCGCTGGCC AATCATCTGA TGTCGGCCAC CTTCTTCCAC GAAATTCGCA CCAAACAGCA ACTCGGTTAT ATGGTTGGTA CGGGCAATAT GCCATTAAAC CGCCATCCGG GTTTAATCTT ATACGTTCAA TCACCATCGG CTCCGCCAAG TGAATTGATC CGCTCTATTG ATGAGTTTTT AAATGCCTTG TATATGGTTC TGCTTGAACT CAATGAATAT CAATGGCATA GCAGTAAACG GGGATTGTGG AACCAAATTT CCGCGCCCGA CCCGACACTC CGGATTCGAG CCCAGCGTTT ATGGGTTGCG ATTGGCAATA AAGATCTCAG TTTCGATCAG CGAGAAAAAG TGCTGGAAGA GCTTAAAAAT CTCAGCCGTG CTGACATGAT ACGCTTCGTT GTCAATGAAC TGAAACCACG TACGGCACAT CGCTTAATTA TGCATACTCA AGGCCGAGCC CATCACGAGG CTCCTGCCTT ACAACTCGGC CAAGAAATTG GCTCAGTGGA AGAGTTCCAA CTGCGCCCTA AAGCTTATGA TGTGGGTTAG
|
Protein sequence | MVFCDSSLTL PIRRNAVHIS PNDTHQYRYI TLSNGLRTLL IQSPDVQKCA AALAVNVGHF DDPIERQGLA HYLEHMLFLG TEKYPKVGDF QTFISQHGGS NNAWTGTEHT CFFFDVLPNA FAKALDRFSQ FFIAPLFNAE ALDKERQAVD SEYKLKIKDE SRRLYQVQKE TINPQHPFSK FSVGNQHTLG DRENSSIRDE IIEFYQSHYS AKLMTLSLIG SQSFDELEAW AERYFAAIPN PQRDIKPLPP FVDREHTGIL IQIEPLKEIR KLILAFPMPS TESYYQKKPL SYFAHLIGYE GEGSLLEALK EKGWITTLSA GGGVSGSNYR EFAVSCVLTQ EGLDHVDEII QSLFQTLNLI ATQGLQAWRY QEKRAVLESA FRFQETQRPL DMVSHLVVNM QHYAPEDTAY GDYMMSGYDE ALLLHILSYL TPENLRATLI AKGGEYDKKA QWYFTPYSVR PFTTEQLHRF RQPLDLPISL PEPNPFICYD LDPSEVKESH TLPQVLQDLP GFKLWHQQDT EFRVPKGVIY VAIDSPHAVA NCRNIVMTRL CVEMFLDALA KETYQAEIAG MGYNLYAHQG GVTLTLSGFS QKLPQLMEVI LRKFAQRDFQ PKRFATIKQQ MTRNWRNAAH DKPISQLFNA MTGLLQPNNP PYAELLAAID DVQVEELAHF VDTILSQLHV EMFVYGDWPA AEAHKMAEVL KDALRVQGQT YEESLRPLVM LGKSGTFQRE VQCQQDDSAI VVYYQSHEVS PRSIALYSLA NHLMSATFFH EIRTKQQLGY MVGTGNMPLN RHPGLILYVQ SPSAPPSELI RSIDEFLNAL YMVLLELNEY QWHSSKRGLW NQISAPDPTL RIRAQRLWVA IGNKDLSFDQ REKVLEELKN LSRADMIRFV VNELKPRTAH RLIMHTQGRA HHEAPALQLG QEIGSVEEFQ LRPKAYDVG
|
| |
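The gene sequence begins with TTG while the protein sequence begins with M. This is consistent with translation table 11, in which TTG is one of the accepted alternative start codons and an initiating codon is read as methionine. The following is a minimal sketch of that rule, not the database's own translation pipeline. The function name `translate_cds` and its `first_is_start` parameter are hypothetical; the codon assignments are those of the standard genetic code, which table 11 shares.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
# Table 11 uses the same amino acid assignments as the standard code.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}

# Start codons listed for translation table 11.
TABLE_11_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(sequence: str, first_is_start: bool = True) -> str:
    """Translate a plus-strand CDS, stopping at the first stop codon.

    Whitespace between base blocks is ignored. If first_is_start is True
    and the first codon is a table 11 start codon, it is read as M.
    """
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        codon = bases[i:i + 3]
        if i == 0 and first_is_start and codon in TABLE_11_STARTS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the Gene sequence field.
    print(translate_cds("TTGGTGTTCT GTGACTCGTC TTTGACCCTA"))  # prints: MVFCDSSLTL
```

The output matches the first block of the Protein sequence field. Translating the last twelve bases with `translate_cds("GATGTGGGTTAG", first_is_start=False)` gives `DVG`, which matches the end of the protein and shows that the final TAG codon is a stop.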