Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_1004 |
Symbol | prtV |
ID | 5134184 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009456 |
Strand | + |
Start bp | 979363 |
End bp | 982119 |
Gene Length | 2757 bp |
Protein Length | 918 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640531326 |
Product | protease |
Protein accession | YP_001215840 |
Protein GI | 147671484 |
COG category | [S] Function unknown |
COG ID | [COG4412] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR03296] M6 family metalloprotease domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.000922392 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAACGA TCAAAAAAAC GCTATTAGCT GCCGCCATAG CCAGTTTTTT CAGCAGTGGA TTATACGCTC AAACACCCAT TGATTTAGGC GTGGTGAATG AGGATAAATT AATTGAAATG TTAGTCCGCA CCGGACAAAT TCCTGCCGAT GCCTCTGACG TTGATAAACG TATTGCGCTA GAACGTTATC TGGAGGAGAA AATTCGCTCC GGATTCAAAG GTGATGCGCA ATTTGGTAAG AAAGCGCTCG AGCAGCGTGC GAAAATTCTT AAAGTGATCG ATAAGCAAAA AGGCCCGCAC AAGGCGCGTG TTTTTGCTTT AGATGTTGGT CAAAAGCGCA CGGACAAAGT GCTCGCGCTA TTGATCGATT TCCCCGATCT CCCTTGGGAT GATAACCGCC TGACGAAAGA GCATACTGAG ATGCTCTACG ATCGTTATGA GCCTTCTCAC TACCAAGATT TGCTGTTCTC GGACAAAGGC TATACCGGTC CAAACGGTGA AAACTTTATC TCAATGCGTC AATATTACGA GAGTGAATCT GGCAACAGCT ACAGTGTCTC CGGCCAAGCA GCAGGATGGT ATCGTGCCTC AAAAAATGCG GCTTATTACG GTGGCAACTC TCCCGGTACC AACAATGATA TGAATGCTCG GGAGCTGGTT CGCGAAGCAC TGGATCAACT TGCGCGCGAT CCAAACATTA ACCTTGCCGA TTACGATATC GAAGATCGCT ATGACTACAA CGGTAACGGT AATTTCCGTG AGCCAGATGG CGTGATAGAT CACTTGATGA TTTTCCATGC CTCTGTGGGT GAAGAAGCGG GTGGCGGTGT GTTGGGCGCG GATGCGATTT GGTCACACCG TTTTAACCTC GGCCGTTACC ATGTTCTTGA AGGCACGAAA AGCAACGTTC CTGGACGCTT CAATGGCCAA TTCGCTGCCT TTGATTACAC CATTCAACCG ATTGATGCGG CTGCCGGCGT GTGTGCCCAC GAATATGGTC ACGATTTAGG TCTGCCCGAT GAATATGACA CCCAGTACAC AGGTACGGGA GAGCCCGTCT CTTATTGGTC AATCATGTCA TCTGGCAGCT GGGCGGGCAA AATTGGCGGT ACACAGCCCA CGGCTTTCAG TTCATGGGCT AAGCAGTTCT TACAAAATTC GATTGGCGGA CGCTGGATTA ACCATGAGCA GCTTTCGATT AATGAGTTAG AAGCCAAACC GCGCGTGGTT ACGCTATTCC AAACCACAGA TAACTCACGC CCGAACATGG TGAAAGTGAC TCTGCCGATG AAACGGGTTG AAGGCATTAA GCCTGCAGAA GGTGAGTTCT CCTTCTACTC GAACCGTGGC GATGATCTGA AAAACCGAAT GAGCCGTCCA TTGACGATCC CAGCAGGCAG CCAAGCCACG TTGCGCTTTA AAGCGTGGTT CCAGATTGAA AAAGATTACG ACTACGCGCG TGTGCTGATT AACGGCAAAC CGATTGCCGG TAATATCACG ACGATGGATG ATCCGTTTAA ATCAGGTTTA GTACCTGCCA TCTCAGGCCA ATCTGATGGC TGGGTAGATG CGCAATTTGA TCTCTCTGCT TGGGCAGGCC AAACCGTTGA ACTGGCATTT GATTACTTGA CGGATGGCGG TCTGGCCATG GAAGGTCTGT ATGTCGATGA CTTACGTCTT GAGGTGGATG GCAATCAGAC CTTGATCGAT AACGCAGAAG GCACATCCAG CTTTGCGTTC CAAGGTTTCA CCAAAAACGG TGGCTTCCAC GAAGCCAATC ACTATTACTT GCTGCAATGG CGCAGCCATA ATGACGTTGA CCAAGGCTTA GCCAATTTGA AACGCTTCGG GCAACTGATG TCATTCGAGC CGGGCTTGCT GGTGTGGTAT GTGGACGAAT CTTACGCGGA TAACTGGGTT GGCAAACATC CGGGTGAAGG CTGGCTAGGC GTGGTCGATG CCGACCAAAA TGCCTTGGTC TGGTCAAAAA CAGGGGAAGT GGCACAAACG CGTTTCCAAG TGCGTGATGC AACCTTCTCA CTGTTTGATC AAGCGCCGCT CAAACTGGTC ACGGCTGATG GCAATACGCT GGAAGATATG AACTTAACCG CGAATGCCTC GTTCTCGGAC GATCAAGATT ACAGCTCGCC TCAAGCTCCA GATTCTGGCC GCAAAGTGAT GCCATTTGGT TTGAAGATCG ACCTGCTCTC ACAAAGTAAA GAGAATGAGT ACGGTGTTGT TCGCTTGTCG AAAGTCACCA CGGAAAATAT CGCGCCTGTG GCTCGCTTTG AACTGAAAGT CGAGGGGCTC TCTGTGATGT CACAAAACAC CAGTAGTGAT AGCGATGGCA ATATCGTCAG TTATTTGTGG GATTTTGGTA ACGGTCAAAC CAGTACCGAA GCCGCTCCAA CTTGGTCATA TACCAAAGCA GGCAGTTACT CTGTCACTTT AACGGTGACG GATGACAAAG GCGATAGCGA TACTCATCAG CAAACCATTA AAGTGGACAC ACCGAATGCG TTACCACAAG CCAGTGCCAA CTATATCCAT TTAGGTCGCT GGGTCACCAT GTGGTCAACC AGCACCGACA GTGATGGCCG CATTGTCGAC ACCGAATGGA CACTGCCGAA TGGTAAAATT AAGCGGGGTC GTATGTTTAC TGCGATTTTC CCAAGCTATG GGCACCATGA TGTGCAGCTC AAAGTGATGG ATGACCGCGG CGCAGTCACC ACCATCACCA TCAAAGTCAA ACTGTAA
|
Protein sequence | MKTIKKTLLA AAIASFFSSG LYAQTPIDLG VVNEDKLIEM LVRTGQIPAD ASDVDKRIAL ERYLEEKIRS GFKGDAQFGK KALEQRAKIL KVIDKQKGPH KARVFALDVG QKRTDKVLAL LIDFPDLPWD DNRLTKEHTE MLYDRYEPSH YQDLLFSDKG YTGPNGENFI SMRQYYESES GNSYSVSGQA AGWYRASKNA AYYGGNSPGT NNDMNARELV REALDQLARD PNINLADYDI EDRYDYNGNG NFREPDGVID HLMIFHASVG EEAGGGVLGA DAIWSHRFNL GRYHVLEGTK SNVPGRFNGQ FAAFDYTIQP IDAAAGVCAH EYGHDLGLPD EYDTQYTGTG EPVSYWSIMS SGSWAGKIGG TQPTAFSSWA KQFLQNSIGG RWINHEQLSI NELEAKPRVV TLFQTTDNSR PNMVKVTLPM KRVEGIKPAE GEFSFYSNRG DDLKNRMSRP LTIPAGSQAT LRFKAWFQIE KDYDYARVLI NGKPIAGNIT TMDDPFKSGL VPAISGQSDG WVDAQFDLSA WAGQTVELAF DYLTDGGLAM EGLYVDDLRL EVDGNQTLID NAEGTSSFAF QGFTKNGGFH EANHYYLLQW RSHNDVDQGL ANLKRFGQLM SFEPGLLVWY VDESYADNWV GKHPGEGWLG VVDADQNALV WSKTGEVAQT RFQVRDATFS LFDQAPLKLV TADGNTLEDM NLTANASFSD DQDYSSPQAP DSGRKVMPFG LKIDLLSQSK ENEYGVVRLS KVTTENIAPV ARFELKVEGL SVMSQNTSSD SDGNIVSYLW DFGNGQTSTE AAPTWSYTKA GSYSVTLTVT DDKGDSDTHQ QTIKVDTPNA LPQASANYIH LGRWVTMWST STDSDGRIVD TEWTLPNGKI KRGRMFTAIF PSYGHHDVQL KVMDDRGAVT TITIKVKL
|
| |