Gene VC0395_0089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_0089 
Symbol 
ID5134722 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009456 
Strand
Start bp97991 
End bp101245 
Gene Length3255 bp 
Protein Length1084 aa 
Translation table11 
GC content49% 
IMG OID640530412 
Producttricorn protease 
Protein accessionYP_001214930 
Protein GI147671883 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000582269 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATTTGC CGCATTTTGC TCTGAGCTCA GTGCTCTCGC TAACGTTGTG TGCGATACCG 
AATCTTGGTT TTGCAAGCTC CAACCAGACT GAAGATTCCA CGCAGGCTCG GCCGACTTGG
TTGAGGGATA TTGCGTTGTC CCCCGATGGT CAAAAGATAG CGTTTACTTA TGCAGGTCAA
ATTTGGCTGG TGCCGGCTCA AGGCGGTGAT GCCGTGGCGC TCACCGAAAG CGGTGTGTAC
AGCGAAACGC CGATTTGGTC ACCTGACAGC CAATCGATCG CTTTTACGGC TGATCGATAT
GGTCTTGGTG ATGTTTTTAT TCTCTCGATC CAAGGTGGTG AGAGTCGGCG CTTGACCTAC
CACGGAGCGA AAGATATTCC GTATGCGTTT TCGGCTGATG GTCAGCAGCT CTATTTCTCT
TCTCGCCGAC TGGGCGATGA TAAAGCCAAT GCGAACGTTA AACAGGGCAG CTTTATGGCT
CAGCTTTACT CGGTGCCTGC AGCGGGAGGA CGTGAGCAAC GCGTTTTACC TATTGCCATC
AGTGATTTAG CCATATCGCC TTCCCATAGT GACATCCTGT ATACCAATCA ACCCTCTGAT
GAACAGCCGT GGCGTAAAGG TGCGCTATCC GATGCTACAC GTGACATTTG GCAATGGTCC
CCGCTCACGG GCAAACATAC CCAAATCACC ACTTTTCGTG GCGAAGACCG CAACCCTGTG
TGGAGTGCAG ATGGCTCCTC TATGTACTAT CTGTCTGAAC AAGCGGGCAG TTTCAACGTC
TGGCAGCAGC GATTTGATGG TTCAGAACCC GTACAAATCA CCGACCACCA AAAGCTGCCT
GTTCGTTTCT TAAGTGCCAG CTTACAAGGC GATTTGGCCT ACGGTTTTGA TGGCGAAATT
TGGCGACTGA AAGCCGGAGC TAAGCAAGCA GAAAAAGTGC CGGTCTCCAT TCGTCGCAGC
GCGATGCCTG ATGGCCGTCA TAATGTGAAT TTTAACCTTG AAGCCACGGA AATGGTGGTC
GCGCCGAATG CCGCTGAAGT GGCGATTGTG GCACGTGGCG ATGTGTATGT GGTCTCTTTG
CTTTCCGGTT TAACGCAACG AATCACCGAT ACACCAGAAG CGGAGCGCGA TGTTTCCTTT
TCCAGTGATG GGTATCGTTT GATCTACGCT TCTGAGCGTG AGGGCAGTTG GAATCTTTAT
CAAAGTTATG TGAATGATGG TGGGAAAAGC TTCTCCTCCT CATTGGATAT TATCGAAGAG
CCGGTGCTGA CGACGGAGCA AGATGTCATT CAGCCACTCT ATTCTCCTAA TCTTAAACGG
ATTGTTTATC GCGAGAATCG TAATACGCTC AAAGTGTATG ACATCGAACA AGATAAAACC
TATACCTTGC TGGATGCTCA TGCCCTCTAC TCGTATTTCG ACAAAGATCT GAGTTACCAG
TGGTCGCCAG ATAGTGAGTA TATCGTCACT CGCGATCGGG CGATGTCGAA TGGCGACATT
CAGCTACTTA AATTTGATGG AAGTGAAGCG CCTATCAATT TGAGCCAAAG TGGTTTTTCT
GAGTTTGCCC CACAATTTAG TGCCGATGGA CAGTGGGTTT ACTGGCTAAC CGATGCGAAA
GGTTTGCGCG ATATTGATGA TATGGTGGTG CAGTATGATG TGTACGGTGT TGCGCTCAAT
CGCGAAGCCA AATTCAATTT CAATAAAACC CAAGAGCAGT TGTGGCTTGA AGAGGAAATC
GCTGCGGAGA AAAACCTTGG GCCTGGCCAA AACCCACCAG CAGAGTTAAC GGTGGTGGAA
AATAAAGGGC TTAAGCAGCG CACTATGCGT ATGACTCCTA CCTCACTCAA TATTATTTTC
AAGCACTTAA CGCACGATAA CCAAGCACTG ATCATCGCTT ATCAGTTGGG AGACTCAGTA
CAAATTTCAG AAATCAATCT GCGCAGTGGG GAGATGACGG CGCTGTTTAA CCGCTTGAGT
GAAGATGCGG CTTTACTTGC GATGGCTTCT GATGATGCGA GCCTTCTAAT CATGGGGGAG
CACGGTATCG AGAATCTGAA TGTCTTGACG GGTGAGAGCA AGTTTGTTCG TTATGAAGCC
AAGGCTAATT TCGATTTTCG TGCAGAAATC GCCTACTTGT TTGATCATGT CTGGCGACTT
ACTCAAACCA AATTTTATGA CCCGCAAATG CATGGTGTAG ATTGGCAACA GTACGGTGAT
TTGTATCGTA AACATCTGCC GAGCATCCGT ACCTACAGCG ATTTTGCCGA GTTACTGAGT
GAAATGGTCG GGGATTTGAA TGTCTCCCAT ACAGGAGCCT TCTTTATGGC TGGTAACTCC
AGTTGGGAGG AGCCCGCATC CTTAGGGCTT TACTATGATG ATCGTTACCG AGGTAAGGGC
GTGCGTGTGA AATCACTGCT ACCCGGAGGC CCTGCAGATA CTTATCAATC TCCGATCAAG
GCAGGAGCGA TTATTTACTC CGTAAACGGG AAAGAAATTA GCGATCAACA GGATATTTAT
CCGTTCCTGA ATTTTACCCA AGGTAAATTA ACTCGTTTAA GTGTGCTGGT ACCGGGGGAG
GAGAAAGCGC AGAACTTTAC GTTAGTGCCG ATCACTCTTG AAGAGGAAAG TGAGCTGCTT
TATGAGCAGT GGGTTGAACA GCGCCGAGCC TTAGTGGAGA CGCTCTCCGA TGGACGCCTT
GGCTATGTGC ATCTTGCCGC GATGGATGCT GCCAGTTTTG AGCAGATGCA AAATGACATG
TTTGGCCTTG AAAAAGACAA ACTCGGCTTA GTGGTCGACG TGCGTTTTAA TGCTGGAGGT
TGGCTGCATG ATCAAGTGAT GGAGATATTG TCTGGAACGC GACATTCAGT CATGCAGACA
CGTGATGGTT ATGTAGTCTC CTCTTTCCCT GAACGTCGCT GGGCGAAACC GAGCATTATG
CTCGCCAACG CAGACAGTTA TTCAGACGGT TCGATCGTAC CGTATTTCTA TCAAAAAGAA
GGGCTAGGAA AGCTAGTGGG CGAAAGGGTT CCCGGTACTG GCACGGCAGT GATTTGGGAG
CAGCAACAAG AGCCCGGATT GATTTACGGA GTGCCGCAAC TCGGTATCAA GGATGAGCAA
GGTCGTTGGT TTGAAAACCA AGAAATCATC CCAGATATTT TGGTTTATAA CGATCCGGAA
TCGGTTGTCG CCGGCGAGGA TCGCCAGTTG GCTGCCGCCG TAGAAGCCCT GCTGCTCGAG
ATCTCTTCAA AATAA
 
Protein sequence
MHLPHFALSS VLSLTLCAIP NLGFASSNQT EDSTQARPTW LRDIALSPDG QKIAFTYAGQ 
IWLVPAQGGD AVALTESGVY SETPIWSPDS QSIAFTADRY GLGDVFILSI QGGESRRLTY
HGAKDIPYAF SADGQQLYFS SRRLGDDKAN ANVKQGSFMA QLYSVPAAGG REQRVLPIAI
SDLAISPSHS DILYTNQPSD EQPWRKGALS DATRDIWQWS PLTGKHTQIT TFRGEDRNPV
WSADGSSMYY LSEQAGSFNV WQQRFDGSEP VQITDHQKLP VRFLSASLQG DLAYGFDGEI
WRLKAGAKQA EKVPVSIRRS AMPDGRHNVN FNLEATEMVV APNAAEVAIV ARGDVYVVSL
LSGLTQRITD TPEAERDVSF SSDGYRLIYA SEREGSWNLY QSYVNDGGKS FSSSLDIIEE
PVLTTEQDVI QPLYSPNLKR IVYRENRNTL KVYDIEQDKT YTLLDAHALY SYFDKDLSYQ
WSPDSEYIVT RDRAMSNGDI QLLKFDGSEA PINLSQSGFS EFAPQFSADG QWVYWLTDAK
GLRDIDDMVV QYDVYGVALN REAKFNFNKT QEQLWLEEEI AAEKNLGPGQ NPPAELTVVE
NKGLKQRTMR MTPTSLNIIF KHLTHDNQAL IIAYQLGDSV QISEINLRSG EMTALFNRLS
EDAALLAMAS DDASLLIMGE HGIENLNVLT GESKFVRYEA KANFDFRAEI AYLFDHVWRL
TQTKFYDPQM HGVDWQQYGD LYRKHLPSIR TYSDFAELLS EMVGDLNVSH TGAFFMAGNS
SWEEPASLGL YYDDRYRGKG VRVKSLLPGG PADTYQSPIK AGAIIYSVNG KEISDQQDIY
PFLNFTQGKL TRLSVLVPGE EKAQNFTLVP ITLEEESELL YEQWVEQRRA LVETLSDGRL
GYVHLAAMDA ASFEQMQNDM FGLEKDKLGL VVDVRFNAGG WLHDQVMEIL SGTRHSVMQT
RDGYVVSSFP ERRWAKPSIM LANADSYSDG SIVPYFYQKE GLGKLVGERV PGTGTAVIWE
QQQEPGLIYG VPQLGIKDEQ GRWFENQEII PDILVYNDPE SVVAGEDRQL AAAVEALLLE
ISSK