Gene VC0395_0075 details


Gene Information

Locus tag: VC0395_0075
Symbol: ptrB
ID: 5134047
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio cholerae O395
Kingdom: Bacteria
Replicon accession: NC_009456
Strand:
Start bp: 81858
End bp: 83855
Gene Length: 1998 bp
Protein Length: 665 aa
Translation table: 11
GC content: 51%
IMG OID: 640530398
Product: protease II
Protein accession: YP_001214916
Protein GI: 147672486
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1770] Protease II
TIGRFAM ID:
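
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state its convention) and that the gene length includes one terminal stop codon. The function names are illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS whose length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
gene_len = span_length(81858, 83855)
print(gene_len)                  # 1998
print(protein_length(gene_len))  # 665
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.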


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000000000167862
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACTGC TCAATGTATT GCTGTTAACG ACATCTCTGT CGGTTTTTTA TACTTATGCA 
GAGACGGATT ATCAATGGTT ACGCGATGAC TCGCGTTCGG AACCTGCGGT GAAGCAGTTT
TTGGCTGAGC ATAATCGAAA AACCGATCAT TGGTTTGCAC CTGCCAAGCC ACTGGTGCAA
GAGTTGGTCA ATGAATGGCA ACAAACTTCA CAGCATAAAG CGCCTCCGCC TGCGCTTATC
TACGCAAACC AGCAGTACAA CGATATTCAA TGGAATGGTC ATCGGCACAT CGTCAAGATA
GGTGCTCAAG GCCAAATTGA GCCTCTGCTG AATCTTAGCG CGCGTGCAGA GCCGTTTGAT
TATTACCAAT TGGCTTCTTG GTCACTTGAT CGCTCAGTCC AATCGGTAGC GTTGGCAGAA
GATACGCGGG GTGACGAGCA GTTTAAGCTG ACGATCGTGC GTTTAGCCGA TCGCACCGAG
CAGATCGTTT CAGAAACAGC GAGCACTTAC TTTGCTTGGG CGGCCGATGG CAAAAGCCTC
TACTACTTAT CTGATCTCAA TGGGTCTACC CAGCTGCAGC GTTTTGAGCT AGAAACGGGT
CAATCAACGA GGCTTGCAGA GTGGCGCTCG GCAGAGTGGC TGTTCTCGCT CTATTCTGCA
AGCAATCCAC GCTATATCGT GGTACAGCAA AATAATGAAA ACTCGACTCA GCAGCGCCTG
CTGGATACCC AAACCGGTGA GCTGATGCCA TGGCTACGCA CCACTGAGCT GGGGCTGGAA
TATTATGCCG ATGTGCTGGG TGAGACACTT TACATCAATA GCAACCATGA GGGGGCATTT
CGTCTCTATC GTCAGCCGTT ACACACTAAA CAGGAATGGC AAAGCGTCAC AACACATAAA
GAAATCGGCT CACTGAGCAA CTTTTATCTG TTTGATGCTG GGATTGTGTT GGTGGAGAAC
CAAACTCTTG CACCGAAAGT TTGGGTTCTC GATAGCCAAG GCGAAGTGCG TACTCACTTT
GAACTGCGCG ATTTAGGTCA AGTGGCGTGG ATCTCTCGCA ATGGTGATGC TGCCAGTAAT
CGGCTGCGTG TACGTGCAAT GTCAATGACG GAGCCTGCTA GCTGGCATGA GTTGGATGTG
GCACAGTTAC AGTGGCAACA GTTAAGCCAA GATCACTACG CAGACTTTGA CCCGAAACAG
TATCAAACCC AAACGGTATG GGTGACGCAA GGTGCCATCC AAGTTCCGGT GACACTGGCC
TACCGCTCTG ACAAACTGAC CCCCAACAGC AGTGTGGTGC TGTATGGCTA TGGCGCTTAT
GGCGTGACGA TGAAGCCCTA TTTCATGCCA CAAATGGTCA GTTTGCTTGA TCGAGGCATG
ATTTACGCGA TCGCTCATGT TCGTGGTGGT GGATACCTTG GCGAGGCTTG GTATCAAGCT
GGCGCTGGAC TCAATAAACA AAACGGCATT GATGATTTCC TCGCGGCCGC TCGATATCTC
ACCCATTTTC AGCAAGGTGA GCGCGCCATT TATGCGATCG GCGGAAGTGC CGGCGGCACC
TTGGTTGCTG CGGCGCTCAA TCAGCAGCCC AACCTATTTG CGGGAGCTGT GCTGCAAGTG
CCGTTTGTCG ATGTGTTAGC CAGTATGAGT GATACCAGTC AAGCCTTGAC GGCGCAGCAG
TATCAAGAAT GGGGTAATCC TCAACAGCCA GAGCAGCGTC AAGTGATGCA AGCTTATGAT
CCATTCAGCA ATCTACGTGC TGCTCCTTAC CCTCCGACGT TGGTTAATGT CGGTTGGTGG
GACAATCGAG TGCCCTATTG GGAAGGGGCT CGCTATTTGG CACGTTTGAG TGATGTCTCA
CAAGGGGCTG GTCCTTACCT TTTATCAACC GATTTTCAGG CGGGTCACGC CAGTGATCGG
CGTCAAGCGC TTGAAAAGCA GGCGCGTGAA TATGCGTTCT TTCTCACCTT AGATAAAACC
AGAAAAGCGG GGCAGTAA
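
The gene sequence is displayed in groups of 10 bases, so the spacing has to be removed before any computation. The sketch below shows one way to do that and to compute a GC fraction such as the one reported in the GC content field. The helper names are hypothetical, and the demonstration uses only the first line of the sequence, so its result is not expected to match the value for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a sequence shown in 10-base groups."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "ATGAAACTGC TCAATGTATT GCTGTTAACG ACATCTCTGT CGGTTTTTTA TACTTATGCA"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Passing the full 1998 bp sequence through the same two functions would give the gene-wide figure for comparison with the record.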
 
Protein sequence
MKLLNVLLLT TSLSVFYTYA ETDYQWLRDD SRSEPAVKQF LAEHNRKTDH WFAPAKPLVQ 
ELVNEWQQTS QHKAPPPALI YANQQYNDIQ WNGHRHIVKI GAQGQIEPLL NLSARAEPFD
YYQLASWSLD RSVQSVALAE DTRGDEQFKL TIVRLADRTE QIVSETASTY FAWAADGKSL
YYLSDLNGST QLQRFELETG QSTRLAEWRS AEWLFSLYSA SNPRYIVVQQ NNENSTQQRL
LDTQTGELMP WLRTTELGLE YYADVLGETL YINSNHEGAF RLYRQPLHTK QEWQSVTTHK
EIGSLSNFYL FDAGIVLVEN QTLAPKVWVL DSQGEVRTHF ELRDLGQVAW ISRNGDAASN
RLRVRAMSMT EPASWHELDV AQLQWQQLSQ DHYADFDPKQ YQTQTVWVTQ GAIQVPVTLA
YRSDKLTPNS SVVLYGYGAY GVTMKPYFMP QMVSLLDRGM IYAIAHVRGG GYLGEAWYQA
GAGLNKQNGI DDFLAAARYL THFQQGERAI YAIGGSAGGT LVAAALNQQP NLFAGAVLQV
PFVDVLASMS DTSQALTAQQ YQEWGNPQQP EQRQVMQAYD PFSNLRAAPY PPTLVNVGWW
DNRVPYWEGA RYLARLSDVS QGAGPYLLST DFQAGHASDR RQALEKQARE YAFFLTLDKT
RKAGQ