Gene VC0395_A1104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A1104 
Symbolprc 
ID5137552 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp1160197 
End bp1162194 
Gene Length1998 bp 
Protein Length665 aa 
Translation table11 
GC content44% 
IMG OID640532562 
Productcarboxy-terminal protease 
Protein accessionYP_001217050 
Protein GI147673268 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID[TIGR00225] C-terminal peptidase (prc)
[TIGR01543] phage prohead protease, HK97 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000153479 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATGCC GTTCAAAAAT ATCTCTGATT GCTGCTAGCC TATGGCTGGC AGCCTTTTCA 
GCTCAGGCTC TAGAAGCCAA ACTCAAACCA GAAGATCTCC CTCTTCTTGT TCCTGAAGCT
CAACACGCAA CGGCAGCGAA ACGTGTTACC TCACGTTTTA CCCGTTCTCA TTACAAACAA
TTCAATTTAG ACGACCAGTT TTCTCAAGCC ATGTTTGAAC GCTACCTTGA GATGCTAGAT
TACAGTCGAA ATATCTTTAC TCAAGCGGAC ATAGACAGCT TCAAAGCTTG GTCTTTGCAA
TTGGATGACC AATTAAAAGC AGGTGACAAT CAAATCGCTT ACGATCTGTA TAACCTGTCA
ATGGAAAAAC GTTTTGAGCG CTTTCAATAC GCACTTTCTC TGCTTGATCA AGAGATGACG
TTTGATGCTG ATGAGTCTAT TGAGCTTGAT CGCACGAAAT CGCCTTGGCC AAAAGATCTT
AAAGAGATTA ACGAGCTGTG GAGACAACGA GTTAAATACG ATGCGTTAAG CTTGAAACTG
GCAGGTAAAG AGTGGCCAGA AATCAAAGAA ACGCTCGATA AACGCTACAA CAATGCCATC
AAGCGCCTCA CACAGACAAA AAGTGAAGAC GTATTTCAAA CTTATATGAA TGCGTTTGCT
CGTGAAGTTG ATCCGCATAC CAGCTATTTG TCACCGCGTA ACGCAGAACA ATTCCAATCT
GAAATGAATC TCTCGTTGGA AGGAATTGGT GCCGTGTTAC AAATGACCGA CGATTACACC
ATCATCCGCT CATTGGTTGC AGGTGGTCCT GCAGCATTGA GCAAACAATT GGGTGAGGGT
GACCGCATTA TCGGCGTCGG TCAAGAAGGC GAAGATGTGG TTGATGTAGT CGGTTGGCGA
TTAGACGATG TCGTTCAACT GATTAAAGGA CCTAAAGGTA GCAAGGTGAA ACTGCTCGTG
TTACCTGAAG GCAAAGACGC AAAAAGTCAC GTTGTCACTA TTGTGCGAGA TAAAATTCGC
TTAGAAGATC GCGCCGTAAA ATCTGAAGTG ATTGAAAAAG CAGGGAAGAA AATTGGTGTA
CTAGAAGTAC CGAGTTTCTA CGTTGGCTTA GCTCAAGACA CGGAAAAACT ACTGGCGGAG
CTAAAAGCGA AAAAAGTCGA CGGCATTATT GTTGATTTAC GCAATAACGG TGGTGGTGCA
TTAACCGAAG CTACCGCGCT TTCTGGTTTG TTCATTACCA GTGGCCCTGT AGTTCAGGTG
CGTGATAGCT ATGGTCGAGT CAACGTTAAC TCGGATACCG ATGGTAGCAT TAGCTATAGC
GGACCAATGA CCGTGCTGAT TAACCGCTAC AGTGCATCGG CTTCAGAAAT CTTTGCTGCC
GCAATGCAAG ACTACGGCCG CGCGATCATT CTCGGTGAGA ACTCATTTGG TAAAGGTACC
GTACAGCAGC ATCGCTCTCT CAATCATATC TATGATTTGT TTGATAAAGA GCTTGGCTAC
GTACAATACA CGATTCAAAA ATTTTACCGT ATTGATGGTG GTAGTACCCA AAACAAAGGT
GTCGTCCCTG ATATCGCGTA TCCCACCGCG ATTGACCCTT CCGAAACAGG GGAAAGTGTT
GAAGATAACG CACTACCGTG GGACAGCATT GATGAAGCAA AATATGAGCG TTTGAATAAC
TTCAACACCA TCATTGCTAG CTTGGAAGCT AAACACCAAC AACGTGTCGC GAATGATTTA
GAATTTGGTT TTATCGAGCA AGATATTGCG AAATACCGTG CAGAGAAAGA TGACAACCTA
CTTTCGCTGA ATGAAAAAGT ACGCAAAGAA GAGAGTGCTA AGGCTGATGA AGAGCGCTTA
GCTCGCATCA ATCAACGCCA AAAAGCGTTA GGTAAATCGA CCTATGCGAG CTTGCAAGAT
ATACCGAAAG ATTATGAAGC ACCGGATGCT TATCTCGATG AATCGGTTAA CATTATGCTT
GACATGATAT CGCGATAA
 
Protein sequence
MKCRSKISLI AASLWLAAFS AQALEAKLKP EDLPLLVPEA QHATAAKRVT SRFTRSHYKQ 
FNLDDQFSQA MFERYLEMLD YSRNIFTQAD IDSFKAWSLQ LDDQLKAGDN QIAYDLYNLS
MEKRFERFQY ALSLLDQEMT FDADESIELD RTKSPWPKDL KEINELWRQR VKYDALSLKL
AGKEWPEIKE TLDKRYNNAI KRLTQTKSED VFQTYMNAFA REVDPHTSYL SPRNAEQFQS
EMNLSLEGIG AVLQMTDDYT IIRSLVAGGP AALSKQLGEG DRIIGVGQEG EDVVDVVGWR
LDDVVQLIKG PKGSKVKLLV LPEGKDAKSH VVTIVRDKIR LEDRAVKSEV IEKAGKKIGV
LEVPSFYVGL AQDTEKLLAE LKAKKVDGII VDLRNNGGGA LTEATALSGL FITSGPVVQV
RDSYGRVNVN SDTDGSISYS GPMTVLINRY SASASEIFAA AMQDYGRAII LGENSFGKGT
VQQHRSLNHI YDLFDKELGY VQYTIQKFYR IDGGSTQNKG VVPDIAYPTA IDPSETGESV
EDNALPWDSI DEAKYERLNN FNTIIASLEA KHQQRVANDL EFGFIEQDIA KYRAEKDDNL
LSLNEKVRKE ESAKADEERL ARINQRQKAL GKSTYASLQD IPKDYEAPDA YLDESVNIML
DMISR