Gene VC0395_A1568 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A1568 
Symbol 
ID5136738 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp1686903 
End bp1688750 
Gene Length1848 bp 
Protein Length615 aa 
Translation table11 
GC content49% 
IMG OID640533025 
Productputative peptidase 
Protein accessionYP_001217509 
Protein GI147674907 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1164] Oligoendopeptidase F 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.000239383 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGCGA CACATTACCT CAATCAACTT AACCACCGTT ATCTGAACAT CCATCGTGTC 
AAAGAAGATT TCTTCTGGGA TACCTACATG GGGCTGAGTT ATGATCATGT GGGTTCCGCG
CAAGCACAAA CCGAGTGGAC TCAGTTTCTG AGCAATGGTG CGCGGATTGA GGAAATTCGC
CAGCAAATCG AATTGGCAGA ACAGATCACC GATAGCGAAG AAAAAGCGCA AACCTTAACG
GGCTTACAAG GCTGGCTCGC GATGTTTGAA AGTCATGCGC TGGAATCAGA GCAAGCGCAG
TCACTCAAAG CCGGATTAAT CCAATTTGAG GCCGATCTGT TTGAGAAAAA ACAGAAGCAT
GTGCTGACTT ATACCAACGA ACAGGGCGAA GCGGTAGAAG CCTCGATTGT CACTTTAGGT
TCCACGGTGC GTACTCATGA TCAAGAAGCA GTACGTCGCA GCGCCCATCA GGCTTTCTTA
GGCTTAGAAC AGTGGTTATT ACAAAATGGA TTGTTAGAGC TGGTGAAGCG TCGTAACCAT
TTTGCGCGCA GCTTAGGTTA TAAAACCTTT TTTGATTACT CGGTTGCCAA AAAAGAGAAG
ATGACCACTG AGCAACTGTT CACCATTTTG GATGATTTTG AACAGCGCAC GCGTGATCGT
CACTTCACTA GCTTAGCGGA ACTTGCACAA AGTAAAGGTG AGCAAGCGCT GCAAGGACAT
AACTTCATCT ATTCGTTTGC GGGCGATGTG ATGCGTGAGC TGGACCCTTA CGTGCCATTT
TCACAATCGC TGCGTCGTTG GGTGGAGTCT TTTGGTCGCC TCAATATTGA ATTTAGTGGC
GCTGAGCTGA CCCTTGATTT GCTGGATCGT AAAGGTAAAT ATCCAAACGG TTTCTGCCAT
GGGCCAATTC CGGCGTTTTA CGACCAAGGT CAGTGGGTCG CGGCGAAAGT TAATTTCACC
AGTAATGCTA AGCCAGATCA GGTAGGCAGT GGTTACGATG GCATCAACAC CTTGTTCCAT
GAAGGTGGGC ACGCGGCGCA CTTTGCGAAT GTGAAGATGA ACGCGCCGTG TTTTTCACAA
GAATTTGCAC CCACTTCGAT GGCGTACGCT GAAACGCAAT CCATGTTTTG CGATAGTCTA
TTGATGGATG CTGACTGGTT GAAAACCTAT GCCAAAGATG TACAAGGTAA CCCAGTGCCA
GATGAGCTCA TCAAAGCCAT GGTATTTAGC CGCCAACCGT TTAAAGCTTA CGAAGAGCGC
AGTATTTTGC TCGTGCCGTA TTTTGAGCGA GCACTGTATG AACAGAGCGA GGAAGAGCTG
ACTGCAGAGC GCGTCACTGA ACTGGCCAGA GCCACTGAAA AGCGGATTAT CGGCCTTGAA
TGCAGTCCGC GTCCGCTGAT GGCGATCCCG CATCTGTTAT CCGATGAATC GGCGTGTGCT
TATCACGGTT ACTTGTTGGC GCACATGGCG GTGTACCAAA CTCGCGCTTA TTTCCTTGGG
CAGTTTGGTT ATTTCACCGA TAACCCAGCG ATTGGGCCAC TACTGGCGAA GCATTATTGG
CAGCAAGGCA ATGCACTTTC TCACAATGAC ACTATTGTGA ACCTCACTGG TGAAGGCTTT
AATGCTCGCT ATCTTGCCGA TGCCTGTAAC TTAACTCCGG ATGAAGCATG GCAAAAGCAG
CAGCACAAAA TGGCGCAATT GGCGACGCGT GAGCAAACCA AACCCGCTTC ATTAAATGCG
CAGATTCGAG TGATTGATGG TGCCACTGAG CTGGCTTCTA ACCGTGATTC GGATGAAGAG
ATGTGCCGCC AATTTGAGGC GTACATTGCC AAACATTACG GTTGCTAA
 
Protein sequence
MSATHYLNQL NHRYLNIHRV KEDFFWDTYM GLSYDHVGSA QAQTEWTQFL SNGARIEEIR 
QQIELAEQIT DSEEKAQTLT GLQGWLAMFE SHALESEQAQ SLKAGLIQFE ADLFEKKQKH
VLTYTNEQGE AVEASIVTLG STVRTHDQEA VRRSAHQAFL GLEQWLLQNG LLELVKRRNH
FARSLGYKTF FDYSVAKKEK MTTEQLFTIL DDFEQRTRDR HFTSLAELAQ SKGEQALQGH
NFIYSFAGDV MRELDPYVPF SQSLRRWVES FGRLNIEFSG AELTLDLLDR KGKYPNGFCH
GPIPAFYDQG QWVAAKVNFT SNAKPDQVGS GYDGINTLFH EGGHAAHFAN VKMNAPCFSQ
EFAPTSMAYA ETQSMFCDSL LMDADWLKTY AKDVQGNPVP DELIKAMVFS RQPFKAYEER
SILLVPYFER ALYEQSEEEL TAERVTELAR ATEKRIIGLE CSPRPLMAIP HLLSDESACA
YHGYLLAHMA VYQTRAYFLG QFGYFTDNPA IGPLLAKHYW QQGNALSHND TIVNLTGEGF
NARYLADACN LTPDEAWQKQ QHKMAQLATR EQTKPASLNA QIRVIDGATE LASNRDSDEE
MCRQFEAYIA KHYGC