Gene VC0395_A0284 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A0284 
SymbolpepB 
ID5135218 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp297139 
End bp298473 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content51% 
IMG OID640531742 
Productaminopeptidase B 
Protein accessionYP_001216240 
Protein GI147675760 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0260] Leucyl aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000940358 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGATGGC ACAGGCAAAG CCAATATTTA AGACAAGGAG AAACCATGTC TACACAGATG 
TCTGTATTTT TGAGTACTCA AGCTGCCCAG CCTCAGTGGG GAGAGAAAGC GCTCATTTCT
TTTGCAGAGC AAGGTGCAAC CATTCACCTG CAACAAACAC AAGATTTCAG CGCAATCCAA
CGCGCAGCTC GTAAGTTAGA TAATCAAGGT ATCCGCACGG CATTCCTGGC TGGAGAAGGT
TGGGATCTGG AGAGCATTTG GGCTTTCTAT CAAGGCTACC GTGATGCGAA AAAGCGCAAT
ACCGTCGAGT GGAAAGCGTT AGCGGCTGCT GAACAAGCAG AGCTAGAAGC GCGCATTAAA
GCGACGGATT GGACACGCGA TATCATCAAC AAAAGCGCGG AAGAAGTCGC GCCACGCCAA
TTGGCGACCA TGGCAGCAGA GTTCATCAAA TCGCTTGCGC CTGACCATGT TTCTTACCGT
ATCGTCAAAG ACAAAGATCT GCTCACTGAA GGGTGGGAGG GGATTTACGC TGTAGGCCGT
GGCTCTGAGC GTACGTCGGC GATGCTGCAA CTCGACTACA ACCCAACGGG CGATGAAAAT
GCACCGGTAT TCGCGTGTTT AGTCGGTAAA GGCATCACTT TTGACTCGGG TGGTTACAGC
TTAAAACCTT CCAACATGAT GTCAGCGATG AAAGCGGACA TGGGCGGCTC AGGCATGATC
ACTGGTGCGC TTGGTTTGGC TATCATGCGC GGCTTTAACA AGCGCGTGAA ACTCATTCTA
TGCTGCGCGG AAAACATGGT TTCCGGCCGT GCGTTGAAGC TTGGTGACAT CATCACCTAC
AAAAATGGCA AAACCGTTGA AATCATGAAC ACCGATGCGG AAGGCCGTTT GGTGCTGGCC
GACGGTCTTA TCTACGCCAG TGAACAGAAA CCGCAATTGA TTATCGACTG TGCAACCTTA
ACCGGAGCGG CGAAAAACGC GCTGGGTAAT GATTACCACG CACTGCTTTC TTATGATGAG
TCGCTGAGCC AACAAGCATT ATCTGCGGCA AAAGAAGAGA ATGAAGCGCT GTGGGCTCTG
CCTTTAGCTG AGTTCCACCG TGAAATGCTG CCTTCTAACT TTGCGGATCT GTCAAACATC
AGTAACGGCG ATTACACGCC GGGAGCCAGC ACCGCAGCGG CCTTCCTTTC CTATTTCGTG
GAAGGCTACC AAAAAGGTTG GCTACACTTC GATTGTTCAG CCACGTATCG CAAGTCAGCC
AGCGATAAAT GGGCTGCAGG AGCCACGGGC ATGGGCGTGA AAATGCTCGC ACGTATTTTG
ATGCAGCAAG CATAA
 
Protein sequence
MRWHRQSQYL RQGETMSTQM SVFLSTQAAQ PQWGEKALIS FAEQGATIHL QQTQDFSAIQ 
RAARKLDNQG IRTAFLAGEG WDLESIWAFY QGYRDAKKRN TVEWKALAAA EQAELEARIK
ATDWTRDIIN KSAEEVAPRQ LATMAAEFIK SLAPDHVSYR IVKDKDLLTE GWEGIYAVGR
GSERTSAMLQ LDYNPTGDEN APVFACLVGK GITFDSGGYS LKPSNMMSAM KADMGGSGMI
TGALGLAIMR GFNKRVKLIL CCAENMVSGR ALKLGDIITY KNGKTVEIMN TDAEGRLVLA
DGLIYASEQK PQLIIDCATL TGAAKNALGN DYHALLSYDE SLSQQALSAA KEENEALWAL
PLAEFHREML PSNFADLSNI SNGDYTPGAS TAAAFLSYFV EGYQKGWLHF DCSATYRKSA
SDKWAAGATG MGVKMLARIL MQQA