Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A0284 |
Symbol | pepB |
ID | 5135218 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | + |
Start bp | 297139 |
End bp | 298473 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640531742 |
Product | aminopeptidase B |
Protein accession | YP_001216240 |
Protein GI | 147675760 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0260] Leucyl aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00000940358 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGATGGC ACAGGCAAAG CCAATATTTA AGACAAGGAG AAACCATGTC TACACAGATG TCTGTATTTT TGAGTACTCA AGCTGCCCAG CCTCAGTGGG GAGAGAAAGC GCTCATTTCT TTTGCAGAGC AAGGTGCAAC CATTCACCTG CAACAAACAC AAGATTTCAG CGCAATCCAA CGCGCAGCTC GTAAGTTAGA TAATCAAGGT ATCCGCACGG CATTCCTGGC TGGAGAAGGT TGGGATCTGG AGAGCATTTG GGCTTTCTAT CAAGGCTACC GTGATGCGAA AAAGCGCAAT ACCGTCGAGT GGAAAGCGTT AGCGGCTGCT GAACAAGCAG AGCTAGAAGC GCGCATTAAA GCGACGGATT GGACACGCGA TATCATCAAC AAAAGCGCGG AAGAAGTCGC GCCACGCCAA TTGGCGACCA TGGCAGCAGA GTTCATCAAA TCGCTTGCGC CTGACCATGT TTCTTACCGT ATCGTCAAAG ACAAAGATCT GCTCACTGAA GGGTGGGAGG GGATTTACGC TGTAGGCCGT GGCTCTGAGC GTACGTCGGC GATGCTGCAA CTCGACTACA ACCCAACGGG CGATGAAAAT GCACCGGTAT TCGCGTGTTT AGTCGGTAAA GGCATCACTT TTGACTCGGG TGGTTACAGC TTAAAACCTT CCAACATGAT GTCAGCGATG AAAGCGGACA TGGGCGGCTC AGGCATGATC ACTGGTGCGC TTGGTTTGGC TATCATGCGC GGCTTTAACA AGCGCGTGAA ACTCATTCTA TGCTGCGCGG AAAACATGGT TTCCGGCCGT GCGTTGAAGC TTGGTGACAT CATCACCTAC AAAAATGGCA AAACCGTTGA AATCATGAAC ACCGATGCGG AAGGCCGTTT GGTGCTGGCC GACGGTCTTA TCTACGCCAG TGAACAGAAA CCGCAATTGA TTATCGACTG TGCAACCTTA ACCGGAGCGG CGAAAAACGC GCTGGGTAAT GATTACCACG CACTGCTTTC TTATGATGAG TCGCTGAGCC AACAAGCATT ATCTGCGGCA AAAGAAGAGA ATGAAGCGCT GTGGGCTCTG CCTTTAGCTG AGTTCCACCG TGAAATGCTG CCTTCTAACT TTGCGGATCT GTCAAACATC AGTAACGGCG ATTACACGCC GGGAGCCAGC ACCGCAGCGG CCTTCCTTTC CTATTTCGTG GAAGGCTACC AAAAAGGTTG GCTACACTTC GATTGTTCAG CCACGTATCG CAAGTCAGCC AGCGATAAAT GGGCTGCAGG AGCCACGGGC ATGGGCGTGA AAATGCTCGC ACGTATTTTG ATGCAGCAAG CATAA
|
Protein sequence | MRWHRQSQYL RQGETMSTQM SVFLSTQAAQ PQWGEKALIS FAEQGATIHL QQTQDFSAIQ RAARKLDNQG IRTAFLAGEG WDLESIWAFY QGYRDAKKRN TVEWKALAAA EQAELEARIK ATDWTRDIIN KSAEEVAPRQ LATMAAEFIK SLAPDHVSYR IVKDKDLLTE GWEGIYAVGR GSERTSAMLQ LDYNPTGDEN APVFACLVGK GITFDSGGYS LKPSNMMSAM KADMGGSGMI TGALGLAIMR GFNKRVKLIL CCAENMVSGR ALKLGDIITY KNGKTVEIMN TDAEGRLVLA DGLIYASEQK PQLIIDCATL TGAAKNALGN DYHALLSYDE SLSQQALSAA KEENEALWAL PLAEFHREML PSNFADLSNI SNGDYTPGAS TAAAFLSYFV EGYQKGWLHF DCSATYRKSA SDKWAAGATG MGVKMLARIL MQQA
|
| |