Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A1869 |
Symbol | pepD |
ID | 5137624 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | + |
Start bp | 1987265 |
End bp | 1988869 |
Gene Length | 1605 bp |
Protein Length | 534 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640533326 |
Product | aminoacyl-histidine dipeptidase |
Protein accession | YP_001217793 |
Protein GI | 147674070 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2195] Di- and tripeptidases |
TIGRFAM ID | [TIGR01893] aminoacyl-histidine dipeptidase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.550605 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGACCTACA CCCTGCCCCT GCACAGTGAT ATGCTGTGCA ACGAAAGTAC CTACCTGTTA CAGGCAAATT CTCTGCCACG ACCCAGCTCT GGAGAGAGGA TACACGGGAA AAATTACAGT AAGGAGTCAT CTGTGTCTGA GTTCCAAACC GAAATCAGTA AGTTATCGTC AAATCCAATT TGGCCATTTT TCGCCACTAT CTGTTCCATC CCGCACCCTT CAAAACATGA AGAGGCATTA GCTCAATACA TTATCAACTG GGCTAAAGAA CAAGGATTGG CCGTTCGTCG TGATGAGACC GGTAACGTCT TTATTAAAAA GCCAGCGACA CCGGGCATGG AAAATCGTAA AGGTGTGGTA CTTCAAGCGC ACATTGATAT GGTGCCGCAA AAAAATGAAG ACACAGTGCA TGATTTCACC AAAGATCCGA TCCAGCCTTA TATTGATGGT GAATGGGTTA CTGCTAAAGG CACTACGCTT GGCGCGGATA ATGGTATCGG CATGGCTTCT TGCCTAGCAG TACTGGCTTC TAAAGAGATC CAACATGGTC CAATTGAAGT TCTGCTGACT ATTGATGAAG AAGCAGGCAT GACCGGCGCT TTTGGCCTCA AAGAAGGTTG GCTGGAGGGC GACATTCTGC TCAATACCGA CTCTGAGCAA GAAGGTGAAG TCTATATGGG CTGCGCGGGA GGCGTGAACG CCGAGTTCAC TTTCTCCATT GAGCGTGAAG CGATCCCTGC TGGTTATGTT GGCCGCCAAC TGATCTTAAA GGGTTTGAAA GGCGGTCACT CAGGTTGTGA TATTCACACT GGCCGTGGTA ACGCTAACAA GCTGATGGCG CGCTTTCTCG CAGGCCATGC GAAAGAATTA GATCTGCGCT TAGTCGAATT CCGTGGCGGT AGTCTACGTA ATGCGATCCC GCGTGAAGCT TTTGTCACCG TCGCCTTGCC AGAACAGCAC GTAGCCGAAT TAGAAACCTT ATTCCACCGC TACACTGAGC TACTCAAAGC TGAACTGGGT AAGGTTGAAA CTCACTTGGT AACTTTCCTT GAAGCCAAAG AACTGCAAAG TGAAGTGCTG ACCGCGCACA CTCAACAACG TTTTGTTGCC GCTCTGAACA CGTGTCCAAA CGGTGTGATC CGCATGAGCG ATGATATTGC AGGTGTTGTA GAAACCTCAC TCAACGTGGG AGTGATCACC ACAGAAGCCA ACAAAATCAA AGTGCTGTGC TTGATTCGCT CCCTAATGGA CTCAGGCCGC CACCAAGTCG AGGGCATGTT GCAATCGCTG GCACAACTTG CGGGGGCAGA GCTGGACCTT TCTGGTGCTT ACCCTGGCTG GAAACCCGAT GCTGATTCTG AAATCATGCA TATTTTCCGT GATATGTATG AAGGCATTTA TGGCCACAAA CCGAATATCA TGGTGATCCA CGCGGGTCTT GAGTGTGGGC TGTTCAAAAA ACCCTATCCA AACATGGATA TGGTCTCTTT CGGTCCAACC ATCAAGTTCC CACATTCACC GGATGAAAAA GTGAAGATAG ACACGGTTGA TCTGTTCTGG CAACAGATGG TGGCACTACT CGCCAATATC CCAGTGAAAG CCTAA
|
Protein sequence | MTYTLPLHSD MLCNESTYLL QANSLPRPSS GERIHGKNYS KESSVSEFQT EISKLSSNPI WPFFATICSI PHPSKHEEAL AQYIINWAKE QGLAVRRDET GNVFIKKPAT PGMENRKGVV LQAHIDMVPQ KNEDTVHDFT KDPIQPYIDG EWVTAKGTTL GADNGIGMAS CLAVLASKEI QHGPIEVLLT IDEEAGMTGA FGLKEGWLEG DILLNTDSEQ EGEVYMGCAG GVNAEFTFSI EREAIPAGYV GRQLILKGLK GGHSGCDIHT GRGNANKLMA RFLAGHAKEL DLRLVEFRGG SLRNAIPREA FVTVALPEQH VAELETLFHR YTELLKAELG KVETHLVTFL EAKELQSEVL TAHTQQRFVA ALNTCPNGVI RMSDDIAGVV ETSLNVGVIT TEANKIKVLC LIRSLMDSGR HQVEGMLQSL AQLAGAELDL SGAYPGWKPD ADSEIMHIFR DMYEGIYGHK PNIMVIHAGL ECGLFKKPYP NMDMVSFGPT IKFPHSPDEK VKIDTVDLFW QQMVALLANI PVKA
|
| |