Gene VC0395_A1869 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A1869 
SymbolpepD 
ID5137624 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp1987265 
End bp1988869 
Gene Length1605 bp 
Protein Length534 aa 
Translation table11 
GC content49% 
IMG OID640533326 
Productaminoacyl-histidine dipeptidase 
Protein accessionYP_001217793 
Protein GI147674070 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2195] Di- and tripeptidases 
TIGRFAM ID[TIGR01893] aminoacyl-histidine dipeptidase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.550605 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGACCTACA CCCTGCCCCT GCACAGTGAT ATGCTGTGCA ACGAAAGTAC CTACCTGTTA 
CAGGCAAATT CTCTGCCACG ACCCAGCTCT GGAGAGAGGA TACACGGGAA AAATTACAGT
AAGGAGTCAT CTGTGTCTGA GTTCCAAACC GAAATCAGTA AGTTATCGTC AAATCCAATT
TGGCCATTTT TCGCCACTAT CTGTTCCATC CCGCACCCTT CAAAACATGA AGAGGCATTA
GCTCAATACA TTATCAACTG GGCTAAAGAA CAAGGATTGG CCGTTCGTCG TGATGAGACC
GGTAACGTCT TTATTAAAAA GCCAGCGACA CCGGGCATGG AAAATCGTAA AGGTGTGGTA
CTTCAAGCGC ACATTGATAT GGTGCCGCAA AAAAATGAAG ACACAGTGCA TGATTTCACC
AAAGATCCGA TCCAGCCTTA TATTGATGGT GAATGGGTTA CTGCTAAAGG CACTACGCTT
GGCGCGGATA ATGGTATCGG CATGGCTTCT TGCCTAGCAG TACTGGCTTC TAAAGAGATC
CAACATGGTC CAATTGAAGT TCTGCTGACT ATTGATGAAG AAGCAGGCAT GACCGGCGCT
TTTGGCCTCA AAGAAGGTTG GCTGGAGGGC GACATTCTGC TCAATACCGA CTCTGAGCAA
GAAGGTGAAG TCTATATGGG CTGCGCGGGA GGCGTGAACG CCGAGTTCAC TTTCTCCATT
GAGCGTGAAG CGATCCCTGC TGGTTATGTT GGCCGCCAAC TGATCTTAAA GGGTTTGAAA
GGCGGTCACT CAGGTTGTGA TATTCACACT GGCCGTGGTA ACGCTAACAA GCTGATGGCG
CGCTTTCTCG CAGGCCATGC GAAAGAATTA GATCTGCGCT TAGTCGAATT CCGTGGCGGT
AGTCTACGTA ATGCGATCCC GCGTGAAGCT TTTGTCACCG TCGCCTTGCC AGAACAGCAC
GTAGCCGAAT TAGAAACCTT ATTCCACCGC TACACTGAGC TACTCAAAGC TGAACTGGGT
AAGGTTGAAA CTCACTTGGT AACTTTCCTT GAAGCCAAAG AACTGCAAAG TGAAGTGCTG
ACCGCGCACA CTCAACAACG TTTTGTTGCC GCTCTGAACA CGTGTCCAAA CGGTGTGATC
CGCATGAGCG ATGATATTGC AGGTGTTGTA GAAACCTCAC TCAACGTGGG AGTGATCACC
ACAGAAGCCA ACAAAATCAA AGTGCTGTGC TTGATTCGCT CCCTAATGGA CTCAGGCCGC
CACCAAGTCG AGGGCATGTT GCAATCGCTG GCACAACTTG CGGGGGCAGA GCTGGACCTT
TCTGGTGCTT ACCCTGGCTG GAAACCCGAT GCTGATTCTG AAATCATGCA TATTTTCCGT
GATATGTATG AAGGCATTTA TGGCCACAAA CCGAATATCA TGGTGATCCA CGCGGGTCTT
GAGTGTGGGC TGTTCAAAAA ACCCTATCCA AACATGGATA TGGTCTCTTT CGGTCCAACC
ATCAAGTTCC CACATTCACC GGATGAAAAA GTGAAGATAG ACACGGTTGA TCTGTTCTGG
CAACAGATGG TGGCACTACT CGCCAATATC CCAGTGAAAG CCTAA
 
Protein sequence
MTYTLPLHSD MLCNESTYLL QANSLPRPSS GERIHGKNYS KESSVSEFQT EISKLSSNPI 
WPFFATICSI PHPSKHEEAL AQYIINWAKE QGLAVRRDET GNVFIKKPAT PGMENRKGVV
LQAHIDMVPQ KNEDTVHDFT KDPIQPYIDG EWVTAKGTTL GADNGIGMAS CLAVLASKEI
QHGPIEVLLT IDEEAGMTGA FGLKEGWLEG DILLNTDSEQ EGEVYMGCAG GVNAEFTFSI
EREAIPAGYV GRQLILKGLK GGHSGCDIHT GRGNANKLMA RFLAGHAKEL DLRLVEFRGG
SLRNAIPREA FVTVALPEQH VAELETLFHR YTELLKAELG KVETHLVTFL EAKELQSEVL
TAHTQQRFVA ALNTCPNGVI RMSDDIAGVV ETSLNVGVIT TEANKIKVLC LIRSLMDSGR
HQVEGMLQSL AQLAGAELDL SGAYPGWKPD ADSEIMHIFR DMYEGIYGHK PNIMVIHAGL
ECGLFKKPYP NMDMVSFGPT IKFPHSPDEK VKIDTVDLFW QQMVALLANI PVKA