Gene VC0395_A0959 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A0959 
Symbol 
ID5136849 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp988401 
End bp989507 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content49% 
IMG OID640532417 
ProductM20A family peptidase 
Protein accessionYP_001216905 
Protein GI147673074 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2195] Di- and tripeptidases 
TIGRFAM ID[TIGR01883] peptidase T-like protein 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCTGA TTAATACCCA ACGCCTTGTT GACCACTTCC TGCAACTGAT TCAGATCGAT 
AGCGAATCGG GCAATGAAAA AAAGATTGCT GAAACGCTCG CGGAGCAACT GGGCGAACTT
GGTTTCACCG TACATAAACT CCCTGTTCCG GCGGAGGTGT CAAACGGCTT TAACCTGTAT
GCCCGCCTAG AAGGCACACT CAATGACAGT ATTCTGTTTA GTTGCCACAT GGATACGGTT
AAACCGGGTA TCGGCATTGA GCCTGTGATT GAAGATGGCA TTATCCGTTC CAAAGGCAAC
ACGATTTTGG GTGGCGATGA CAAATCTGGC ATTGCCGCGA TCCTTGAAGC GGTACGTGTT
CTGCGCGATA GCCAGCAAGC GCACAAAACC ATTGAGATTG CTTTCACTGT GCATGAAGAA
GGCGGTCTGA AAGGTTCTGA GCATTTTGAT ATGAGTAAGG TGCAAGCAGA GAAAGCGATT
GTTCTCGACA CAGGTGGCCC AATCGGCACT ATCGTGCGTG CAGCACCGGG TCAGCAAAAA
ATCGTCGCAC AGATCAAAGG TAAACCCGCT CATGCTGGTT TAGTACCGGA AGATGGAATC
AGCGCCATTG CGGTGGCCGC TGATGCAATT ACTCAAATGA AACTGCTGCG AATTGACGAA
GAAACCACGG CTAACATCGG TATTGTGCAA GGCGGTCAAG CGACGAACAT TGTGATGCCT
GAGCTGAAAA TCGTGGCGGA AGCGCGTTCA CTCAACGATG CCAAACTCGA AGCGCAAGTT
CAGCACATGA TCGAAACTTT TGAACGTGCC GCGGAAAAGC ATGACGCAAC CGTTGAGATT
GAATCGACTC GCGCTTACAA CGCCTTTAAG TTGGAAGAAG ACAACGCGCA TATCCAAGCG
ATCAAAGCGA GCTTTGAAAC AATCGGTATT GAGCCGAAAA CCAAGCTGAG TGGTGGTGGC
AGCGATGCCA ATAATTTCAA TGCGAAAGGG TTAACTACGG TAAACCTCTC AACCGGTATG
GCTAAAGTGC ATACTACTGA AGAGTACATC GCGATTGCGG ATATGGTGAA AATTGCCGAA
TTCGTCTGCG CTTACACTAC CGCCTAA
 
Protein sequence
MSLINTQRLV DHFLQLIQID SESGNEKKIA ETLAEQLGEL GFTVHKLPVP AEVSNGFNLY 
ARLEGTLNDS ILFSCHMDTV KPGIGIEPVI EDGIIRSKGN TILGGDDKSG IAAILEAVRV
LRDSQQAHKT IEIAFTVHEE GGLKGSEHFD MSKVQAEKAI VLDTGGPIGT IVRAAPGQQK
IVAQIKGKPA HAGLVPEDGI SAIAVAADAI TQMKLLRIDE ETTANIGIVQ GGQATNIVMP
ELKIVAEARS LNDAKLEAQV QHMIETFERA AEKHDATVEI ESTRAYNAFK LEEDNAHIQA
IKASFETIGI EPKTKLSGGG SDANNFNAKG LTTVNLSTGM AKVHTTEEYI AIADMVKIAE
FVCAYTTA