Gene VC0395_1008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_1008 
SymbolhlyA 
ID5134642 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009456 
Strand
Start bp986290 
End bp987792 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content49% 
IMG OID640531330 
Producthaemolysin 
Protein accessionYP_001215844 
Protein GI147671739 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGCAC GGGAGCCGGC TGATCAACTC GGTTATCGTC AGTTTGGAGC CAGTTATACG 
ACGTTAGATG CCTATTTCCG TGAGTGGTCA ACCGATGCGA TTGCCCAAGA TTATCGCTTC
GTGTTTAACG CATCGAACAA TAAAGCGCAG ATCCTGAAAA CCTTTCCTGT CGATAACATT
AACGAGAAAT TTGAGCGCAA AGAGGTTTCA GGTTTTGAGC TTGGGGTGAC TGGTGGGGTG
GAAGTCAGTG GAGATGGCCC GAAAGCCAAA CTAGAGGCGA GAGCAAGTTA TACCCAGAGT
CGCTGGTTAA CCTACAACAC ACAAGACTAT CGTATTGAGC GTAATGCGAA GAATGCGCAA
GCGGTTAGCT TTACATGGAA TCGTCAACAA TACGCGACAG CAGAATCGCT ACTCAATCGT
TCGACCGATG CTTTGTGGGT GAATACCTAC CCGGTAGATG TAAACCGTAT TAGCCCGCTG
AGCTACGCGA GTTTTGTGCC GAAAATGGAT GTGATTTATA AAGCCTCAGC CACAGAGACA
GGCAGTACGG ATTTTATCAT CGACTCTTCG GTCAATATCC GCCCAATCTA TAACGGTGCT
TATAAGCACT ACTATGTGGT CGGTGCTCAT CAGTCCTACC ATGGCTTTGA AGATACCCCA
CGTCGTCGAA TCACGAAATC GGCAAGCTTT ACGGTCGATT GGGATCACCC AGTATTCACG
GGTGGCCGCC CGGTCAACCT ACAACTTGCC AGCTTTAACA ACCGCTGTAT TCAAGTCGAT
GCTCAAGGTC GCTTGGCGGC CAATACGTGC GATAGCCAGC AATCAGCGCA ATCGTTCATC
TATGATCAGC TTGGTCGTTA TGTGAGTGCG AGTAACACCA AGCTCTGTCT TGATGGTGAG
GCATTAGACG CATTGCAACC CTGTAACCAA AACCTGACTC AGCGTTGGGA GTGGCGTAAA
GGCACAGATG AATTGACCAA TGTCTACAGC GGCGAGTCCC TTGGACATGA CAAACAAACC
GGTGAGCTTG GTTTGTATGC GAGCAGCAAC GATGCGGTAA GTTTACGTAC CATCACCGCT
TATACCGATG TGTTTAATGC GCAAGAAAGT TCGCCGATTC TGGGTTACAC ACAAGGGAAA
ATGAATCAGC AGCGTGTGGG ACAAGATCAT CGTTTGTATG TGCGAGCGGG TGCTGCCATT
GATGCATTAG GGTCCGCCTC CGATTTATTG GTTGGTGGCA ATGGTGGTAG CTTGAGTTCG
GTGGATCTGT CCGGCGTGAA ATCCATCACG GCAACCTCTG GTGATTTCCA ATATGGCGGT
CAGCAGTTGG TGGCGCTGAC ATTCACCTAC CAAGATGGAC GTCAGCAAAC GGTAGGCTCG
AAAGCGTATG TCACCAATGC TCATGAAGAC CGTTTCGATT TACCGGCTGC CGCTAAGATC
ACTCAACTGA AAATTTGGTC TGACGATTGG TTGGTGAAAG GGGTTCAATT TGATTTGAAC
TAA
 
Protein sequence
MTAREPADQL GYRQFGASYT TLDAYFREWS TDAIAQDYRF VFNASNNKAQ ILKTFPVDNI 
NEKFERKEVS GFELGVTGGV EVSGDGPKAK LEARASYTQS RWLTYNTQDY RIERNAKNAQ
AVSFTWNRQQ YATAESLLNR STDALWVNTY PVDVNRISPL SYASFVPKMD VIYKASATET
GSTDFIIDSS VNIRPIYNGA YKHYYVVGAH QSYHGFEDTP RRRITKSASF TVDWDHPVFT
GGRPVNLQLA SFNNRCIQVD AQGRLAANTC DSQQSAQSFI YDQLGRYVSA SNTKLCLDGE
ALDALQPCNQ NLTQRWEWRK GTDELTNVYS GESLGHDKQT GELGLYASSN DAVSLRTITA
YTDVFNAQES SPILGYTQGK MNQQRVGQDH RLYVRAGAAI DALGSASDLL VGGNGGSLSS
VDLSGVKSIT ATSGDFQYGG QQLVALTFTY QDGRQQTVGS KAYVTNAHED RFDLPAAAKI
TQLKIWSDDW LVKGVQFDLN