Gene VC0395_1007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_1007 
SymbolhylB 
ID5133982 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009456 
Strand
Start bp984345 
End bp985991 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content49% 
IMG OID640531329 
Producthemolysin secretion protein HylB 
Protein accessionYP_001215843 
Protein GI147671663 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value0.702356 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCATCA ATAAATTTTC CCTTAAATGG ATGTTGGCTA TTGCCGTCGC CATCCCTGCG 
ATAGCACTGT TGTTTGTGGC TTTCACCAGT CTAAACACCA TGTCAGTGAT GCAAGCGCAG
TCCAACAGCT TGTATGCCAA CACGGCTGCA CCAATGCGTG CCATGGCTGA AGCAACCTCA
CGTATTCCTC GGATGCGTGT CGGTATCGAT ATGATGCTAC TGCAAGAAAC GGCGCTCAAA
GATGCGAAAG GGGTCCTCAA ACGAGTCGAA GAGGCAAGAA CCGAAGATAT CCCAGAAATG
CGTCAAGCAA TGCAAGTTGC GGTTGATTCT CAGGTTAATC CGGAACTCAA AGAGCAGGCA
CGCAAACTTC AAGCTCGTTT TGAACAAATG GTACGTGAAG AGTTAGAGCC TATGCTGCAA
GCCTTCGCCA ATAACGATAT GACCACGGCA CAAAACATTT ACCGCGATAA ATACGCGCCG
ACCTATGGTG AAATGCGTAA ACAAGCCAAC CAGATCCTCG ATACGCTTTT GCAGCAAGCG
GAGCAGCAAA ACCATGCCAG TGTGGAAAGC TTCGAAGCAG GACGCACCAA GCAAATGGTG
ATCATTGCAG CAGGCTTGAT CATTTCATTC ATCACTTCAC TGGTTATCAT AACGAACTTA
CGTAGCCGAG TGGCTTACCT GAAAGATCGT ATGAGTTCTG CGGCGGCGAA TCTTTCACTG
CGTACTCGAT TGGAGTTGGA TGGTAACGAT GAACTGTGTG ACATCGGTAA AAGCTTCAAT
GCGTTCATTG ATAAAGTGCA TCACTCGATT GAAGAAGTGG CAGAAAACTC AAAAGAGCTG
GCGACGATGG CCTCTAGTGT GTCGCAGCGC GCGCACATGA CGCAATCTAA CTGTGCTTCG
CAGCGAGATA GAACAGTGCA AGTTGCGACG GCGATTCATG AGCTTGGTGC CACCGTATCC
GAAATCGCTT CCAATGCGGC CATGGCTGCC GATGTCGCGA AGCAAGCGAC GCTGCATTCT
GGTGAAGGGA AAAAAGTGGT AGGCGAAGTG CAAAATCGGA TCCAAACACT GGTCAATGAA
CTCGATAATG CCACTCAAGT TGTCTCATCA CTGGCGACCC AAATTAACGG TATTAGCTCA
ACACTTGATA CCATTCGCAG TATTTCTGAG CAAACGAACC TATTGGCGCT CAACGCTGCG
ATTGAAGCTG CGCGAGCGGG TGAACAAGGT CGTGGTTTTG CGGTGGTGGC GGATGAAGTT
CGCACATTAG CAAGTCGTTC AGCGGCATCG ACGGAAGAGA TCCAGCAAGT CATTAATCGC
CTTCAAACGG AGTCAACTCG CGCAGTAGAA GCAATGGAAA AAGGTCGCTC GCAAAGTGAT
GTGGTGGTTG AGTTTTCCGC TAAAGCGAAC CAATCTCTCA CAGAGATCAA CAGCCAAATT
GATCAGATTA ATGATCAAAA TATTCAAGTT GCGACCGCGA CAGAGGAACA ATCAACCGTG
GTGGAAGACA TTAATCGCAA CGTTGAAGAC ATCAACCAAC TGACGACAGA AACCTCGCAT
GTTGCGGATG AGTTAAGCCG AGCCAGTGCA AGCTTGCAAC GTCTCTCTTC GCAACTGGAT
AAACTGGTGG GCAGTTTTGA ACTTTAA
 
Protein sequence
MIINKFSLKW MLAIAVAIPA IALLFVAFTS LNTMSVMQAQ SNSLYANTAA PMRAMAEATS 
RIPRMRVGID MMLLQETALK DAKGVLKRVE EARTEDIPEM RQAMQVAVDS QVNPELKEQA
RKLQARFEQM VREELEPMLQ AFANNDMTTA QNIYRDKYAP TYGEMRKQAN QILDTLLQQA
EQQNHASVES FEAGRTKQMV IIAAGLIISF ITSLVIITNL RSRVAYLKDR MSSAAANLSL
RTRLELDGND ELCDIGKSFN AFIDKVHHSI EEVAENSKEL ATMASSVSQR AHMTQSNCAS
QRDRTVQVAT AIHELGATVS EIASNAAMAA DVAKQATLHS GEGKKVVGEV QNRIQTLVNE
LDNATQVVSS LATQINGISS TLDTIRSISE QTNLLALNAA IEAARAGEQG RGFAVVADEV
RTLASRSAAS TEEIQQVINR LQTESTRAVE AMEKGRSQSD VVVEFSAKAN QSLTEINSQI
DQINDQNIQV ATATEEQSTV VEDINRNVED INQLTTETSH VADELSRASA SLQRLSSQLD
KLVGSFEL