Gene VC0395_0511 details

Gene Information

Locus tag: VC0395_0511
Symbol: zot
ID: 5134487
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio cholerae O395
Kingdom: Bacteria
Replicon accession: NC_009456
Strand:
Start bp: 565053
End bp: 566252
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 49%
IMG OID: 640530833
Product: zona occludens toxin
Protein accession: YP_001215350
Protein GI: 147672022
COG category: [R] General function prediction only
COG ID: [COG4128] Zonula occludens toxin
TIGRFAM ID:
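
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that is consistent with the reported 1200 bp) and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return abs(end_bp - start_bp) + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(565053, 566252)
print(length, protein_length(length))
```

With the values from this record, the sketch gives 1200 bp and 399 aa, which agrees with the Gene Length and Protein Length fields.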


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTATCT TTATTCATCA CGGCGCGCCA GGCTCTTATA AAACGTCCGG GGCATTATGG 
CTTCGTCTGC TGCCGGCGAT TAAGTCAGGC CGTCACATCA TCACGAATGT GCGAGGCTTA
AACCTTGAAC GCATAGCTAA GTACTTAAAA ATGGACGTCT CAGACATCAG TATCGAGTTT
ATTGATACAG ACCATCCAGA CGGTCGCTTA ACGATGGCGC GTTTTTGGCA CTGGGCGAGA
AAGGACGCGT TTCTCTTTAT TGATGAATGT GGTCGCATCT GGCCGCCGAG ACTGACGGCC
ACCAATTTAA AGGCGCTCGA CACGCCGCCG GATTTGGTCG CAGAGGATAG GCCTGAGAGC
TTTGAGGTGG CTTTTGACAT GCATCGTCAC CACGGCTGGG ATATCTGCCT AACCACGCCT
AACATTGCCA AAGTGCACAA CATGATAAGA GAGGCGGCGG AGATAGGGTA TCGCCACTTT
AACCGCGCCA CGGTGGGGCT AGGGGCAAAG TTTACCCTGA CCACCCACGA TGCAGCCAAC
TCTGGACAGA TGGATTCGCA CGCGCTGACA CGCCAAGTCA AAAAAATTCC AAGTCCGATT
TTTAAGATGT ACGCAAGCAC CACCACAGGC AAAGCACGCG ACACGATGGC CGGAACGGCG
CTGTGGAAAG ACAGAAAGAT CCTTTTCTTG TTCGGCATGG TTTTTTTGAT GTTCTCTTAT
TCGTTTTACG GCTTACACGA CAATCCAATT TTTACAGGGG GAAATGATGC AACTATCGAG
TCAGAGCAAT CCGAGCCTCA GTCAAAGGCT ACTGCTGGGA ATGCTGTCGG GAGCAAGGCG
GCTGCTCCTG CGTCTTTTGG TTTTTGTATT GGTCGGCTTT GTGTCCAAGA TGGTTTTGTC
ACTGTTGGTG ATGAGCGTTA TCGCCTCGTA GACAATTTGG ACATTCCTTA TCGTGGTCTA
TGGGCGACAG GTCATCACAT TTACAAGGAT ACGCTTACAG TGTTTTTTGA AACCGAGAGT
GGCAGCGTCC CAACAGAGCT GTTTGCATCG AGCTACCGCT ACAAGGTGCT ACCGTTACCG
GATTTCAATC ACTTTGTGGT GTTCGATACC TTTGCAGCGC AAGCGCTGTG GGTAGAAGTG
AAACGGGGTT TACCGATAAA AACAGAAAAT GATAAAAAAG GACTAAATAG TATATTTTGA
 
Protein sequence
MSIFIHHGAP GSYKTSGALW LRLLPAIKSG RHIITNVRGL NLERIAKYLK MDVSDISIEF 
IDTDHPDGRL TMARFWHWAR KDAFLFIDEC GRIWPPRLTA TNLKALDTPP DLVAEDRPES
FEVAFDMHRH HGWDICLTTP NIAKVHNMIR EAAEIGYRHF NRATVGLGAK FTLTTHDAAN
SGQMDSHALT RQVKKIPSPI FKMYASTTTG KARDTMAGTA LWKDRKILFL FGMVFLMFSY
SFYGLHDNPI FTGGNDATIE SEQSEPQSKA TAGNAVGSKA AAPASFGFCI GRLCVQDGFV
TVGDERYRLV DNLDIPYRGL WATGHHIYKD TLTVFFETES GSVPTELFAS SYRYKVLPLP
DFNHFVVFDT FAAQALWVEV KRGLPIKTEN DKKGLNSIF
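
The gene and protein sequences above are printed in blocks of ten characters, so they need to be cleaned before any analysis. The following is a minimal sketch of how the GC content could be recomputed and the gene sequence translated for comparison with the listed protein. All names in it are hypothetical. It uses the standard codon assignments, which translation table 11 shares for amino acids (table 11 differs in which codons may act as start codons); this gene begins with ATG, so that difference does not matter here.

```python
# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence from this record.
first_line = "ATGAGTATCT TTATTCATCA CGGCGCGCCA GGCTCTTATA AAACGTCCGG GGCATTATGG"
print(translate(first_line))
```

For the first line of the gene sequence this prints MSIFIHHGAPGSYKTSGALW, which matches the first twenty residues of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.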