Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A1061 |
Symbol | zot |
ID | 5136175 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | - |
Start bp | 1116043 |
End bp | 1117242 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640532519 |
Product | zona occludens toxin |
Protein accession | YP_001217007 |
Protein GI | 147674144 |
COG category | [R] General function prediction only |
COG ID | [COG4128] Zonula occludens toxin |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTATCT TTATTCATCA CGGCGCGCCA GGCTCTTATA AAACGTCCGG GGCATTATGG CTTCGTCTGC TGCCGGCGAT TAAGTCAGGC CGTCACATCA TCACGAATGT GCGAGGCTTA AACCTTGAAC GCATAGCTAA GTACTTAAAA ATGGACGTCT CAGACATCAG TATCGAGTTT ATTGATACAG ACCATCCAGA CGGTCGCTTA ACGATGGCGC GTTTTTGGCA CTGGGCGAGA AAGGACGCGT TTCTCTTTAT TGATGAATGT GGTCGCATCT GGCCGCCGAG ACTGACGGCC ACCAATTTAA AGGCGCTCGA CACGCCGCCG GATTTGGTCG CAGAGGATAG GCCTGAGAGC TTTGAGGTGG CTTTTGACAT GCATCGTCAC CACGGCTGGG ATATCTGCCT AACCACGCCT AACATTGCCA AAGTGCACAA CATGATAAGA GAGGCGGCGG AGATAGGGTA TCGCCACTTT AACCGCGCCA CGGTGGGGCT AGGGGCAAAG TTTACCCTGA CCACCCACGA TGCAGCCAAC TCTGGACAGA TGGATTCGCA CGCGCTGACA CGCCAAGTCA AAAAAATTCC AAGTCCGATT TTTAAGATGT ACGCAAGCAC CACCACAGGC AAAGCACGCG ACACGATGGC CGGAACGGCG CTGTGGAAAG ACAGAAAGAT CCTTTTCTTG TTCGGCATGG TTTTTTTGAT GTTCTCTTAT TCGTTTTACG GCTTACACGA CAATCCAATT TTTACAGGGG GAAATGATGC AACTATCGAG TCAGAGCAAT CCGAGCCTCA GTCAAAGGCT ACTGCTGGGA ATGCTGTCGG GAGCAAGGCG GCTGCTCCTG CGTCTTTTGG TTTTTGTATT GGTCGGCTTT GTGTCCAAGA TGGTTTTGTC ACTGTTGGTG ATGAGCGTTA TCGCCTCGTA GACAATTTGG ACATTCCTTA TCGTGGTCTA TGGGCGACAG GTCATCACAT TTACAAGGAT ACGCTTACAG TGTTTTTTGA AACCGAGAGT GGCAGCGTCC CAACAGAGCT GTTTGCATCG AGCTACCGCT ACAAGGTGCT ACCGTTACCG GATTTCAATC ACTTTGTGGT GTTCGATACC TTTGCAGCGC AAGCGCTGTG GGTAGAAGTG AAACGGGGTT TACCGATAAA AACAGAAAAT GATAAAAAAG GACTAAATAG TATATTTTGA
|
Protein sequence | MSIFIHHGAP GSYKTSGALW LRLLPAIKSG RHIITNVRGL NLERIAKYLK MDVSDISIEF IDTDHPDGRL TMARFWHWAR KDAFLFIDEC GRIWPPRLTA TNLKALDTPP DLVAEDRPES FEVAFDMHRH HGWDICLTTP NIAKVHNMIR EAAEIGYRHF NRATVGLGAK FTLTTHDAAN SGQMDSHALT RQVKKIPSPI FKMYASTTTG KARDTMAGTA LWKDRKILFL FGMVFLMFSY SFYGLHDNPI FTGGNDATIE SEQSEPQSKA TAGNAVGSKA AAPASFGFCI GRLCVQDGFV TVGDERYRLV DNLDIPYRGL WATGHHIYKD TLTVFFETES GSVPTELFAS SYRYKVLPLP DFNHFVVFDT FAAQALWVEV KRGLPIKTEN DKKGLNSIF
|
| |