Gene VEA_000142 details


Gene Information

Locus tag: VEA_000142
Symbol:
ID: 8558447
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio sp. Ex25
Kingdom: Bacteria
Replicon accession: NC_013457
Strand:
Start bp: 155892
End bp: 157277
Gene Length: 1386 bp
Protein Length: 461 aa
Translation table: 11
GC content: 49%
IMG OID: 646407807
Product: zona occludens toxin
Protein accession: YP_003287295
Protein GI: 262395442
COG category: [R] General function prediction only
COG ID: [COG4128] Zonula occludens toxin
TIGRFAM ID:
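
The start, end, gene length and protein length fields above agree with one another, and a minimal sketch can make the arithmetic explicit. It assumes that coordinates are 1-based and inclusive, and that the gene length counts the stop codon while the protein length does not. The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(155892, 157277)
print(length)                     # 1386
print(protein_length_aa(length))  # 461
```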


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.976786
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTACTT CATTTCGATA CGGTCACGGT GGCTCTTACA AATCGGCTTG CGCCGTGTGG 
TTTGACTTAC TGCCTGCACT GCGTGAAGGT CGAATTTGCA TTACGAACAT TCATGGTATG
CAGCCACTTG AAGTGATTGA ACAACGCCTT GGTGAGAAGT TCCCTGATAC CGCTCGGCTC
ATTCGCATTA GCTCTCGCAA CCCTGAAGGC TTCGAGCTTT GGAAATACTT CTTCTGTTGG
GCACCCATTG GGGCGTTCAT CCTCATTGAT GAGTGTCAGC AAATCTTCTC GGTCAATGCA
GGTTTCAAAA TGGCGAACAT ACACAAGCGC CCTTTCACTG ACTTTGAGCC TCACTTACCG
GAAGGATTCT CTGAGCTGTT TCACTCTCGT TGGCTAACGA TTGATACATC CAGTTTGGAC
AATGGCGAGA TAGACGATTG CCAACGCACA CGTTTTGATG AGCAAGGACG CATCATCTAT
CCAGAGAACT TTAACAACGC CTTTATGGAG CACCGGCATT ATAACTGGGA CATTGTGTTG
CTCACGCCTG ACTTTGCTCA AATCCCGAAA GAGTTAAAAG GTGTCGCGGA GTTGGCCAAG
CAACATAAGG GGAAAGATGG GATCTTCTTT TCCAACCGAA AACCGCGCAT CTTGGAGCAT
GACCCAACTC GAACGGTCAC CAAACCAAGC AAAGATGATG TGGTTTATAA CCTCAAGGTG
CCGCTTGATG TCCACCTACT CTACGCCTCG ACTGTCACGG GGCAAATCAC CAAATCGGGA
CTTGGGAAGA ACATCTTTCT TAACCCGAAA TTCTTAGCAG CTATGGCACT GGTCGTGCTT
TCATTTGGGT ACTTAGTTTA TGCGCTTATT GGTATGGTTT CTGATTCTGA GACGACAACT
GCGGAAGGAA CGCAGCTTCA TCAAACTTCG CAGCAAAGTG GCGTTTCGAC TTCGCAAGGT
CAAGCACGTC CTGGTCAAAG TGGTTCGCCT GGTTCTGTCA TGGGTTCTAG TGGTTCTGGC
TGTACGGGTT CTGGTTGCGG GAATGAGTCT TATCATGACG TAGGTACCGT TCCGGCTTGG
TTCCCACTGG CGAACTCAGA GAGTATCTAT GTCTCTGCGG TGGAACGTTG GCACAAAGCC
ACCTCGATAC ACGTCAACGT GCATTTTGAG GTTGTCACAC CGCGTGGTGT GACTTACCTC
GATGACGGAT TCCTAAACAA GTTGGGCGTC AAGATGGAAT ATCTGGACGA TTGCCTCGTC
CAGCTGTCTC GCGGTGCATC CAACTTCTAT GTCACGTGTT CGCCGTATGA GCAATATGCA
CAACGGCAAG AGCAAGATAT TGAACTAAAA CCCGTTGGCG GTTTGTTTAG TGGAGACGAA
ACCTAA
 
Protein sequence
MATSFRYGHG GSYKSACAVW FDLLPALREG RICITNIHGM QPLEVIEQRL GEKFPDTARL 
IRISSRNPEG FELWKYFFCW APIGAFILID ECQQIFSVNA GFKMANIHKR PFTDFEPHLP
EGFSELFHSR WLTIDTSSLD NGEIDDCQRT RFDEQGRIIY PENFNNAFME HRHYNWDIVL
LTPDFAQIPK ELKGVAELAK QHKGKDGIFF SNRKPRILEH DPTRTVTKPS KDDVVYNLKV
PLDVHLLYAS TVTGQITKSG LGKNIFLNPK FLAAMALVVL SFGYLVYALI GMVSDSETTT
AEGTQLHQTS QQSGVSTSQG QARPGQSGSP GSVMGSSGSG CTGSGCGNES YHDVGTVPAW
FPLANSESIY VSAVERWHKA TSIHVNVHFE VVTPRGVTYL DDGFLNKLGV KMEYLDDCLV
QLSRGASNFY VTCSPYEQYA QRQEQDIELK PVGGLFSGDE T
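
Both sequences are printed in blocks of ten characters, so they need to be joined before any computation. The sketch below shows one way to strip the spacing, compute a GC fraction, and translate a coding sequence. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The helper names are hypothetical, unknown codons are rendered as "X" by choice, and the reported 49% GC content has not been recomputed here.

```python
# Codon table in the usual TCAG ordering; amino-acid assignments are
# shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display, and uppercase."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE.get(dna[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# The first 18 bases of the gene sequence give the first six residues.
print(translate("ATGGCTACTT CATTTCGA"))  # MATSFR
```

Passing the full gene sequence block to translate() should reproduce the protein sequence listed above, and gc_content() on the same text gives the fraction to compare against the GC content field.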