Gene VEA_002658 details


Gene Information

Locus tag: VEA_002658
Symbol:
ID: 8556386
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio sp. Ex25
Kingdom: Bacteria
Replicon accession: NC_013456
Strand:
Start bp: 1173816
End bp: 1176914
Gene Length: 3099 bp
Protein Length: 1032 aa
Translation table: 11
GC content: 50%
IMG OID: 646405749
Product: beta-D-galactosidase alpha subunit
Protein accession: YP_003285285
Protein GI: 262393431
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
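
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes a terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1173816
END_BP = 1176914


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    # Compare with the "Gene Length" and "Protein Length" fields.
    print(n, "bp;", protein_length(n), "aa")
```

Under these assumptions the two results agree with the Gene Length (3099 bp) and Protein Length (1032 aa) fields.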


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.454065
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAACAATT GGGAAAACTT CCAACACTTA CACGAGAATC GTATGGCACC GCGTGCGTAC 
TTTTTCTCCT ACGACTCTGT TCAGAGCGCA CAAACCTTTC AACGTGAACT GAGTCGCCGC
TTTATGTTGC TCAGCGGCCA ATGGACATTC CGTTATTTCA CCAACCCAAT GTTGGTACCT
GATGAGTTTT ACTCACAAAC AATGAACGGC TGGGGACACA TTACCGTCCC AAACATGTGG
CAAATGGAAG GCCACGGCGA TCTGCAATAC ACCGATGAGG GCTTCCCATT CCCGATCGAC
GTCCCGTTTG TACCAACCGA CAACCCAACG GGCGCTTATC AACGCACATT TACACTTGGT
CCACAGTGGG GTAACCAACA AACCATCATC AAGTTCGACG GCGTGGAAAC CTACTTTGAA
GTGTACATAA ACGGCGAGTA CGTCGGCTTT AGTAAAGGCA GCCGACTCAC AGCGGAATTC
GACATTTCCA AGTTCGTCCA ACAAGGCGAA AACTTACTCT CTGTACGCGT GATGCAGTGG
GCCGATTCCA CTTACATTGA AGACCAAGAC ATGTGGTGGA CGGCGGGGAT ATTCCGTGAT
GTTTACCTAA TCGGAAAAGA GAACATTCAC GTCCAAGACC TAACAATTCG CACCGATTTC
GCCGACGATT ACCAAAGTGC GACGCTCAGT GCTCAAATCG AATTGGAAAA CCTTTCCACC
GCTATCGCGT CTGGCTATAC GCTGGAATAC GCGCTGCATG ACAAAGGCAC TGTCGTAGCC
AGTGGTCAAT GCGATTCACT GACTATCCAA AACCACCACT CAACAAGCTT TGCGATTGAT
ATGGTGAGCC CAACGCATTG GACGGCTGAA AATCCTTACC TTTACCACTT GTTCATTACC
TTAAAAGATC ACCAAGGCAA CGTAGTGGAA GTGATCCCAC AACGCGTGGG TTTCCGTGAT
ATCAAGGTGC GTGATGGCCT GTTTTACATC AACAATCAAT ACGTGATGCT GCATGGTGTC
AACCGTCATG ACAACGACCA CCTAAAAGGT CGCGCAGTGG GCATGGACCG AGTGGAAAAA
GATCTGATTT TGATGAAGCA ACACAACATC AACTCAGTTC GCACCGCACA CTACCCGAAC
GACCCACGCT TCTACGAGTT GTGTGATATT TACGGCCTGT TTGTGATGGC AGAAACCGAT
GTGGAAACCC ACGGCTTTGC CAACGTTGGC GACCTGAGCC GTATCACCAA TGATCCAGCA
TGGGAAGCGG TGTTTGTTGA TCGCGCAGTG CGCCATGTGC ACGCACAGAA AAACCACCCA
TCGATCATCA TGTGGTCACT CGGTAACGAA TCGGGTTATG GCTGTAACAT CCGAGCGATG
TATACCGCGA CCAAAGCAAT TGATAACACC CGTTTAGTTC ACTACGAAGA AGACCGCGAC
GCTGAAGTCG TCGATGTGAT TTCGACGATG TACTCACGCG CTCAACTGAT GAATTACTTT
GGCGAGCACC CACACGAGAA GCCACGCATT ATCTGCGAAT ACGCACACGC GATGGGCAAT
GGACCAGGTG GTTTAACCGA ATATCAAAAT GTGTTCTATG CGCACGACCA CATTCAAGGT
CACTACGTTT GGGAATGGTG TGATCACGGC ATTCTAGCGC GTGATGAACA CGGCCAAGAG
TTCTACAAAT ACGGCGGCGA TTACGGTGAC TACCCAAACA ACTACAACTT CTGTATGGAT
GGTCTGATTT ACCCAGATCA AACCCCGGGC CCAGGCCTGA AAGAGTACAA ACAAGTCATT
GCACCGGTGA AAATCCGCGC CGTTGAAGGT TGCCATGACC GCTTCATCGT TGAGAACAAA
TTGTGGTTCA CCAACCTCGA TGATTACACC ATTACTGCTG ACGTGCGTGC CGAAGGTGAA
ACCCTGCATA GCGTGCAATT CAAAGTCAAA GCTTTGGTCG CCAACAGCGA ACGTGAAGTG
ACCATCGACT TGCCAGAATT GGATGAACGC GAAGCCTTTG TTAACTTCAC CGTACGCAAA
GACAGTCGCA CGCTTTACAG CGAAGCCAAC CATGAAATTG CGGTGTATCA GTTCCAACTG
AAAGAAAACA CCGCGACGTT GCCTGCGCTG GTCAACCACA ATATTCAACC CTTGGTGCTT
GAAGAAAGCC GCCTAGAGCA CGTGATCACA GGTCATAACT TCGCGCTTAC TTTCTCGAAA
GTAAACGGCA AGCTGACTTC TTGGCGCGTA AATGGCGAAG AGATCATTCA ATCAGAGCCG
AGACTGAACT TCTTCAAACC GATGATTGAT AACCACAAAC AAGAGTATGA AGGTTTGTGG
CACCCAGCGC ATCTGCAAAT CATACAGGAG CACTTCCGCA CACTTGCTGT GGAAGCAACT
TATGATTCCG TCTTGATTAC GACCACCAGC ATCATTGCGC CGCCGGTCTT TGATTTTGGC
ATGCGTTGTA CCTATCGCTA TCAAATCAAT GCTCAAGGCC ATTTGAACGT GGAACTGAGT
GGCGAGCGCT ACGGCGACTA TCCTCACGTG ATTCCGGTCA TCGGCTTGGA TTTGGGCATC
AACGGCAGCT TTGATCAAGT GAGTTACTAC GGTCGCGGCC CTGAAGAGAA CTATCAAGAC
AGCCGCCAAG CTAACCTCAT TGATGTTTAC CACACCAATG TGGCGGACAT GTTCGAGAAC
TACCCGCTCC CGCAAAACAA CGGCAACCGC CAACACGTGC GCTGGGCCTC ACTCACCAAC
CGTCATGGCA CAGGGTTATT GGTGAAACCT CAGCAAGAAA TCAACTTCAG CGCGTGGTTC
TACACCAACC AAAATCTGCA TGAAGCGCAA CATACGATCG AGCTAGAAAA GAGTGGCTAC
ATCACCCTTA ACCTTGACCA TCAAGTGATG GGATTGGGCT CGAACTCATG GGGCAGCGAA
GTGTTCGACT CTTACCGCGT GTACATGGAC GAGTTCCGTT ACGGTTTGAC GCTGATGCCA
CTGCAAGCAG GCGATTGCAA CGCACAAGTG ATGGCAAACC ATGATTTCGA CAACGCGTTT
TTCACTCAAA CCAATACTCA ATCAGTCAAC GAGGCGTAA
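
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any computation such as GC content. The following is a minimal sketch of one way to do that. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the rounding used for the "GC content" field on this page is not stated, so the result may differ slightly from the listed percentage.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First two blocks of the gene sequence, used here as a small sample.
    sample = "GTGAACAATT GGGAAAACTT"
    seq = clean_sequence(sample)
    print(len(seq), gc_fraction(seq))
```

To check the whole gene, paste all sequence lines into one string and pass it through the same two functions.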
 
Protein sequence
MNNWENFQHL HENRMAPRAY FFSYDSVQSA QTFQRELSRR FMLLSGQWTF RYFTNPMLVP 
DEFYSQTMNG WGHITVPNMW QMEGHGDLQY TDEGFPFPID VPFVPTDNPT GAYQRTFTLG
PQWGNQQTII KFDGVETYFE VYINGEYVGF SKGSRLTAEF DISKFVQQGE NLLSVRVMQW
ADSTYIEDQD MWWTAGIFRD VYLIGKENIH VQDLTIRTDF ADDYQSATLS AQIELENLST
AIASGYTLEY ALHDKGTVVA SGQCDSLTIQ NHHSTSFAID MVSPTHWTAE NPYLYHLFIT
LKDHQGNVVE VIPQRVGFRD IKVRDGLFYI NNQYVMLHGV NRHDNDHLKG RAVGMDRVEK
DLILMKQHNI NSVRTAHYPN DPRFYELCDI YGLFVMAETD VETHGFANVG DLSRITNDPA
WEAVFVDRAV RHVHAQKNHP SIIMWSLGNE SGYGCNIRAM YTATKAIDNT RLVHYEEDRD
AEVVDVISTM YSRAQLMNYF GEHPHEKPRI ICEYAHAMGN GPGGLTEYQN VFYAHDHIQG
HYVWEWCDHG ILARDEHGQE FYKYGGDYGD YPNNYNFCMD GLIYPDQTPG PGLKEYKQVI
APVKIRAVEG CHDRFIVENK LWFTNLDDYT ITADVRAEGE TLHSVQFKVK ALVANSEREV
TIDLPELDER EAFVNFTVRK DSRTLYSEAN HEIAVYQFQL KENTATLPAL VNHNIQPLVL
EESRLEHVIT GHNFALTFSK VNGKLTSWRV NGEEIIQSEP RLNFFKPMID NHKQEYEGLW
HPAHLQIIQE HFRTLAVEAT YDSVLITTTS IIAPPVFDFG MRCTYRYQIN AQGHLNVELS
GERYGDYPHV IPVIGLDLGI NGSFDQVSYY GRGPEENYQD SRQANLIDVY HTNVADMFEN
YPLPQNNGNR QHVRWASLTN RHGTGLLVKP QQEINFSAWF YTNQNLHEAQ HTIELEKSGY
ITLNLDHQVM GLGSNSWGSE VFDSYRVYMD EFRYGLTLMP LQAGDCNAQV MANHDFDNAF
FTQTNTQSVN EA
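
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is an illustration under stated assumptions, not the method used to produce this page: it assumes translation table 11 assigns amino acids to codons as the standard genetic code does, and that an alternative start codon such as GTG in the first position is read as methionine. `START_CODONS` is a deliberately small, illustrative set, and `translate` is a hypothetical helper.

```python
from itertools import product

# Standard codon assignments in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a few common bacterial start codons, not the full table 11 list.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a CDS up to, but not including, the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiating codon is read as methionine
        else:
            aa = CODON_TABLE[codon]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First thirty bases of the gene sequence.
    print(translate("GTGAACAATT GGGAAAACTT CCAACACTTA"))
```

Translating the first thirty bases of the gene gives MNNWENFQHL, which matches the first block of the protein sequence.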