Gene Information |
Locus tag | VEA_002658 |
Symbol | |
ID | 8556386 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio sp. Ex25 |
Kingdom | Bacteria |
Replicon accession | NC_013456 |
Strand | - |
Start bp | 1173816 |
End bp | 1176914 |
Gene Length | 3099 bp |
Protein Length | 1032 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 646405749 |
Product | beta-D-galactosidase alpha subunit |
Protein accession | YP_003285285 |
Protein GI | 262393431 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
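The coordinate and length fields in the table above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon; the function names are hypothetical and are not part of the database.

```python
# Values copied from the Gene Information table.
START_BP = 1173816
END_BP = 1176914
GENE_LENGTH_BP = 3099
PROTEIN_LENGTH_AA = 1032


def span_length(start, end):
    """Length of an inclusive, 1-based coordinate interval."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

With the values in this record, 1176914 - 1173816 + 1 gives 3099 bp, and 3099 / 3 - 1 gives 1032 aa.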
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.454065 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAACAATT GGGAAAACTT CCAACACTTA CACGAGAATC GTATGGCACC GCGTGCGTAC TTTTTCTCCT ACGACTCTGT TCAGAGCGCA CAAACCTTTC AACGTGAACT GAGTCGCCGC TTTATGTTGC TCAGCGGCCA ATGGACATTC CGTTATTTCA CCAACCCAAT GTTGGTACCT GATGAGTTTT ACTCACAAAC AATGAACGGC TGGGGACACA TTACCGTCCC AAACATGTGG CAAATGGAAG GCCACGGCGA TCTGCAATAC ACCGATGAGG GCTTCCCATT CCCGATCGAC GTCCCGTTTG TACCAACCGA CAACCCAACG GGCGCTTATC AACGCACATT TACACTTGGT CCACAGTGGG GTAACCAACA AACCATCATC AAGTTCGACG GCGTGGAAAC CTACTTTGAA GTGTACATAA ACGGCGAGTA CGTCGGCTTT AGTAAAGGCA GCCGACTCAC AGCGGAATTC GACATTTCCA AGTTCGTCCA ACAAGGCGAA AACTTACTCT CTGTACGCGT GATGCAGTGG GCCGATTCCA CTTACATTGA AGACCAAGAC ATGTGGTGGA CGGCGGGGAT ATTCCGTGAT GTTTACCTAA TCGGAAAAGA GAACATTCAC GTCCAAGACC TAACAATTCG CACCGATTTC GCCGACGATT ACCAAAGTGC GACGCTCAGT GCTCAAATCG AATTGGAAAA CCTTTCCACC GCTATCGCGT CTGGCTATAC GCTGGAATAC GCGCTGCATG ACAAAGGCAC TGTCGTAGCC AGTGGTCAAT GCGATTCACT GACTATCCAA AACCACCACT CAACAAGCTT TGCGATTGAT ATGGTGAGCC CAACGCATTG GACGGCTGAA AATCCTTACC TTTACCACTT GTTCATTACC TTAAAAGATC ACCAAGGCAA CGTAGTGGAA GTGATCCCAC AACGCGTGGG TTTCCGTGAT ATCAAGGTGC GTGATGGCCT GTTTTACATC AACAATCAAT ACGTGATGCT GCATGGTGTC AACCGTCATG ACAACGACCA CCTAAAAGGT CGCGCAGTGG GCATGGACCG AGTGGAAAAA GATCTGATTT TGATGAAGCA ACACAACATC AACTCAGTTC GCACCGCACA CTACCCGAAC GACCCACGCT TCTACGAGTT GTGTGATATT TACGGCCTGT TTGTGATGGC AGAAACCGAT GTGGAAACCC ACGGCTTTGC CAACGTTGGC GACCTGAGCC GTATCACCAA TGATCCAGCA TGGGAAGCGG TGTTTGTTGA TCGCGCAGTG CGCCATGTGC ACGCACAGAA AAACCACCCA TCGATCATCA TGTGGTCACT CGGTAACGAA TCGGGTTATG GCTGTAACAT CCGAGCGATG TATACCGCGA CCAAAGCAAT TGATAACACC CGTTTAGTTC ACTACGAAGA AGACCGCGAC GCTGAAGTCG TCGATGTGAT TTCGACGATG TACTCACGCG CTCAACTGAT GAATTACTTT GGCGAGCACC CACACGAGAA GCCACGCATT ATCTGCGAAT ACGCACACGC GATGGGCAAT GGACCAGGTG GTTTAACCGA ATATCAAAAT GTGTTCTATG CGCACGACCA CATTCAAGGT CACTACGTTT GGGAATGGTG TGATCACGGC ATTCTAGCGC GTGATGAACA CGGCCAAGAG TTCTACAAAT ACGGCGGCGA TTACGGTGAC TACCCAAACA ACTACAACTT CTGTATGGAT GGTCTGATTT ACCCAGATCA AACCCCGGGC CCAGGCCTGA AAGAGTACAA ACAAGTCATT 
GCACCGGTGA AAATCCGCGC CGTTGAAGGT TGCCATGACC GCTTCATCGT TGAGAACAAA TTGTGGTTCA CCAACCTCGA TGATTACACC ATTACTGCTG ACGTGCGTGC CGAAGGTGAA ACCCTGCATA GCGTGCAATT CAAAGTCAAA GCTTTGGTCG CCAACAGCGA ACGTGAAGTG ACCATCGACT TGCCAGAATT GGATGAACGC GAAGCCTTTG TTAACTTCAC CGTACGCAAA GACAGTCGCA CGCTTTACAG CGAAGCCAAC CATGAAATTG CGGTGTATCA GTTCCAACTG AAAGAAAACA CCGCGACGTT GCCTGCGCTG GTCAACCACA ATATTCAACC CTTGGTGCTT GAAGAAAGCC GCCTAGAGCA CGTGATCACA GGTCATAACT TCGCGCTTAC TTTCTCGAAA GTAAACGGCA AGCTGACTTC TTGGCGCGTA AATGGCGAAG AGATCATTCA ATCAGAGCCG AGACTGAACT TCTTCAAACC GATGATTGAT AACCACAAAC AAGAGTATGA AGGTTTGTGG CACCCAGCGC ATCTGCAAAT CATACAGGAG CACTTCCGCA CACTTGCTGT GGAAGCAACT TATGATTCCG TCTTGATTAC GACCACCAGC ATCATTGCGC CGCCGGTCTT TGATTTTGGC ATGCGTTGTA CCTATCGCTA TCAAATCAAT GCTCAAGGCC ATTTGAACGT GGAACTGAGT GGCGAGCGCT ACGGCGACTA TCCTCACGTG ATTCCGGTCA TCGGCTTGGA TTTGGGCATC AACGGCAGCT TTGATCAAGT GAGTTACTAC GGTCGCGGCC CTGAAGAGAA CTATCAAGAC AGCCGCCAAG CTAACCTCAT TGATGTTTAC CACACCAATG TGGCGGACAT GTTCGAGAAC TACCCGCTCC CGCAAAACAA CGGCAACCGC CAACACGTGC GCTGGGCCTC ACTCACCAAC CGTCATGGCA CAGGGTTATT GGTGAAACCT CAGCAAGAAA TCAACTTCAG CGCGTGGTTC TACACCAACC AAAATCTGCA TGAAGCGCAA CATACGATCG AGCTAGAAAA GAGTGGCTAC ATCACCCTTA ACCTTGACCA TCAAGTGATG GGATTGGGCT CGAACTCATG GGGCAGCGAA GTGTTCGACT CTTACCGCGT GTACATGGAC GAGTTCCGTT ACGGTTTGAC GCTGATGCCA CTGCAAGCAG GCGATTGCAA CGCACAAGTG ATGGCAAACC ATGATTTCGA CAACGCGTTT TTCACTCAAA CCAATACTCA ATCAGTCAAC GAGGCGTAA
|
Protein sequence | MNNWENFQHL HENRMAPRAY FFSYDSVQSA QTFQRELSRR FMLLSGQWTF RYFTNPMLVP DEFYSQTMNG WGHITVPNMW QMEGHGDLQY TDEGFPFPID VPFVPTDNPT GAYQRTFTLG PQWGNQQTII KFDGVETYFE VYINGEYVGF SKGSRLTAEF DISKFVQQGE NLLSVRVMQW ADSTYIEDQD MWWTAGIFRD VYLIGKENIH VQDLTIRTDF ADDYQSATLS AQIELENLST AIASGYTLEY ALHDKGTVVA SGQCDSLTIQ NHHSTSFAID MVSPTHWTAE NPYLYHLFIT LKDHQGNVVE VIPQRVGFRD IKVRDGLFYI NNQYVMLHGV NRHDNDHLKG RAVGMDRVEK DLILMKQHNI NSVRTAHYPN DPRFYELCDI YGLFVMAETD VETHGFANVG DLSRITNDPA WEAVFVDRAV RHVHAQKNHP SIIMWSLGNE SGYGCNIRAM YTATKAIDNT RLVHYEEDRD AEVVDVISTM YSRAQLMNYF GEHPHEKPRI ICEYAHAMGN GPGGLTEYQN VFYAHDHIQG HYVWEWCDHG ILARDEHGQE FYKYGGDYGD YPNNYNFCMD GLIYPDQTPG PGLKEYKQVI APVKIRAVEG CHDRFIVENK LWFTNLDDYT ITADVRAEGE TLHSVQFKVK ALVANSEREV TIDLPELDER EAFVNFTVRK DSRTLYSEAN HEIAVYQFQL KENTATLPAL VNHNIQPLVL EESRLEHVIT GHNFALTFSK VNGKLTSWRV NGEEIIQSEP RLNFFKPMID NHKQEYEGLW HPAHLQIIQE HFRTLAVEAT YDSVLITTTS IIAPPVFDFG MRCTYRYQIN AQGHLNVELS GERYGDYPHV IPVIGLDLGI NGSFDQVSYY GRGPEENYQD SRQANLIDVY HTNVADMFEN YPLPQNNGNR QHVRWASLTN RHGTGLLVKP QQEINFSAWF YTNQNLHEAQ HTIELEKSGY ITLNLDHQVM GLGSNSWGSE VFDSYRVYMD EFRYGLTLMP LQAGDCNAQV MANHDFDNAF FTQTNTQSVN EA
|
| |
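The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before use. The following is a minimal sketch, not the database's own tooling: it strips the whitespace and translates a nucleotide string with the amino acid assignments of translation table 11, reading an allowed start codon (here GTG) as Met in the first position. The helper names `clean_sequence` and `translate` are hypothetical, and the example uses only the first and last few blocks of the gene sequence shown in this record.

```python
BASES = "TCAG"
# Amino acid assignments of NCBI translation table 11, in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Codons that table 11 allows as translation starts (read as Met at position 1).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def translate(text, first_is_start=True):
    """Translate complete codons until the first stop codon."""
    seq = clean_sequence(text)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and first_is_start and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    head = "GTGAACAATT GGGAAAACTT CCAACACTTA"  # first three blocks of the gene
    tail = "GAGGCGTAA"                          # last nine bases of the gene
    print(translate(head))                        # MNNWENFQHL
    print(translate(tail, first_is_start=False))  # EA
```

The two excerpts reproduce the first ten and last two residues of the protein sequence listed in this record. The gene sequence is already shown in coding orientation even though the gene lies on the minus strand, so no reverse complement is applied here.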