Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A1917 |
Symbol | lacZ |
ID | 5135797 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | - |
Start bp | 2040732 |
End bp | 2043866 |
Gene Length | 3135 bp |
Protein Length | 1044 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640533374 |
Product | beta-D-galactosidase |
Protein accession | YP_001217841 |
Protein GI | 147673874 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACGCCG TAGAGCAAAG GCGTTATTGG CTTGTTGCTT CGCGTATTCA AGAGGATCTT ATGCGCAACT TCTCCGATAT TCTTCTTAGC CAAGATTGGC AAAACCCGCA CATCGTTAAA TGGCACTGCC GTACACCCCA TGTTCCTTTG CACAGTTATC GCACTGAGCA GGAGGCTCGT TTGGATGTTG GGGGGAATCG CCAATCTCTA AATGGTCAGT GGCGGTTTGC TCTGTTTGAG AAGCCAGAAG CGGTTGAGCC TGCGGTGATA GACCCGGATT TCGATGATAG CGCTTGGGCG CACATTCCTG TACCGAGTAA CTGGCAGATG CAAGGCTTTG ATAAGCCGAT TTACACCAAT ATCCAATATC CATTTGCGGA TCGGCCGCCT TACGTGCCGC AAGATAATCC AACCGGCTGT TATCGCCACC GTTTTACACT GGAAAAACAA GCGCTAACCG AGTCCATTCG CATTGTATTT GATGGGGTCA ATTCGGCATT TCATCTGTGG TGCAATGGTC ATTGGGTCGG TTATTCGCAA GATAGCCGCT TGCCTGCCGA GTTTGAGTTA ACCCCTTATC TACAAGAGGG TGAAAACCTG TTGGTGGCCA TGGTGCTGCG CTGGTCTGAT GGCTCTTATT TGGAAGACCA AGATATGTGG TGGCTGAGTG GCATCTTTCG CGATGTGTAT CTCTACCGCA AGCCGATACT CGCGATTGAA GATTTTTTTA TCCGCACTGA ATTAGATGCG CTTTATCAAC ACGCTGAATT GCGAGTAGAA ACACGCTTAA GCCAAGTGAC TCGCCATCAT CAAGTGCAAG TGGCTTTATT CGATGCACAA GGTGAATGCG TGGCGCGTTC ACAAGCCTTA CATACAGGCC AGCGTGTAGT GGATGAAAAA GGAGCATGGC ACGATAAAAC CGAACACAGT TTAGCGATTT GCTCTCCGAC ACTGTGGAGT GATGAAGCGC CTTATCTTTA CCGCTGCGTG ATCTGTTTGC TTGATGAAGA TGGCGCGCCG ATTGAGTTTG AAAGTGCAGC AGTGGGTTTT CGCAAAGTAG AAATCACTCA GGGACTACTG AAGCTCAATG GTCAGCCCTT GTTGATCCGC GGGGTGAACC GTCATGAACA TCATCCCGAA TTGGGGCATG TGATGGATGA AGCAAGCATG CGCCGCGATA TTGAATTGAT GAAACAGCAT AATTTCAATG CGGTGCGTAC CGCCCATTAC CCCAATCATC CGCGTTGGTA CGAACTGTGT GATGAGTACG GTTTGTATGT GGTGGATGAG GCCAATCTCG AAACCCACGG CCAATTTCCG ATGAGCCGAC TTTCCAATGA TCCACAATGG GTGAATGCCT ATTTGCAGCG CATGATTGGC ATGGTGGAGC GCGATAAAAA CCACCCTTGT GTGATCATTT GGTCGCTCGG CAATGAATCG GGGATTGGTA CCAATCATCA CGCCATGTAT CAGTGGACGA AACAGCGCGA CCCATCGCGT CCTGTGCAAT ACGAAGGGGG CGGCGCTAAT ACGGCGGCGA CCGATATTGT TTGCCCGATG TATGCGCGGG TCGATCAGCA TCAGCCACAT CCTGCGGTTC CAAAATATGC GCTGAAAAAT TGGATCAGTT TGCCGCAGGA AAACCGCCCC CTCATCTTGT GTGAATATGC TCATGCGATG GGCAACAGCT TGGGCGCGTT TTATAAATAC TGGCAGGCGT TTCGTGAGTT TCCTCGTCTG CAAGGTGGCT TTATTTGGGA TTGGGTCGAT CAGGGCATTT CCAAATGGGA TAGCGAGGGG CGCCACTATT GGGGCTATGG CGGTGATTTT GGCGATACGA TTAACGATCG CCAATTCTGC ATAAACGGTT TGCTGTTCCC AGATCGCACG CCGCATCCGG CATTACATGA AGTCAAAAAA GTCCAGCAGC CGTACCAGTT TTCGTTGAGC TATCCCAAGC TCACCATTCA CAATGAGCGC TTGTTTGCAG CGCTGCCGCT GGAGCTGGTA GTTAGTGTGC TATGCGATGG GCAAGAGATT AAGCAAGAAC GTCTGCCGCT TGATATTGCG CCGCGCGGCA CAATCACGCT GGATTTAGCG TCGCTGCCAA TGTTGCCAGA GCATGAATAC CACCTCAATG CAGTCTTATT GTGTCGTGAG GATCAGCCAT GGTCTAACGC GGGGCACTGC ATCGCTAGTG AGCAGTGGTG TTTGCAGCCA CGAAGAAGCA TGTTACCTAA AATCACACAC GCTCCGCTGC CTCAATGGCA GCAAGATGGA GATAAGGTGC GCATCGAGGC GGCCAATCAG CAATGGCAGT TTAACCGCCA AACTGGGCTA TTGGAGCAGT GGTGGCAAAA TGGTCAGCCC GTATTGAGTG AACCGCTGCG CGATAACTTT TACCGCGCGG TGCTGGATAA CGATATTGGT ACTAGCGAAG CGCAGCATCT TGACCCGAAC AGCTGGATCG CACGTTGGCA TGCGGCGGGC TTAGATAAGC TGCGTGTGGA ATGTGACGAT CTTCGCGTCA CCACCTTGAA CGAGAGTGTC GAAGTGGTGA TCGATGTCGC CCATTACCAT CAGCAAGCGT TAGCGCTTCG TACCCGTTGG CGTTACCAAA TCTTCGGTGA TGCGCGGGTA GAACTGAATG TTGAGGTGAT GCTGTGTTCT GATTTACCGC CGCTGCCAAG AGTGGGGTTA ACGCTCGCAT TACCAGTGGC AGAAAACCCA GTGTCTTGGT TTGGTCGCGG GCCGCATGAG AATTATCCGG ATCGTTTGCA ATCGGCGCAT GTGGGGCGAT ACACCGCCAC GGTGGATGAG CTGCATACAC CGTACATTTT CCCGAGCGAA AATGGTTTGC GTTGTGATAC TCGCCAGCTA CAAGTGGGCG CTTTGGTGGT GGAAGGGCAT TTTCACTTCT CGCTCAGTCG CTACTCACAA ACGATGTTGG ATAAAGCCAA ACACAGCAAC GAGTTGGTGG CGGGCGATAA GTGGTATCTC AATCTGGATG CGCAGCATAT GGGCGTGGGC GGCGATGATT CGTGGAGCCA AAGTGTGCAC CCTGAATTTT TGCTCACTCA GCCGCACTAT CAGTATCAGC TCACCTTACG TGTGAAAGCG TCATCCCCAC AATAA
|
Protein sequence | MYAVEQRRYW LVASRIQEDL MRNFSDILLS QDWQNPHIVK WHCRTPHVPL HSYRTEQEAR LDVGGNRQSL NGQWRFALFE KPEAVEPAVI DPDFDDSAWA HIPVPSNWQM QGFDKPIYTN IQYPFADRPP YVPQDNPTGC YRHRFTLEKQ ALTESIRIVF DGVNSAFHLW CNGHWVGYSQ DSRLPAEFEL TPYLQEGENL LVAMVLRWSD GSYLEDQDMW WLSGIFRDVY LYRKPILAIE DFFIRTELDA LYQHAELRVE TRLSQVTRHH QVQVALFDAQ GECVARSQAL HTGQRVVDEK GAWHDKTEHS LAICSPTLWS DEAPYLYRCV ICLLDEDGAP IEFESAAVGF RKVEITQGLL KLNGQPLLIR GVNRHEHHPE LGHVMDEASM RRDIELMKQH NFNAVRTAHY PNHPRWYELC DEYGLYVVDE ANLETHGQFP MSRLSNDPQW VNAYLQRMIG MVERDKNHPC VIIWSLGNES GIGTNHHAMY QWTKQRDPSR PVQYEGGGAN TAATDIVCPM YARVDQHQPH PAVPKYALKN WISLPQENRP LILCEYAHAM GNSLGAFYKY WQAFREFPRL QGGFIWDWVD QGISKWDSEG RHYWGYGGDF GDTINDRQFC INGLLFPDRT PHPALHEVKK VQQPYQFSLS YPKLTIHNER LFAALPLELV VSVLCDGQEI KQERLPLDIA PRGTITLDLA SLPMLPEHEY HLNAVLLCRE DQPWSNAGHC IASEQWCLQP RRSMLPKITH APLPQWQQDG DKVRIEAANQ QWQFNRQTGL LEQWWQNGQP VLSEPLRDNF YRAVLDNDIG TSEAQHLDPN SWIARWHAAG LDKLRVECDD LRVTTLNESV EVVIDVAHYH QQALALRTRW RYQIFGDARV ELNVEVMLCS DLPPLPRVGL TLALPVAENP VSWFGRGPHE NYPDRLQSAH VGRYTATVDE LHTPYIFPSE NGLRCDTRQL QVGALVVEGH FHFSLSRYSQ TMLDKAKHSN ELVAGDKWYL NLDAQHMGVG GDDSWSQSVH PEFLLTQPHY QYQLTLRVKA SSPQ
|
| |