Gene VC0395_A1917 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A1917 
SymbollacZ 
ID5135797 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp2040732 
End bp2043866 
Gene Length3135 bp 
Protein Length1044 aa 
Translation table11 
GC content51% 
IMG OID640533374 
Productbeta-D-galactosidase 
Protein accessionYP_001217841 
Protein GI147673874 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACGCCG TAGAGCAAAG GCGTTATTGG CTTGTTGCTT CGCGTATTCA AGAGGATCTT 
ATGCGCAACT TCTCCGATAT TCTTCTTAGC CAAGATTGGC AAAACCCGCA CATCGTTAAA
TGGCACTGCC GTACACCCCA TGTTCCTTTG CACAGTTATC GCACTGAGCA GGAGGCTCGT
TTGGATGTTG GGGGGAATCG CCAATCTCTA AATGGTCAGT GGCGGTTTGC TCTGTTTGAG
AAGCCAGAAG CGGTTGAGCC TGCGGTGATA GACCCGGATT TCGATGATAG CGCTTGGGCG
CACATTCCTG TACCGAGTAA CTGGCAGATG CAAGGCTTTG ATAAGCCGAT TTACACCAAT
ATCCAATATC CATTTGCGGA TCGGCCGCCT TACGTGCCGC AAGATAATCC AACCGGCTGT
TATCGCCACC GTTTTACACT GGAAAAACAA GCGCTAACCG AGTCCATTCG CATTGTATTT
GATGGGGTCA ATTCGGCATT TCATCTGTGG TGCAATGGTC ATTGGGTCGG TTATTCGCAA
GATAGCCGCT TGCCTGCCGA GTTTGAGTTA ACCCCTTATC TACAAGAGGG TGAAAACCTG
TTGGTGGCCA TGGTGCTGCG CTGGTCTGAT GGCTCTTATT TGGAAGACCA AGATATGTGG
TGGCTGAGTG GCATCTTTCG CGATGTGTAT CTCTACCGCA AGCCGATACT CGCGATTGAA
GATTTTTTTA TCCGCACTGA ATTAGATGCG CTTTATCAAC ACGCTGAATT GCGAGTAGAA
ACACGCTTAA GCCAAGTGAC TCGCCATCAT CAAGTGCAAG TGGCTTTATT CGATGCACAA
GGTGAATGCG TGGCGCGTTC ACAAGCCTTA CATACAGGCC AGCGTGTAGT GGATGAAAAA
GGAGCATGGC ACGATAAAAC CGAACACAGT TTAGCGATTT GCTCTCCGAC ACTGTGGAGT
GATGAAGCGC CTTATCTTTA CCGCTGCGTG ATCTGTTTGC TTGATGAAGA TGGCGCGCCG
ATTGAGTTTG AAAGTGCAGC AGTGGGTTTT CGCAAAGTAG AAATCACTCA GGGACTACTG
AAGCTCAATG GTCAGCCCTT GTTGATCCGC GGGGTGAACC GTCATGAACA TCATCCCGAA
TTGGGGCATG TGATGGATGA AGCAAGCATG CGCCGCGATA TTGAATTGAT GAAACAGCAT
AATTTCAATG CGGTGCGTAC CGCCCATTAC CCCAATCATC CGCGTTGGTA CGAACTGTGT
GATGAGTACG GTTTGTATGT GGTGGATGAG GCCAATCTCG AAACCCACGG CCAATTTCCG
ATGAGCCGAC TTTCCAATGA TCCACAATGG GTGAATGCCT ATTTGCAGCG CATGATTGGC
ATGGTGGAGC GCGATAAAAA CCACCCTTGT GTGATCATTT GGTCGCTCGG CAATGAATCG
GGGATTGGTA CCAATCATCA CGCCATGTAT CAGTGGACGA AACAGCGCGA CCCATCGCGT
CCTGTGCAAT ACGAAGGGGG CGGCGCTAAT ACGGCGGCGA CCGATATTGT TTGCCCGATG
TATGCGCGGG TCGATCAGCA TCAGCCACAT CCTGCGGTTC CAAAATATGC GCTGAAAAAT
TGGATCAGTT TGCCGCAGGA AAACCGCCCC CTCATCTTGT GTGAATATGC TCATGCGATG
GGCAACAGCT TGGGCGCGTT TTATAAATAC TGGCAGGCGT TTCGTGAGTT TCCTCGTCTG
CAAGGTGGCT TTATTTGGGA TTGGGTCGAT CAGGGCATTT CCAAATGGGA TAGCGAGGGG
CGCCACTATT GGGGCTATGG CGGTGATTTT GGCGATACGA TTAACGATCG CCAATTCTGC
ATAAACGGTT TGCTGTTCCC AGATCGCACG CCGCATCCGG CATTACATGA AGTCAAAAAA
GTCCAGCAGC CGTACCAGTT TTCGTTGAGC TATCCCAAGC TCACCATTCA CAATGAGCGC
TTGTTTGCAG CGCTGCCGCT GGAGCTGGTA GTTAGTGTGC TATGCGATGG GCAAGAGATT
AAGCAAGAAC GTCTGCCGCT TGATATTGCG CCGCGCGGCA CAATCACGCT GGATTTAGCG
TCGCTGCCAA TGTTGCCAGA GCATGAATAC CACCTCAATG CAGTCTTATT GTGTCGTGAG
GATCAGCCAT GGTCTAACGC GGGGCACTGC ATCGCTAGTG AGCAGTGGTG TTTGCAGCCA
CGAAGAAGCA TGTTACCTAA AATCACACAC GCTCCGCTGC CTCAATGGCA GCAAGATGGA
GATAAGGTGC GCATCGAGGC GGCCAATCAG CAATGGCAGT TTAACCGCCA AACTGGGCTA
TTGGAGCAGT GGTGGCAAAA TGGTCAGCCC GTATTGAGTG AACCGCTGCG CGATAACTTT
TACCGCGCGG TGCTGGATAA CGATATTGGT ACTAGCGAAG CGCAGCATCT TGACCCGAAC
AGCTGGATCG CACGTTGGCA TGCGGCGGGC TTAGATAAGC TGCGTGTGGA ATGTGACGAT
CTTCGCGTCA CCACCTTGAA CGAGAGTGTC GAAGTGGTGA TCGATGTCGC CCATTACCAT
CAGCAAGCGT TAGCGCTTCG TACCCGTTGG CGTTACCAAA TCTTCGGTGA TGCGCGGGTA
GAACTGAATG TTGAGGTGAT GCTGTGTTCT GATTTACCGC CGCTGCCAAG AGTGGGGTTA
ACGCTCGCAT TACCAGTGGC AGAAAACCCA GTGTCTTGGT TTGGTCGCGG GCCGCATGAG
AATTATCCGG ATCGTTTGCA ATCGGCGCAT GTGGGGCGAT ACACCGCCAC GGTGGATGAG
CTGCATACAC CGTACATTTT CCCGAGCGAA AATGGTTTGC GTTGTGATAC TCGCCAGCTA
CAAGTGGGCG CTTTGGTGGT GGAAGGGCAT TTTCACTTCT CGCTCAGTCG CTACTCACAA
ACGATGTTGG ATAAAGCCAA ACACAGCAAC GAGTTGGTGG CGGGCGATAA GTGGTATCTC
AATCTGGATG CGCAGCATAT GGGCGTGGGC GGCGATGATT CGTGGAGCCA AAGTGTGCAC
CCTGAATTTT TGCTCACTCA GCCGCACTAT CAGTATCAGC TCACCTTACG TGTGAAAGCG
TCATCCCCAC AATAA
 
Protein sequence
MYAVEQRRYW LVASRIQEDL MRNFSDILLS QDWQNPHIVK WHCRTPHVPL HSYRTEQEAR 
LDVGGNRQSL NGQWRFALFE KPEAVEPAVI DPDFDDSAWA HIPVPSNWQM QGFDKPIYTN
IQYPFADRPP YVPQDNPTGC YRHRFTLEKQ ALTESIRIVF DGVNSAFHLW CNGHWVGYSQ
DSRLPAEFEL TPYLQEGENL LVAMVLRWSD GSYLEDQDMW WLSGIFRDVY LYRKPILAIE
DFFIRTELDA LYQHAELRVE TRLSQVTRHH QVQVALFDAQ GECVARSQAL HTGQRVVDEK
GAWHDKTEHS LAICSPTLWS DEAPYLYRCV ICLLDEDGAP IEFESAAVGF RKVEITQGLL
KLNGQPLLIR GVNRHEHHPE LGHVMDEASM RRDIELMKQH NFNAVRTAHY PNHPRWYELC
DEYGLYVVDE ANLETHGQFP MSRLSNDPQW VNAYLQRMIG MVERDKNHPC VIIWSLGNES
GIGTNHHAMY QWTKQRDPSR PVQYEGGGAN TAATDIVCPM YARVDQHQPH PAVPKYALKN
WISLPQENRP LILCEYAHAM GNSLGAFYKY WQAFREFPRL QGGFIWDWVD QGISKWDSEG
RHYWGYGGDF GDTINDRQFC INGLLFPDRT PHPALHEVKK VQQPYQFSLS YPKLTIHNER
LFAALPLELV VSVLCDGQEI KQERLPLDIA PRGTITLDLA SLPMLPEHEY HLNAVLLCRE
DQPWSNAGHC IASEQWCLQP RRSMLPKITH APLPQWQQDG DKVRIEAANQ QWQFNRQTGL
LEQWWQNGQP VLSEPLRDNF YRAVLDNDIG TSEAQHLDPN SWIARWHAAG LDKLRVECDD
LRVTTLNESV EVVIDVAHYH QQALALRTRW RYQIFGDARV ELNVEVMLCS DLPPLPRVGL
TLALPVAENP VSWFGRGPHE NYPDRLQSAH VGRYTATVDE LHTPYIFPSE NGLRCDTRQL
QVGALVVEGH FHFSLSRYSQ TMLDKAKHSN ELVAGDKWYL NLDAQHMGVG GDDSWSQSVH
PEFLLTQPHY QYQLTLRVKA SSPQ