Gene VC0395_A0142 details

Gene Information

Locus tag: VC0395_A0142
Symbol: chb-1
ID: 5136454
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio cholerae O395
Kingdom: Bacteria
Replicon accession: NC_009457
Strand:
Start bp: 137659
End bp: 139572
Gene Length: 1914 bp
Protein Length: 637 aa
Translation table: 11
GC content: 50%
IMG OID: 640531602
Product: beta-N-acetylhexosaminidase
Protein accession: YP_001216107
Protein GI: 147674312
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3525] N-acetyl-beta-hexosaminidase
TIGRFAM ID:
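
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions fit the numbers in this record but are not stated by it. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
length = gene_length_bp(137659, 139572)
print(length)                     # 1914
print(protein_length_aa(length))  # 637
```

Both results match the listed Gene Length (1914 bp) and Protein Length (637 aa).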


Plasmid Coverage information

Num covering plasmid clones: 58
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTTATC GAATTGAATT TGCGGTGCTC TCGGAACAAA AACCGGATTG CCGTTTTGGT 
TTAACCCTGC ACAATTTGAG CGATCAAGAT CTGCATGATT GGTCGCTGTA TTTTGTGATT
GATCGTTACA TCCAACCCAT GAGTGTGACC AACGGTCAAC TGACCCAAGT CGGCAGCTTA
TGTTCGATTG TTCCAACGGA AAAAGTGTTG CAAGCTAATG GCCACTTTTA TTGTGAGTTC
ATCATCAAAA CCGCGCCTTA CCATTTCTAC ACCGATGGGG TAAAGCACGC GTTTGTCCAA
CTTAATGATA AACAGCCTGT TGAACGTATT AACGTCGCGG TTAACCCTAT CGTTCTCGCG
TCACCATTTC GCGAGCGTAG CCAGATCCCT GAAGTGACTG CGGCTGAGCT CTGTCTCATC
CCCAAACCTA ACTCACTGCA ACGTTTCCAA GGTGAGTTTG TGGTCAACCA CTCCAGCCAG
ATCTCGCTGC AATCGGACTC GGCCGCGCGT GCTGCACGCT GGTTAGAGCA AGAATTGCAT
GCACTGCATG AGTTCAAACT GAATACGGTT GGCCATAGCG ATATCGTCTA CCGCAGTAAT
CCCACGCTCG ATGAGGGCCA TTACCAACTC AATATCGAAG CGCAAGGGAT CAAGATTGAA
GCAGGCAGCC ACAGTGGCTT TATGCATGCC AGCGCGACTT TGCTGCAACT GGCGCAAGCG
CATCAAGGCT CATTGCGCTT TCCTCTGGTC AACATTGTCG ATGCACCGCG CTTTAAGTAT
CGCGGTATGA TGCTCGATTG CGCCCGCCAT TTTCACTCGC TTGAGCAGGT CAAACGAGTG
ATCAATCAAC TGGCACACTA CAAATTTAAC GTGTTCCACT GGCATCTGAC TGATGATGAA
GGTTGGCGTA TTGAGATTAA ACGCCTGCCG CAACTGACAG ACATTGGTGC ATGGCGTGGC
ATGGATGAAG TGCTGGAACC TCAGTACAGC TTACTCACCG AGCGTCATGG CGGTTTCTAT
ACCCAAGAGG AGATCCGTGC AGTGATTGAG TACGCCAGCG ATCGTGGCAT TACTGTCATC
CCTGAAATTG ACGTACCAGG GCACAGCCGC GCCGCGATTA AAGCGCTGCC GGAATGGCTG
GTCGATGAGG AAGATTGCTC GCAATATCGC AGTATTCAGT ACTACAACGA CAACGTGCTC
TCCCCTGCAC TGCCGGGCAC TTATCAATTC CTCGACATCG TATTAGAAGA AGTGGCTGCG
CTGTTTCCAA GCCAATTTAT TCATATCGGT GCCGATGAAG TTCCACACGG TGTGTGGGTA
GATAGCCCGA AATGCCAAGC CTTAATGCAA GAGCAAGGCT ATACCGACCC GAAAGAGCTG
CAAGGCCACT TACTGCGCTA CGCCGAGAAA AAACTCAAAA GCTTGGGTAA GCGTATGGTC
GGCTGGGAAG AAGCCCATCA CGGTGACAAA GTGAGTAAAG ATACGGTGAT TTACTCTTGG
TTATCGGAAA AAGCCGCCTT GGATTGCGCC AAACAAGGCT TTGACGTGAT TTTGCAGCCG
GGACAATTTA CCTATCTCGA TATCGTTCAA GACTATGCTC CTGAAGAACC GGGCGTGGAT
TGGGCAGGTG TTACTCCGCT AGAGCGTGCT TACGGTTATG AACCGTTAGC TGACGTTCCG
GCCAATAACC CACTGCGTAA ACGCATTTTA GGTATTCAAT GCGCCTTGTG GTGTGAATTG
ATCAATCACT CAGAACGCAT GGAATACATG CTCTATCCAC GTCTCACGGC ATTAGCTGAA
GGCGGTTGGA CAGAGAAATC CCAGCGTGAC TGGTTGGATT ATCTAGCGCG TTTGAAAGGC
CATTTACCAC TGCTTGATAA GCAGAAAATA CCTTATCGCG CGCCTTGGAA GTAA
 
Protein sequence
MSYRIEFAVL SEQKPDCRFG LTLHNLSDQD LHDWSLYFVI DRYIQPMSVT NGQLTQVGSL 
CSIVPTEKVL QANGHFYCEF IIKTAPYHFY TDGVKHAFVQ LNDKQPVERI NVAVNPIVLA
SPFRERSQIP EVTAAELCLI PKPNSLQRFQ GEFVVNHSSQ ISLQSDSAAR AARWLEQELH
ALHEFKLNTV GHSDIVYRSN PTLDEGHYQL NIEAQGIKIE AGSHSGFMHA SATLLQLAQA
HQGSLRFPLV NIVDAPRFKY RGMMLDCARH FHSLEQVKRV INQLAHYKFN VFHWHLTDDE
GWRIEIKRLP QLTDIGAWRG MDEVLEPQYS LLTERHGGFY TQEEIRAVIE YASDRGITVI
PEIDVPGHSR AAIKALPEWL VDEEDCSQYR SIQYYNDNVL SPALPGTYQF LDIVLEEVAA
LFPSQFIHIG ADEVPHGVWV DSPKCQALMQ EQGYTDPKEL QGHLLRYAEK KLKSLGKRMV
GWEEAHHGDK VSKDTVIYSW LSEKAALDCA KQGFDVILQP GQFTYLDIVQ DYAPEEPGVD
WAGVTPLERA YGYEPLADVP ANNPLRKRIL GIQCALWCEL INHSERMEYM LYPRLTALAE
GGWTEKSQRD WLDYLARLKG HLPLLDKQKI PYRAPWK
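
The gene and protein sequences above are related by translation table 11 (bacterial, archaeal and plant plastid code), whose codon-to-amino-acid assignments are the same as the standard code; the two tables differ only in which codons may act as start codons. The following is a minimal sketch of how that relationship, and the GC content field, could be checked. It is not the method used to produce this record. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the example input is only the first 60-base line of the gene sequence, so the GC figure it prints applies to that fragment rather than to the whole gene.

```python
from itertools import product

# Standard codon assignments, listed in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence, copied from this record.
first_line = "ATGAGTTATC GAATTGAATT TGCGGTGCTC TCGGAACAAA AACCGGATTG CCGTTTTGGT"
print(translate(first_line))  # MSYRIEFAVLSEQKPDCRFG
print(f"{gc_fraction(first_line):.2f}")
```

The translated fragment matches the first 20 residues of the protein sequence. Passing the full gene sequence to the same functions would give the complete protein and a whole-gene GC fraction to compare against the listed 50%.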