Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A0142 |
Symbol | chb-1 |
ID | 5136454 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | - |
Start bp | 137659 |
End bp | 139572 |
Gene Length | 1914 bp |
Protein Length | 637 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640531602 |
Product | beta-N-acetylhexosaminidase |
Protein accession | YP_001216107 |
Protein GI | 147674312 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3525] N-acetyl-beta-hexosaminidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 58 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTATC GAATTGAATT TGCGGTGCTC TCGGAACAAA AACCGGATTG CCGTTTTGGT TTAACCCTGC ACAATTTGAG CGATCAAGAT CTGCATGATT GGTCGCTGTA TTTTGTGATT GATCGTTACA TCCAACCCAT GAGTGTGACC AACGGTCAAC TGACCCAAGT CGGCAGCTTA TGTTCGATTG TTCCAACGGA AAAAGTGTTG CAAGCTAATG GCCACTTTTA TTGTGAGTTC ATCATCAAAA CCGCGCCTTA CCATTTCTAC ACCGATGGGG TAAAGCACGC GTTTGTCCAA CTTAATGATA AACAGCCTGT TGAACGTATT AACGTCGCGG TTAACCCTAT CGTTCTCGCG TCACCATTTC GCGAGCGTAG CCAGATCCCT GAAGTGACTG CGGCTGAGCT CTGTCTCATC CCCAAACCTA ACTCACTGCA ACGTTTCCAA GGTGAGTTTG TGGTCAACCA CTCCAGCCAG ATCTCGCTGC AATCGGACTC GGCCGCGCGT GCTGCACGCT GGTTAGAGCA AGAATTGCAT GCACTGCATG AGTTCAAACT GAATACGGTT GGCCATAGCG ATATCGTCTA CCGCAGTAAT CCCACGCTCG ATGAGGGCCA TTACCAACTC AATATCGAAG CGCAAGGGAT CAAGATTGAA GCAGGCAGCC ACAGTGGCTT TATGCATGCC AGCGCGACTT TGCTGCAACT GGCGCAAGCG CATCAAGGCT CATTGCGCTT TCCTCTGGTC AACATTGTCG ATGCACCGCG CTTTAAGTAT CGCGGTATGA TGCTCGATTG CGCCCGCCAT TTTCACTCGC TTGAGCAGGT CAAACGAGTG ATCAATCAAC TGGCACACTA CAAATTTAAC GTGTTCCACT GGCATCTGAC TGATGATGAA GGTTGGCGTA TTGAGATTAA ACGCCTGCCG CAACTGACAG ACATTGGTGC ATGGCGTGGC ATGGATGAAG TGCTGGAACC TCAGTACAGC TTACTCACCG AGCGTCATGG CGGTTTCTAT ACCCAAGAGG AGATCCGTGC AGTGATTGAG TACGCCAGCG ATCGTGGCAT TACTGTCATC CCTGAAATTG ACGTACCAGG GCACAGCCGC GCCGCGATTA AAGCGCTGCC GGAATGGCTG GTCGATGAGG AAGATTGCTC GCAATATCGC AGTATTCAGT ACTACAACGA CAACGTGCTC TCCCCTGCAC TGCCGGGCAC TTATCAATTC CTCGACATCG TATTAGAAGA AGTGGCTGCG CTGTTTCCAA GCCAATTTAT TCATATCGGT GCCGATGAAG TTCCACACGG TGTGTGGGTA GATAGCCCGA AATGCCAAGC CTTAATGCAA GAGCAAGGCT ATACCGACCC GAAAGAGCTG CAAGGCCACT TACTGCGCTA CGCCGAGAAA AAACTCAAAA GCTTGGGTAA GCGTATGGTC GGCTGGGAAG AAGCCCATCA CGGTGACAAA GTGAGTAAAG ATACGGTGAT TTACTCTTGG TTATCGGAAA AAGCCGCCTT GGATTGCGCC AAACAAGGCT TTGACGTGAT TTTGCAGCCG GGACAATTTA CCTATCTCGA TATCGTTCAA GACTATGCTC CTGAAGAACC GGGCGTGGAT TGGGCAGGTG TTACTCCGCT AGAGCGTGCT TACGGTTATG AACCGTTAGC TGACGTTCCG GCCAATAACC CACTGCGTAA ACGCATTTTA GGTATTCAAT GCGCCTTGTG GTGTGAATTG ATCAATCACT CAGAACGCAT GGAATACATG CTCTATCCAC GTCTCACGGC ATTAGCTGAA GGCGGTTGGA CAGAGAAATC CCAGCGTGAC TGGTTGGATT ATCTAGCGCG TTTGAAAGGC CATTTACCAC TGCTTGATAA GCAGAAAATA CCTTATCGCG CGCCTTGGAA GTAA
|
Protein sequence | MSYRIEFAVL SEQKPDCRFG LTLHNLSDQD LHDWSLYFVI DRYIQPMSVT NGQLTQVGSL CSIVPTEKVL QANGHFYCEF IIKTAPYHFY TDGVKHAFVQ LNDKQPVERI NVAVNPIVLA SPFRERSQIP EVTAAELCLI PKPNSLQRFQ GEFVVNHSSQ ISLQSDSAAR AARWLEQELH ALHEFKLNTV GHSDIVYRSN PTLDEGHYQL NIEAQGIKIE AGSHSGFMHA SATLLQLAQA HQGSLRFPLV NIVDAPRFKY RGMMLDCARH FHSLEQVKRV INQLAHYKFN VFHWHLTDDE GWRIEIKRLP QLTDIGAWRG MDEVLEPQYS LLTERHGGFY TQEEIRAVIE YASDRGITVI PEIDVPGHSR AAIKALPEWL VDEEDCSQYR SIQYYNDNVL SPALPGTYQF LDIVLEEVAA LFPSQFIHIG ADEVPHGVWV DSPKCQALMQ EQGYTDPKEL QGHLLRYAEK KLKSLGKRMV GWEEAHHGDK VSKDTVIYSW LSEKAALDCA KQGFDVILQP GQFTYLDIVQ DYAPEEPGVD WAGVTPLERA YGYEPLADVP ANNPLRKRIL GIQCALWCEL INHSERMEYM LYPRLTALAE GGWTEKSQRD WLDYLARLKG HLPLLDKQKI PYRAPWK
|
| |