Gene Information |
Locus tag | VC0395_A0903 |
Symbol | celF |
ID | 5135712 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | + |
Start bp | 917472 |
End bp | 918794 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640532361 |
Product | 6-phospho-beta-glucosidase |
Protein accession | YP_001216849 |
Protein GI | 147674231 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
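The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only illustrative: it assumes the start and end coordinates are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence in this record ends in TAA). The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 917472
END_BP = 918794


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS includes exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1323
print(protein_length(length_bp))  # 440
```

Under these assumptions both results agree with the listed Gene Length (1323 bp) and Protein Length (440 aa).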
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.851589 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTCGAA ATGCATTAAA GCTGGCAGTC ATCGGCGGTG GCAGTAGTTA CACCCCAGAG CTTGTGGAAG GGGTTCTCAA AAGATTGGAT TTTCTGCCAG TGAAGCAGTT GCACTTTGTG GATATTGAAG CAGGTGCGGA AAAATTACAG ATCATTCGAG ATCTTGCGCA GAGAATGGTC GACAAAGTTG GAGCTGACAT TGAAATTAAA GCGGGCTTTG ATCGCCGAGA GGCGATTACT GGCGCTGATT TCGTTATGAC CCAGTTTCGG GTTGGGGGGC TTGCCGCTCG CGCGAATGAT GAACGAATTC CACTGAAATA TGATGTGATT GGGCAAGAAA CCACAGGGCC TGGTGGTTTT GCCAAAGCAC TTCGAACGAT CCCAGTCATA CTGGATATTT GCCGAGACAT TGAAGAGCTA GCACCCAATG CCTGGATGCT CAACTTTACC AATCCTGCGG GCTTGGTCTC GGAGGCAGTC AGTAAATACT CAAAAGTCAA AAGCATAGGA TTATGTAATG TTCCCGTTTC TATGCAGATG ATGATTGCTG AAATGATGGC CTGTGAACCT CAAGACCTTC AGCTCGAGTT CGCTGGGCTG AACCATTTAG TTTGGGTGCA CCAAGCTTGG TTGGATGGTC AAAATATCAC TCAAACCGTT TTAGAAAAGG TGGGTGATGG CGCCAATTTC AGCATGAAAA ATATCTGGGA AGAACCTTGG GATCCTGAGT TCTTAAAAGC GTTAGGCGCG ATTCCTTGCC CTTATCACCG CTACTTTTAC CAAACGGATG CTATGTTGGC AGAAGAGAAA AAAAGTGCTC AAGAGAAAGG AACGCGAGCT GAGCAAGTGA TGGTCACTGA AAAGGCCTTA TTTGAGCTCT ATCAAGATCC ACATCTGGCT CATAAGCCTA AAGAGTTAGA AGCGCGTGGC GGTGCTTATT ATTCGGATGC CTCGTTAAAT CTTGTTGATG CCATCTACAA TAATCGCAAT AGTATTCATG TGGTTAATGT GCAGAATCAT GGGGCGATCA GTTCATTACC TCATGATGCT GTGATTGAAT GCAGTGCGGT GGTGGGCAGT TGGGGAGCAA AACCGATTGC GGTCGGAGAA CTTTCACCAA AAATCAGTGG ATTGCTACAT CAAGTGAAAG CCTATGAGCA GCTTGCGATA GAAGCTGCGG TACACGGTGA CTATCACCTT GCTTTAATGG CTTTAACGAA TAACCCATTG GTGCCTGATA TTGGTCGGGC GAAAGCCATT TTGGATGATA TTCTGCGTGA AAATGCCGTT TATCTACCCC AATTTAAACT CACCACATGG TAA
|
Protein sequence | MSRNALKLAV IGGGSSYTPE LVEGVLKRLD FLPVKQLHFV DIEAGAEKLQ IIRDLAQRMV DKVGADIEIK AGFDRREAIT GADFVMTQFR VGGLAARAND ERIPLKYDVI GQETTGPGGF AKALRTIPVI LDICRDIEEL APNAWMLNFT NPAGLVSEAV SKYSKVKSIG LCNVPVSMQM MIAEMMACEP QDLQLEFAGL NHLVWVHQAW LDGQNITQTV LEKVGDGANF SMKNIWEEPW DPEFLKALGA IPCPYHRYFY QTDAMLAEEK KSAQEKGTRA EQVMVTEKAL FELYQDPHLA HKPKELEARG GAYYSDASLN LVDAIYNNRN SIHVVNVQNH GAISSLPHDA VIECSAVVGS WGAKPIAVGE LSPKISGLLH QVKAYEQLAI EAAVHGDYHL ALMALTNNPL VPDIGRAKAI LDDILRENAV YLPQFKLTTW
|
| |
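As a quick consistency check between the two sequence fields, the first and last codons of the gene sequence can be translated and compared with the ends of the protein sequence. This is a minimal sketch, not the database's own method: the `translate` helper is hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which this sketch does not model).

```python
from itertools import product

# Codon-to-amino-acid map in the conventional TCAG ordering.
# Translation table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Spaces are removed first, since the record prints the sequence
    in blocks of 10 bases.
    """
    seq = dna.replace(" ", "").upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 15 bases of the gene sequence, copied from the record.
print(translate("ATGTCTCGAA ATGCATTAAA GCTGGCAGTC"))  # MSRNALKLAV
print(translate("CTCACCACATGGTAA"))                   # LTTW
```

The two outputs match the first ten and last four residues of the protein sequence listed above. Passing the full gene sequence to the same function would allow a complete comparison.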