Gene VC0395_A0903 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A0903 
SymbolcelF 
ID5135712 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp917472 
End bp918794 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content46% 
IMG OID640532361 
Product6-phospho-beta-glucosidase 
Protein accessionYP_001216849 
Protein GI147674231 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.851589 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTCGAA ATGCATTAAA GCTGGCAGTC ATCGGCGGTG GCAGTAGTTA CACCCCAGAG 
CTTGTGGAAG GGGTTCTCAA AAGATTGGAT TTTCTGCCAG TGAAGCAGTT GCACTTTGTG
GATATTGAAG CAGGTGCGGA AAAATTACAG ATCATTCGAG ATCTTGCGCA GAGAATGGTC
GACAAAGTTG GAGCTGACAT TGAAATTAAA GCGGGCTTTG ATCGCCGAGA GGCGATTACT
GGCGCTGATT TCGTTATGAC CCAGTTTCGG GTTGGGGGGC TTGCCGCTCG CGCGAATGAT
GAACGAATTC CACTGAAATA TGATGTGATT GGGCAAGAAA CCACAGGGCC TGGTGGTTTT
GCCAAAGCAC TTCGAACGAT CCCAGTCATA CTGGATATTT GCCGAGACAT TGAAGAGCTA
GCACCCAATG CCTGGATGCT CAACTTTACC AATCCTGCGG GCTTGGTCTC GGAGGCAGTC
AGTAAATACT CAAAAGTCAA AAGCATAGGA TTATGTAATG TTCCCGTTTC TATGCAGATG
ATGATTGCTG AAATGATGGC CTGTGAACCT CAAGACCTTC AGCTCGAGTT CGCTGGGCTG
AACCATTTAG TTTGGGTGCA CCAAGCTTGG TTGGATGGTC AAAATATCAC TCAAACCGTT
TTAGAAAAGG TGGGTGATGG CGCCAATTTC AGCATGAAAA ATATCTGGGA AGAACCTTGG
GATCCTGAGT TCTTAAAAGC GTTAGGCGCG ATTCCTTGCC CTTATCACCG CTACTTTTAC
CAAACGGATG CTATGTTGGC AGAAGAGAAA AAAAGTGCTC AAGAGAAAGG AACGCGAGCT
GAGCAAGTGA TGGTCACTGA AAAGGCCTTA TTTGAGCTCT ATCAAGATCC ACATCTGGCT
CATAAGCCTA AAGAGTTAGA AGCGCGTGGC GGTGCTTATT ATTCGGATGC CTCGTTAAAT
CTTGTTGATG CCATCTACAA TAATCGCAAT AGTATTCATG TGGTTAATGT GCAGAATCAT
GGGGCGATCA GTTCATTACC TCATGATGCT GTGATTGAAT GCAGTGCGGT GGTGGGCAGT
TGGGGAGCAA AACCGATTGC GGTCGGAGAA CTTTCACCAA AAATCAGTGG ATTGCTACAT
CAAGTGAAAG CCTATGAGCA GCTTGCGATA GAAGCTGCGG TACACGGTGA CTATCACCTT
GCTTTAATGG CTTTAACGAA TAACCCATTG GTGCCTGATA TTGGTCGGGC GAAAGCCATT
TTGGATGATA TTCTGCGTGA AAATGCCGTT TATCTACCCC AATTTAAACT CACCACATGG
TAA
 
Protein sequence
MSRNALKLAV IGGGSSYTPE LVEGVLKRLD FLPVKQLHFV DIEAGAEKLQ IIRDLAQRMV 
DKVGADIEIK AGFDRREAIT GADFVMTQFR VGGLAARAND ERIPLKYDVI GQETTGPGGF
AKALRTIPVI LDICRDIEEL APNAWMLNFT NPAGLVSEAV SKYSKVKSIG LCNVPVSMQM
MIAEMMACEP QDLQLEFAGL NHLVWVHQAW LDGQNITQTV LEKVGDGANF SMKNIWEEPW
DPEFLKALGA IPCPYHRYFY QTDAMLAEEK KSAQEKGTRA EQVMVTEKAL FELYQDPHLA
HKPKELEARG GAYYSDASLN LVDAIYNNRN SIHVVNVQNH GAISSLPHDA VIECSAVVGS
WGAKPIAVGE LSPKISGLLH QVKAYEQLAI EAAVHGDYHL ALMALTNNPL VPDIGRAKAI
LDDILRENAV YLPQFKLTTW