Gene VC0395_A0141 details

Gene Information

Locus tag: VC0395_A0141
Symbol:
ID: 5135769
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio cholerae O395
Kingdom: Bacteria
Replicon accession: NC_009457
Strand:
Start bp: 135135
End bp: 137540
Gene Length: 2406 bp
Protein Length: 801 aa
Translation table: 11
GC content: 50%
IMG OID: 640531601
Product: cellulose degradation product phosphorylase
Protein accession: YP_001216106
Protein GI: 147673254
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3459] Cellobiose phosphorylase
TIGRFAM ID:
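
The start, end, gene length and protein length values in this record are mutually consistent, which can be checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length, assuming the last codon is an uncounted stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(135135, 137540)
print(length)                     # 2406
print(protein_length_aa(length))  # 801
```

Both results match the Gene Length (2406 bp) and Protein Length (801 aa) fields of the record.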


Plasmid Coverage information

Num covering plasmid clones: 41
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATACG GCTATTTCGA TAATGATAAT CGCGAATACG TCATCACTCG CCCTGACGTA 
CCCGCACCTT GGACTAACTA TCTCGGTACT GAAAAATTCT GTACGGTGAT TTCACACAAC
GCAGGGGGTT ACTCCTTCTA TCACTCACCT GAGTACAACC GCGTGACCAA GTTTCGTCCA
AACTTTACCC AAGATCGTCC CGGGCACTAT GTTTACCTGC GCGATGATGC GACAGGCGAT
TTCTGGTCAA TCTCTTGGCA ACCAGTTGCG AAAAGCCTTG AACAAGCGAA ATACGAAGTT
CGCCACGGCT TGTCCTACTC AAAATTCAAG TGTGAGTACA ACGGCATTCA CGCCACCAAA
ACTCTGTTTG TTCCTAAAGG CGAAGATGCC GAAGTTTGGG ATGTAGTGAT CAAAAATACC
TCCAACGAAG TGCGCACCAT CAGTGCGTTC AACTATGTTG AGTTCTCTTT CAGCCACATC
AAGTCAGACA ACCAAAACCA TCAGATGTCG CTCTACTCAG CCGGCACCTC GTTCAAAGAT
GGCGTGATTG AGTATGACCT GTACTACAAC ACCGATGATT TCCTCGGCTT CTACTACCTG
ACTGCAACTT TCGATGCCGA CAGTTACGAC GGCCAACGTG ACCAATTCCT TGGCATGTAC
CGTGATGAAG CCAACCCAAT CGCCGTGGCG CAAGGTAAAT GCTCTAACAG TGCGCAAACC
TGTTACAACC ACTGTGGTGC ACTGCATAAG CAATTCGTGC TGCAACCGGG CGAGAAGGTG
CGCTTTGCGG TGATCTTAGG TGTAGGTAAA GGCAACGGCG CAAAACTGCG TGAAAAATAC
CAAGACCTGA GCAAAGTGGA TTCGGCCTTT GCAGGTATCA AAGCACACTG GGATGAGCGT
TGTGCGAAAT TCCAAGTGAA ATCACCCAAC CAAGGTCTCG ATACCATGAT CAACGCTTGG
ACTCTGTACC AAGCGGAAAC GTGTGTGGTG TGGTCCCGTT TCGCCTCTTT CATTGAAGTC
GGCGGCCGTA CAGGCCTTGG CTACCGTGAT ACTGCGCAAG ATGCGATCTC AGTACCGCAC
ACTAACCCAG CGATGACTCG TAAGCGCCTC GTTGATCTAC TGCGTGGTCA AGTGAAAGCC
GGTTACGGTC TGCACCTGTT TGATCCTGAC TGGTTCGATC CAGAAAAAGC GGATGTTAAA
CCATCTAAAT CACCCACAGT TGTCCCCACA CCGTCGGATG AAGACAAGAT CCACGGCATT
AAAGATACCT GTTCTGACGA TCACCTGTGG ATTGTGCCAA CCATCCTCAA CTATGTGAAA
GAGACCGGTG ACTTCGCCTT TATCGACGAA GTGATTCCTT ACGCGGATGG CGGCAACGCC
ACTGTGTACG AGCACATGAT GGCAGCGCTA GATTTCTCTG CAGAATATGT GGGTCAAACC
GGTATCTGTA AGGGTCTGCG TGCCGACTGG AACGACTGTT TGAACCTCGG TGGTGGTGAG
TCCTCTATGG TCTCTTTCCT ACACTTCTGG GCGTTGGAAT CTTTCCTTGA ACTGTCACGC
TATCGCAATG ATGAAGCGGC AACCGACAAG TACCAAGCGA TGGCCGATGG TGTACGCGAA
GCGTGTGAAA CTCACTTGTG GGATGAACAA GGCGAATGGT ACATCCGTGG CCTGACCAAA
AATGGCGACA AGATCGGAAC CTTCGAACAA GTGGAAGGCA AAGTGCATTT AGAGTCTAAC
TCGCTTGCAG TGTTGTCTGG CACGGTTAGC CATGAACGCG GCATCAAAGC AATGGATGCG
GTCTACAAAT ACCTGTTCTC CAAATACGGT CTACACCTGA ACGCTCCATC ATTTGCCACG
CCAAATGATG ACATCGGTTT CGTGACTCGC GTTTACCAAG GCGTGAAAGA GAACGGCGCG
ATCTTCTCGC ATCCAAACCC ATGGGCATGG GTAGCCGAAG CGAAACTGGG CCGTGGTGAT
CGTGCGATGG AGCTGTATGA CGCACTCAAC CCATACAACC AAAACGACAT CATCGAAACC
CGTATTGCTG AACCTTACTC TTACGTACAG TTCATCATGG GGCGTGACCA CCAAGATCAC
GGCCGCGCTA ACCACCCATG GTTAACCGGT ACTTCTGGCT GGGCATACCA TGCGACCACC
AACTATATCT TGGGTATCAA AGCGGGCTTC GATGCACTGG AGATCGATCC TTGTATCCCA
ACGTCATGGC CGGGTTTTGA AGTGACTCGC GAATGGCGTG ATGCGACTTA TCAGATCAAA
GTGGAAAACC CGCAAAGTGT TTCAAAAGGC GTGAAATCCA TCACCCTAAA TGGTCAAGCG
ATTGAAGGTG CCGTTCCTGT GCAAGCCGCA GGCAGCGTTA ACCAAGTCGT GGTTGTTCTA
GGTTAA
 
Protein sequence
MKYGYFDNDN REYVITRPDV PAPWTNYLGT EKFCTVISHN AGGYSFYHSP EYNRVTKFRP 
NFTQDRPGHY VYLRDDATGD FWSISWQPVA KSLEQAKYEV RHGLSYSKFK CEYNGIHATK
TLFVPKGEDA EVWDVVIKNT SNEVRTISAF NYVEFSFSHI KSDNQNHQMS LYSAGTSFKD
GVIEYDLYYN TDDFLGFYYL TATFDADSYD GQRDQFLGMY RDEANPIAVA QGKCSNSAQT
CYNHCGALHK QFVLQPGEKV RFAVILGVGK GNGAKLREKY QDLSKVDSAF AGIKAHWDER
CAKFQVKSPN QGLDTMINAW TLYQAETCVV WSRFASFIEV GGRTGLGYRD TAQDAISVPH
TNPAMTRKRL VDLLRGQVKA GYGLHLFDPD WFDPEKADVK PSKSPTVVPT PSDEDKIHGI
KDTCSDDHLW IVPTILNYVK ETGDFAFIDE VIPYADGGNA TVYEHMMAAL DFSAEYVGQT
GICKGLRADW NDCLNLGGGE SSMVSFLHFW ALESFLELSR YRNDEAATDK YQAMADGVRE
ACETHLWDEQ GEWYIRGLTK NGDKIGTFEQ VEGKVHLESN SLAVLSGTVS HERGIKAMDA
VYKYLFSKYG LHLNAPSFAT PNDDIGFVTR VYQGVKENGA IFSHPNPWAW VAEAKLGRGD
RAMELYDALN PYNQNDIIET RIAEPYSYVQ FIMGRDHQDH GRANHPWLTG TSGWAYHATT
NYILGIKAGF DALEIDPCIP TSWPGFEVTR EWRDATYQIK VENPQSVSKG VKSITLNGQA
IEGAVPVQAA GSVNQVVVVL G
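
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below illustrates this on the first 60 bases of the gene sequence only. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which start codons are permitted), and the helper name `translate` is hypothetical rather than taken from any database tool.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) and their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace ignored) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


first_60 = ("ATGAAATACG GCTATTTCGA TAATGATAAT "
            "CGCGAATACG TCATCACTCG CCCTGACGTA")
print(translate(first_60))  # MKYGYFDNDNREYVITRPDV
```

The output matches the first 20 residues of the protein sequence above. The final six bases of the gene, `GGTTAA`, translate to `G` followed by a stop codon, consistent with the last residue of the protein.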