Gene VC0395_A1197 details

Gene Information

Locus tag: VC0395_A1197
Symbol: galK
ID: 5137652
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio cholerae O395
Kingdom: Bacteria
Replicon accession: NC_009457
Strand:
Start bp: 1260635
End bp: 1261852
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 52%
IMG OID: 640532655
Product: galactokinase
Protein accession: YP_001217143
Protein GI: 147673567
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0153] Galactokinase
TIGRFAM ID: [TIGR00131] galactokinase
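
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
length = gene_length(1260635, 1261852)
print(length, protein_length(length))  # 1218 405
```

Under these assumptions the result agrees with the listed gene length (1218 bp) and protein length (405 aa).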


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value: 0.765458
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAGCGGCG CAGCACCAAC GCCAGCGTCG CTCTCCCCAT TTAAAGTGAG AAGCCCTATG 
TCTGAATTAA TCCAAAACGT GACTACAACC TTTGCACAAC TCTTTGGCTA TGATGCTACG
CACCTTGTGC AAGCTCCGGG GCGGGTCAAT TTGATCGGCG AGCACACCGA CTACAACGAT
GGCTTTGTGC TGCCTTGCGC GATTAACTAC CAAACCGTCG TGGCTGCAGC CAAACGGGAA
GACTTTCTAG TGCGTTTGGT CGCCGTCGAT TACGACAATG ACACGGACGA ATTTGACCTG
CGAGAAGAGA TTGCCTTTCA GCCTAAAAAA ATGTGGTCGA ACTATATTCG CGGTGTAATC
AAGTGCTTGA TTGAACGTGG TTTTGAGTTT AATGGGGCAG ATATTGTGGT CTCAGGTAAC
GTACCTCAAG GGGCGGGTCT CAGTTCCTCG GCGGCTTTAG AAGTCGTGAT TGGGCAAACT
TTTAAAGAGC TTTACCAGCT AAAAATCAGT CAGGCGGAGA TCGCCCTCAA TGGCCAGCAA
GCGGAGAACC AGTTTGTCGG TTGTAACTGC GGCATTATGG ATCAGATGAT CTCGGCGCAG
GGGCAAGCGA ACCATGCCAT GTTGCTTGAT TGTCGTAGCT TGCAAACCGA GGCCGTTGCG
ATGCCAGAGC AGATGGCAGT GGTGATCCTC AATTCCAATA AAAAACGCGG CTTGGTGGAG
AGTGAATACA ATACCCGTCG TCAGCAATGC GAAGCCGCAG CCAAAACTTT TGGTGTGAAA
GCGCTACGCG ATGTCACTTT GGCGCAATTG ACTGCAAAGC AGGCCGAACT TGATCCTGTG
GTGGCCAAAC GTGCGCGCCA TGTCATCACG GAAAATGAAC GCACTTTACA TGCCGCTCAG
GCCCTGCGTG AAGGAAACAT GCCGCGCTTA GGCGAGTTAA TGGCCGCTTC TCACGCTTCG
ATGCGTGATG ATTTTGAAAT CACTGTCAAG GAGATAGATA CGCTGGTCGA GATTGTTCAA
TCTGTGATTG GCGATCAAGG CGGTGTGCGG ATGACTGGCG GCGGCTTTGG TGGTTGTGTG
GTGGCCCTTG TACACCCGAA GCAAGTAGAA GCGGTGCAGC AAGCGGTGGC TGAACACTAT
GAAGCTGCGA CAGGGCTGAA GGCATCGATC TATGTCTGCC ATGCAACTTC GGGCGCGGGA
TTGGTTGAGC TTGCATAA
 
Protein sequence
MSGAAPTPAS LSPFKVRSPM SELIQNVTTT FAQLFGYDAT HLVQAPGRVN LIGEHTDYND 
GFVLPCAINY QTVVAAAKRE DFLVRLVAVD YDNDTDEFDL REEIAFQPKK MWSNYIRGVI
KCLIERGFEF NGADIVVSGN VPQGAGLSSS AALEVVIGQT FKELYQLKIS QAEIALNGQQ
AENQFVGCNC GIMDQMISAQ GQANHAMLLD CRSLQTEAVA MPEQMAVVIL NSNKKRGLVE
SEYNTRRQQC EAAAKTFGVK ALRDVTLAQL TAKQAELDPV VAKRARHVIT ENERTLHAAQ
ALREGNMPRL GELMAASHAS MRDDFEITVK EIDTLVEIVQ SVIGDQGGVR MTGGGFGGCV
VALVHPKQVE AVQQAVAEHY EAATGLKASI YVCHATSGAG LVELA
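
The gene sequence begins with TTG rather than ATG, while the protein begins with M. This is consistent with translation table 11, in which several alternative codons can initiate translation and are read as methionine at the first position. The following is a minimal Python sketch of that relationship, using only the standard library. The helper names (`clean_sequence`, `gc_content`, `translate`) are hypothetical and not part of the source database. The codon assignments follow the NCBI layout for table 11, and the sketch assumes the input contains only A, C, G and T.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Table 11 shares these
# assignments with the standard code; only the start codons differ.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons of table 11; each is read as Met at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str, first_is_start: bool = True) -> str:
    """Translate a cleaned sequence in frame 1, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and first_is_start and codon in START_CODONS:
            residues.append("M")  # alternative start codon read as Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and final 18 bases of the gene sequence in this record.
head = clean_sequence("TTGAGCGGCG CAGCACCAAC GCCAGCGTCG")
tail = clean_sequence("TTGGTTGAGC TTGCATAA")
print(translate(head))                        # MSGAAPTPAS
print(translate(tail, first_is_start=False))  # LVELA
```

The two printed fragments match the first ten and last five residues of the protein sequence. Passing the full cleaned gene sequence to `gc_content` gives a value that can be compared with the listed GC content.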