Gene VC0395_A0647 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A0647 
Symbolgsk-1 
ID5135622 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp687552 
End bp688856 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content43% 
IMG OID640532105 
Productinosine-guanosine kinase 
Protein accessionYP_001216597 
Protein GI147675110 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0524] Sugar kinases, ribokinase family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.243174 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATTTC CTGGCCAACG CAAATCTAAG CATTATTTTC CGGTTCACGC GCGTGATCCT 
CTGGTCATTC AAGCGCAAGA AAATAAGAAG ATGTCGCGCA CCCACATTAT CGGTATTGAT
CAAACCCTAG TGGATATTGA GGCCAAAGTC GATTCAGATT TGATTGAACG TTATGGTTTA
AGTAAAGGAC ACTCATTGGT CATTGATGAT CATGCTGCAG AAGCGTTATA CAACGAATTA
AAAGAGCAGC GTTTGATTAC CAATGAATAT GCAGGTGGAA CCATAGGGAA TACGCTGCAT
AATTACTCCG TGCTTGCGGA TGATCGCTCC ACGCTACTGG GTGTGATGAG CCAAGATATT
AAAATTGGCA GTTACGGTTA TCGCTATCTG TGTAACACTT CTAGCCGCAT GGATCTCAAC
TATCTGCAAG GTGTAGATGG CGCGATTGGC CGCTGCTTTG CCTTAATTAC CGAAGATGGT
GAACGTACTT TCGCGATCAG CGAAGGTCAA ATGAACCAAT TACGCCCAGA CAGTATTCCT
GAAAAAATAT TTGCTAGTGC CTCTGCATTA GTGATCACGG CTTATTTAGT TCGTTGTAAA
GAAGGCGATC CAATGCCGGA AGCAACAATG CGTGCCATTG AATATGCCAA AAAATATGAT
GTGCCTGTGG TATTAACCCT AGGTACTAAA TTTGTTATTC AAGATGATCC GAAATTCTGG
CAAGAATTTT TACGTGATCA TGTCACTGTG GTGGCAATGA ATGAAGATGA AGCATTAGCT
TTAACAGGAG AAAGCGATCC GCTCGCAGCC TCAGATAAAG CGTTAGATTG GGTTGATTTA
GTACTGTGTA CTGCAGGCCC AGTCGGTTTA TTTATGGCGG GTTACACCGA AGATTCCGCT
AAACGTGAAA CGTCATTACC GTTATTACCG GGATCGGTCC CAGAATTTAA CCGCTATGAA
TTTAGCCGCC CTGCCCGTAA AGAGAGTTGT ATAAATCCTA TTCGTGTTTA TTCACATATT
TCACCGTATA TAGGCGGTCC AGAGAAGATA AAAAATACCA ATGGTGCTGG AGACGCAGCA
TTATCCGCTC TACTGCATGA TATGGCGGCT AATAAATACC ATAAAGAAAA CGTCCCTAAC
TCAAGTAAGC ATCAACATGA GTTTTTAACC TATTCTTCTT TCTCTCAAGT TTGCAAATAC
TCTAACCGTG CAAGTTATGA AGTATTGGCG CAGCACTCAC CACGTCTTTC ACGTGGTCTT
CCTGAGCGAG AAGATAGCCT TGAAGAGGCT TATTGGGAAA GATAA
 
Protein sequence
MKFPGQRKSK HYFPVHARDP LVIQAQENKK MSRTHIIGID QTLVDIEAKV DSDLIERYGL 
SKGHSLVIDD HAAEALYNEL KEQRLITNEY AGGTIGNTLH NYSVLADDRS TLLGVMSQDI
KIGSYGYRYL CNTSSRMDLN YLQGVDGAIG RCFALITEDG ERTFAISEGQ MNQLRPDSIP
EKIFASASAL VITAYLVRCK EGDPMPEATM RAIEYAKKYD VPVVLTLGTK FVIQDDPKFW
QEFLRDHVTV VAMNEDEALA LTGESDPLAA SDKALDWVDL VLCTAGPVGL FMAGYTEDSA
KRETSLPLLP GSVPEFNRYE FSRPARKESC INPIRVYSHI SPYIGGPEKI KNTNGAGDAA
LSALLHDMAA NKYHKENVPN SSKHQHEFLT YSSFSQVCKY SNRASYEVLA QHSPRLSRGL
PEREDSLEEA YWER