Gene Plav_1934 details

Gene Information

Locus tag: Plav_1934
Symbol:
ID: 5453658
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Parvibaculum lavamentivorans DS-1
Kingdom: Bacteria
Replicon accession: NC_009719
Strand:
Start bp: 2106318
End bp: 2107367
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 64%
IMG OID: 640877511
Product: galactokinase
Protein accession: YP_001413206
Protein GI: 154252382
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0153] Galactokinase
TIGRFAM ID: [TIGR00131] galactokinase
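
The start, end, gene length, and protein length values in this record agree with one another, and the arithmetic can be checked directly. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the coding sequence ends with a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2106318
end_bp = 2107367

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1050
print(protein_length)  # 349
```

Under these assumptions the result matches the listed 1050 bp gene length and 349 aa protein length.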


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value: 0.210895
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGGAAA TCTTCAAGGA GATATTCAAT CGCGATGCGG CGGCGGAAGC CTCCGCGCCC 
GGCCGCGTCA ATCTCATCGG CGATCATACG GATTATGCAG GCGGCTTTTG CCTGCCGATG
CCGCTCGCGC TCGAGACACG AGTGGCGATG GCACCTGCGC CGGCGTTTCG CGCCCACAGC
CTCGATCTGG ACGAGACGGC GCCGTTCGAT CCTGCCGCGC CCGCGCGTGG CGACTGGACG
GACTATATCG CGGGGCCGCT TGCGGTACTC CGGCAGGCGG GATTTGCCGT TCCGCCGGTC
GAGGTGCTGG TGAGTTCCGA TGTCCCGCAA GGGGCGGGCG TTTCTTCGTC GGCTGCGTTA
GAGGTTGCCA CCTTGCGCGC GGCGCTGGAT TTGTCAGGTG CCAAGCTGCC GGACATGGAA
GTGGCGCGGC TCGCGCAGTC GGCGGAAAAC GTCTATTGCG GCGTTCAATG CGGCATCCTC
GATCAGATGG CGAGTGCCGT GGGCCGTCCC GGTCAGGCGT TGCTCCTCGA TTGCCGGAGC
AACGGCACGC GCCTTGTGCC AGTGCCGCCT GAATTCCATT TCGCCATTGT CCATTGCGGC
GAGGCACGCC GGCTCGTCGA TGGCGAATAT AATGAACGCC GCCGCTCGGT GGAAGAAGCG
GCGCGGCTCC TTGGCATGGT TTCACTGCGC GATGCGGGTC CGGACGATCT TGCCGGGATC
TCCGATGTGC GACTTCTCAA GCGCGCCCGT CATGTTGTCA GCGAGAATAC GAGGGTAACT
GCTGCCGTGG CGGCGCTTGA GCGGCGCGAC CTGCGCGGCT TCGGCATGTT GATGGTGGAG
AGCCATCGCT CGCTCGCGGA AAATTTCGAG GTTTCTACGC CGGTGCTTGA CCGTCTTGTC
GATGATGCGC TCGAAGCCGG TGCTTATGGC GCGCGGCTCA CCGGCGCGGG TTTTGGCGGA
TGTATTGTCG CGCTGTTGCC GGCGGGCAGG GAAGTCTGGT GGAAGAAAGT ATCGGCCGCT
CATCCGAAGG CGTGGCTCGT GCAGGCGTGA
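
The gene sequence is printed in space-separated groups of ten bases, so the grouping has to be removed before computing a GC percentage or translating. The sketch below shows one way to do both in plain Python. It assumes the sequence is already in the coding orientation and in frame from its first base. The helper names (`ungroup`, `gc_percent`, `translate`) are hypothetical and do not come from the record. Translation table 11 assigns the same amino acids to the 64 codons as the standard code (the two differ in which start codons are permitted), so a standard codon table is used here.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def ungroup(seq: str) -> str:
    """Remove the spaces and line breaks used to print the sequence in blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a (possibly block-formatted) sequence."""
    seq = ungroup(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate whole codons from the first base; '*' marks a stop codon."""
    seq = ungroup(seq)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and last line of the gene sequence in this record.
first_block = "ATGAGGGAAA TCTTCAAGGA GATATTCAAT"
last_line = "CATCCGAAGG CGTGGCTCGT GCAGGCGTGA"

print(translate(first_block))  # MREIFKEIFN
print(translate(last_line))    # HPKAWLVQA*
```

The two translated fragments correspond to the first and last residues of the protein sequence in this record, with the trailing `*` marking the TGA stop codon. Passing the full gene sequence to `gc_percent` would give a value to compare with the listed GC content.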
 
Protein sequence
MREIFKEIFN RDAAAEASAP GRVNLIGDHT DYAGGFCLPM PLALETRVAM APAPAFRAHS 
LDLDETAPFD PAAPARGDWT DYIAGPLAVL RQAGFAVPPV EVLVSSDVPQ GAGVSSSAAL
EVATLRAALD LSGAKLPDME VARLAQSAEN VYCGVQCGIL DQMASAVGRP GQALLLDCRS
NGTRLVPVPP EFHFAIVHCG EARRLVDGEY NERRRSVEEA ARLLGMVSLR DAGPDDLAGI
SDVRLLKRAR HVVSENTRVT AAVAALERRD LRGFGMLMVE SHRSLAENFE VSTPVLDRLV
DDALEAGAYG ARLTGAGFGG CIVALLPAGR EVWWKKVSAA HPKAWLVQA
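
The protein sequence is printed in the same ten-residue blocks as the gene sequence. A short sketch for removing the block formatting and confirming the residue count against the Protein Length field (349 aa) follows. The variable names are illustrative only.

```python
# Protein sequence copied from this record, still in ten-residue blocks.
protein_blocks = """
MREIFKEIFN RDAAAEASAP GRVNLIGDHT DYAGGFCLPM PLALETRVAM APAPAFRAHS
LDLDETAPFD PAAPARGDWT DYIAGPLAVL RQAGFAVPPV EVLVSSDVPQ GAGVSSSAAL
EVATLRAALD LSGAKLPDME VARLAQSAEN VYCGVQCGIL DQMASAVGRP GQALLLDCRS
NGTRLVPVPP EFHFAIVHCG EARRLVDGEY NERRRSVEEA ARLLGMVSLR DAGPDDLAGI
SDVRLLKRAR HVVSENTRVT AAVAALERRD LRGFGMLMVE SHRSLAENFE VSTPVLDRLV
DDALEAGAYG ARLTGAGFGG CIVALLPAGR EVWWKKVSAA HPKAWLVQA
"""

# split() with no arguments drops spaces and newlines alike.
protein = "".join(protein_blocks.split())

print(len(protein))  # 349
```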