Gene Plav_3236 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_3236 
Symbol 
ID5454739 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp3455535 
End bp3457067 
Gene Length1533 bp 
Protein Length510 aa 
Translation table11 
GC content67% 
IMG OID640878827 
Productcarbohydrate kinase, YjeF related protein 
Protein accessionYP_001414498 
Protein GI154253674 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0062] Uncharacterized conserved protein
[COG0063] Predicted sugar kinase 
TIGRFAM ID[TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
[TIGR00197] yjeF N-terminal region 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.358876 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value0.593017 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCAAT CGCGTGGCTA TAGTGGCGCC ATGATGGCCA CTTCCGATTC GTCCGCTCTT 
CTCACCGTCG CCGAGATGGG CCGCGCCGAC GCGCTGACCA TCGAACGCGG CACGCCCGGC
ACGGCCCTTA TGGAAAACGC CGGCCGCGCC GTCGCAGAGG CGATTGTCGC GCGCTTCCCT
CAGGGCCCTG TCTCCGTTCT CTGCGGTCCC GGCAATAATG GCGGTGATGG CTTTGTCGTC
GCGCGCCTGC TGGCGGAGGA GGGCTGGCCG GTTTCGCTCT TCCTTCTCGG CGGCCGCGAT
CGTTTGAAGG GCGACGCAGC CGAAGCGGCA AACCGCTGGC CGGGAGCTGT CCATCCGCTG
TCGGCGGATG CCGGACATGG TGCTTCCCTG GTCGTCGACG CGATCTTCGG CGCTGGCCTA
TCAAAAGAAG TATCGGGCAT TGCCGCCGAC GCAATAGCGC GTATCGCCGA ACGGCAAACG
CCCGTCGTCG CCGTCGATAT TCCGACGGGT ATTGATGGAG ATACGGGGCA GGTAAGGGGC
GCTGCTTTCG ATGCCGCGCT CACCGTGACT TTCTTCCGCG CCAAGCCGGG TCATCTTCTG
CTCCCCGGCC GCCTTCATTG CGGCGAGTTG CAGGTGGCCG ATATCGGCAT CGCGCCGGAT
GTACTCGACG AGATTGCCCC GCAAACATTC ATCGACAAGC CGCAGCTCTG GTCGTCATCC
TTTCCCCGTC CGCGCCTCGA CAGCCACAAA TACACACGCG GCCATGCCGT CGTCGTCTCG
GGCGGTGCCT CGCATACAGG CGCGGCCCGT CTCGCCGCGC GCGGCGCGTT GAGGGCCGGC
GCTGGGCTTG TCACACTCGC CTCGCCGCCA TCCGCGCTCC TCGTCAATGC GGCGCATCTG
ACATCGGTCA TGCTCCAGCC CTTCGATGGT GCCGACGCTC TCACGACGAT CCTTGAGGAC
AAGCGCAAGA ACGCGCTGCT CATCGGCCCC GGCGCGGGAA TCGGCGCCCA CACGCGCGAA
AATGTGCTGG CCGCGCTCCT CTCCGGCGCG GCCATGGTGC TGGACGCGGA CGCGCTCACC
TCCTTCGCCG AAATCCCGCG CGACCTCTTC GTTGCCATTG CGGGCTACTT CGCGGGTCCC
GTCGTCATGA CGCCCCACGA GGGCGAGTTC GGGCGTCTCT TCCCCCGGAT AGCCGAGGGG
GAGGGGAGCA AGCTTGCCCG CGCCCGGGCC GCCGCTGCCG AGGCGTCAGC CATCATCGTG
CTCAAGGGCG CCGACACGGT CGTCGCCGCG CCGGACGGCC GGGCCGCCAT TGCCATGAAT
GGTGGTCCCG AGCTTGCTAC CGCCGGGTCT GGCGATGTGC TGGGGGGCAT CATTCTCGGC
CTCCTGGCTC AGGCCATGCC CCCCTTCGAG GCGGCATGCG CAGGCGTCTG GCTCCATGGC
GAGGCCGGCT CCTGCTTCGG CCCCGGCCTC ATCTCCGAGG ATCTGCCCGA GATGCTGCCT
GCCGTTCTGA GGGATCTTCT CCCGGTCTTG TGA
 
Protein sequence
MPQSRGYSGA MMATSDSSAL LTVAEMGRAD ALTIERGTPG TALMENAGRA VAEAIVARFP 
QGPVSVLCGP GNNGGDGFVV ARLLAEEGWP VSLFLLGGRD RLKGDAAEAA NRWPGAVHPL
SADAGHGASL VVDAIFGAGL SKEVSGIAAD AIARIAERQT PVVAVDIPTG IDGDTGQVRG
AAFDAALTVT FFRAKPGHLL LPGRLHCGEL QVADIGIAPD VLDEIAPQTF IDKPQLWSSS
FPRPRLDSHK YTRGHAVVVS GGASHTGAAR LAARGALRAG AGLVTLASPP SALLVNAAHL
TSVMLQPFDG ADALTTILED KRKNALLIGP GAGIGAHTRE NVLAALLSGA AMVLDADALT
SFAEIPRDLF VAIAGYFAGP VVMTPHEGEF GRLFPRIAEG EGSKLARARA AAAEASAIIV
LKGADTVVAA PDGRAAIAMN GGPELATAGS GDVLGGIILG LLAQAMPPFE AACAGVWLHG
EAGSCFGPGL ISEDLPEMLP AVLRDLLPVL