Gene Plav_3098 details


Gene Information

Locus tag: Plav_3098
Symbol:
ID: 5454807
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Parvibaculum lavamentivorans DS-1
Kingdom: Bacteria
Replicon accession: NC_009719
Strand:
Start bp: 3309547
End bp: 3310662
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 61%
IMG OID: 640878687
Product: HpcH/HpaI aldolase
Protein accession: YP_001414362
Protein GI: 154253538
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2301] Citrate lyase beta subunit
TIGRFAM ID:
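
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3309547, 3310662)
print(length)                     # 1116
print(protein_length_aa(length))  # 371
```

Under these assumptions, both values agree with the Gene Length and Protein Length fields of this record.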


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.136599
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value: 0.227792
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTTCAT GGATTTGCGA TCGAAGCATC GCTAGACTGC GCGCAAATCG CCCTTATCCC 
CAGTGGACTG TCATGAAACT GCCATCGTTC TTTTACAAAC CGCTCGCCAT CGGTGCCCCG
GCCCCCTACC GGGAGCTGCC CGTCGCCCTC GAACGGATGA TCCATTTCTT CCCCGGACAC
AATGAGAAGC TCCGTTCGCG CCTCGGCGAG ATAATCCCGC AGGTCGATGT GCTGCTCGGC
AACCTCGAGG ATGCGATTCC GGCGGACGCG AAGGAGGCGG CGCGGAAGGG TGTGATCGAG
GTCGGCAAGA GCCATGATTT CGGATCGACC GGTTTCTGGG TCCGCATCAA TCCGCTGAAC
AGTCCCTGGA TGCTGGACGA TGTGACGGAG CTCGTGGCCG AGATCGGCGA CAAGCTCGAC
GTCATCATGC TGCCCAAGGT GGAAGGCGCA TGGGATATCC ACTATCTGGA CCAGCTGCTC
GCACAGCTCG AGGCGAAGCA TGGGCTGAAG AAGCCGCTTC TCATTCACGC GATCCTCGAA
ACCGCACAAG GCGTGAAGAA CGTGGAGGAG ATCGCCTGCG CCTCCCCCCG CATGCATGGC
ATGAGCCTCG GCCCGGCGGA CCTCGCGGCA TCGCGAAAGA TGAAGACGAC GCGCGTCGGC
GGCGGACATC CTTTCTACCG GGTGCTCGAA GACCCGAAGG AAGACGGCAG CCCCCGCATC
GCCGCCCAGC AGGACCTCTG GCACTACACA TTTGCCAAAA TGGTGGACGC CTGCGTCGCG
AACGACATCC GTCCCTTTTA CGGCCCCTTC GGCGACATTC AGGACGCGGA AGCCTGCGAG
CAGCAGTTCC GCAATGCCTT CCTCATGGGC TGCGTCGGCG CATGGTCGCT GCATCCGAAC
CAGATCGCGA TCGCCAAGCG CGTCTTCAGC CCGGACCCGG ACGAAGTGCA ATTTGCAAAA
CGCATCCTGG AGGCCATGCC CGACGGCACC GGCGTTGCGA TGATCGACGG CAAGATGCAG
GACGACGCCA CCTGGAAACA GGCAAAGGTG ATCGTCGATC TCGCAAAAAA GGTCGGGGCC
AAAGACCCCG ACCTTGCAAA AGCCTACGGC TTCTGA
 
Protein sequence
MRSWICDRSI ARLRANRPYP QWTVMKLPSF FYKPLAIGAP APYRELPVAL ERMIHFFPGH 
NEKLRSRLGE IIPQVDVLLG NLEDAIPADA KEAARKGVIE VGKSHDFGST GFWVRINPLN
SPWMLDDVTE LVAEIGDKLD VIMLPKVEGA WDIHYLDQLL AQLEAKHGLK KPLLIHAILE
TAQGVKNVEE IACASPRMHG MSLGPADLAA SRKMKTTRVG GGHPFYRVLE DPKEDGSPRI
AAQQDLWHYT FAKMVDACVA NDIRPFYGPF GDIQDAEACE QQFRNAFLMG CVGAWSLHPN
QIAIAKRVFS PDPDEVQFAK RILEAMPDGT GVAMIDGKMQ DDATWKQAKV IVDLAKKVGA
KDPDLAKAYG F
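
The relationship between the gene sequence and the protein sequence above can be sketched in code. This is a minimal illustration, not the database's own method. It assumes the standard codon assignments, which translation table 11 shares for elongation codons (the two tables differ in which start codons are permitted). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and whitespace is stripped because the sequences are printed in blocks of ten.

```python
from itertools import product

# Standard codon assignments in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line fragment of the gene sequence above (first 30 bases).
print(translate("ATGCGTTCAT GGATTTGCGA TCGAAGCATC"))  # MRSWICDRSI
```

Passing the full gene sequence to `translate` and `gc_fraction` gives a way to compare the result against the Protein sequence and GC content fields of this record.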