Gene Plav_2698 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_2698 
Symbol 
ID5456574 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp2902292 
End bp2903506 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content68% 
IMG OID640878275 
Productbeta-ketoadipyl CoA thiolase 
Protein accessionYP_001413963 
Protein GI154253139 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases
[TIGR02430] beta-ketoadipyl CoA thiolase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.398036 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCATC AGATCAGAGA CGCTTTCATT TGCGACGCCG TGCGGACGCC TATCGGGCGC 
TATGCGGGGG CGCTGGCGCA GGTGCGCGCG GACGATCTCG GCGCGGTGCC GCTGATGGCG
CTGATGGAGC GGAACCCGGA TGTGAACTGG GAGCGGGTGG ACGATGTGAT CTTCGGCTGC
GCCAACCAGG CGGGCGAGGA CAACCGGAAC GTGGCGCGGA TGTCGGCGCT GCTGGCGGGG
CTGCCCGAAG GCGTGCCGGG ATCGACGGTG AACCGGCTCT GCGGCTCGGG CATGGATGCC
GTCGGCACGG CGGCGCGGGC AATCAAATCG GGCGAGGCAT CGCTGATGAT CGCCGGCGGC
GTGGAGAGCA TGTCGCGCGC GCCCTTCGTG ATGGGGAAGG CGACGAGCGC CTTTTCGCGC
GATGCGGAGA TTTACGACAC GACTATCGGC TGGCGCTTCG TGAACCCGCT GATGAAGCGG
CAATATGGCG TCGACTCCAT GCCGGAGACG GCGGAGAACG TGGCCGAGGA TTTCCAGATT
TCACGCGCCG ACCAGGATGC CTTTGCGTGG CGGAGCCAGC AGCGTGCCGG ACGGGCCATC
GAAGAGGGGC GTTTCGCGCA GGAGATCGTG CCGGTGACGA TTGCGAGCCG CAAAGGCGAG
ACGGTGGTGA GCGCGGACGA GCACCCGCGG CCAGAAACGA CGCTCGAGGC GCTCGGCAAG
CTCAAGGCGC CGTTCCGCGA AGGCGGCACG GTGACGGCGG GGAACGCATC GGGCGTGAAT
GACGGTGCCT GCGCGCTCAT CATTGCGTCG GCCGACGGAG CGGAGGCGAA CGGGCTTCGC
CCGCGGGCGC GGATCGTGGC GATGGCGACG GCGGGCGTTC CGCCGCGCAT CATGGGTATG
GGGCCGGCGC CAGCGACGCG CAAGGTGCTG GAAAAGACGG GGCTCAATAT CGGCGATATC
GACGTGATCG AACTCAACGA GGCTTTCGCC TCGCAGGGGC TTGCCGTGCT GCGCGATCTC
GGGCTGCCGG ACAATGCGGA TCACGTGAAC CCGAATGGCG GCGCCATCGC GCTCGGCCAT
CCGCTCGGCA TGAGCGGGGC GCGGCTGGTG ACGACGGCGA TGTACGAACT GGAGAAGCGC
GACGGGCGCT ATGCGCTGTG CACGATGTGC ATCGGCGTGG GGCAGGGCAT TGCGATGGTG
ATCGAGAGGG TTTGA
 
Protein sequence
MTHQIRDAFI CDAVRTPIGR YAGALAQVRA DDLGAVPLMA LMERNPDVNW ERVDDVIFGC 
ANQAGEDNRN VARMSALLAG LPEGVPGSTV NRLCGSGMDA VGTAARAIKS GEASLMIAGG
VESMSRAPFV MGKATSAFSR DAEIYDTTIG WRFVNPLMKR QYGVDSMPET AENVAEDFQI
SRADQDAFAW RSQQRAGRAI EEGRFAQEIV PVTIASRKGE TVVSADEHPR PETTLEALGK
LKAPFREGGT VTAGNASGVN DGACALIIAS ADGAEANGLR PRARIVAMAT AGVPPRIMGM
GPAPATRKVL EKTGLNIGDI DVIELNEAFA SQGLAVLRDL GLPDNADHVN PNGGAIALGH
PLGMSGARLV TTAMYELEKR DGRYALCTMC IGVGQGIAMV IERV