Gene Plav_3069 details


Gene Information

Locus tag: Plav_3069
Symbol:
ID: 5455716
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Parvibaculum lavamentivorans DS-1
Kingdom: Bacteria
Replicon accession: NC_009719
Strand:
Start bp: 3276387
End bp: 3277895
Gene Length: 1509 bp
Protein Length: 502 aa
Translation table: 11
GC content: 60%
IMG OID: 640878658
Product: hypothetical protein
Protein accession: YP_001414333
Protein GI: 154253509
COG category: [I] Lipid transport and metabolism; [R] General function prediction only
COG ID: [COG1545] Predicted nucleic-acid-binding protein containing a Zn-ribbon; [COG3425] 3-hydroxy-3-methylglutaryl CoA synthase
TIGRFAM ID:
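
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon; both assumptions are consistent with the listed values but are not stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 3276387
END_BP = 3277895
GENE_LENGTH_BP = 1509
PROTEIN_LENGTH_AA = 502


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))      # 1509
print(coding_length_aa(GENE_LENGTH_BP))   # 502
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields.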


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 58
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCCAGC AGACGGTGGG ACTTACAGCA TTTGGCGGAT ATATCCCGCG CCTTCGCCTG 
CAGCGAAAAG CGGTAGCACA GGCGAACGCA TGGGTCGCCC CGAACTTCCT CGGCAAGGGG
AAGGGCGAGC GCTCCATGGC CAATTGGGAC GAAGATGCAC TGACCATGGC GGTAGAGGCG
GCGCGGGACC TGCTGGGGCC CGACGACGAC CGGAGCCATG TGGATGCGCT CTATCTCGGG
TCGACGACCT TGCCCTTCAA AGACCGGCTG AACAGCGGCA TCGTTGCGGC CGCCCTGACG
CTTGATGAGC GGGTTCGCGC CGTCGATGTG GCGTCCACTC AGCGAGCGGG CACCTCGGCC
CTCATGCAGG CGCTGTCCGC AGTCAAGGCG GGCGAGGCGA AGCACGCGCT GGTCCTCGCG
TCCGATCACC GCAAGACCAA AGCAGTCACC GCGCAGGAAC TGGATTTTGG CGACGGTGCA
GCGGCCGTCT CCGTCGGCAC TGAAAATGTG ATTGCTGCAT ATCTCGGCGG CGCAAGCCTG
ACCGTGGATT TCGTCGATCA CTTCCGGGGC GACAGCGAAG AGTTCGACTA TAATTGGGAA
GAACGCTGGA TCCGCGACGA AGGCTTCGCA AAGATCGTTC CGCGCGCGGT GAAGGCAGCA
CTTGAGAACT CCGGCGTAAG TGCTGGAGAG ATCAAGCACT TTGTTCTCCC GTGTAGTTTC
GGCGACAAGT TTGTCCAACA GCTTGCGAAA CGTTCAGGCA TCGCGCCCGA AGCGGCGCGG
GATACGCTCG CGGCGAATTG CGGCGAGACC GGCGCGGCCC ATTCGCTCGT CATGCTCGTT
CACACATTGC AGAAGGAAGC AAAGCCTGGC GACAAGGTGA TGGTTCTCCA TTTCGGCGGC
GGTTGCGACG CTCTCGTCTT CGAAGTGACC GACAAGCTTT CGAGCCAGGC CGGGAAACGC
GGCATCGTCG GTTCGCTCGC CAATCGCACA GAGGAAACCA ACTACCTCAA ATTCCTGACC
TTCAATGGCC TCATTGAATG GGAAAAGGGA ATGCGCGCCG AGCAGGACAA GAAGACCGCT
CTCACCACGC TCTACCGCAA CGAGGACATG CTGATGGGTC TCGTGGGCGG CCGGTGCACG
GAAACGGGTG TCATCCAGTT CCCCAGGTCC CGCATATCGG TCAATCCGAA CAACCACACG
GTCGACACGC AGGAGCCTTA CAAATTCGCC GAGAAGAAGG CGAAAATCCT ATCCTGGTCG
GCGGACTTCC TCTCCTTCAG CATGAACCCG CCGAACCATT ACGGCATGGT CGTGTTCGAC
GAAGGCGGCC GGATCATGAT GGATTTCACG GATGTCGAAG CCGGCACTGT CGATTCCGGC
ATGGAAGTGA AGCTCGTCTT CCGGATCAAG GAATTCGACG ACAAGCGCGG TTTCCGCCGA
TATTTCTGGA AGGCAGTTCC TGTCGCGGCG AGCACAGCAA AATCACAAAC GGGCCAGGCG
GCCGAATAG
 
Protein sequence
MAQQTVGLTA FGGYIPRLRL QRKAVAQANA WVAPNFLGKG KGERSMANWD EDALTMAVEA 
ARDLLGPDDD RSHVDALYLG STTLPFKDRL NSGIVAAALT LDERVRAVDV ASTQRAGTSA
LMQALSAVKA GEAKHALVLA SDHRKTKAVT AQELDFGDGA AAVSVGTENV IAAYLGGASL
TVDFVDHFRG DSEEFDYNWE ERWIRDEGFA KIVPRAVKAA LENSGVSAGE IKHFVLPCSF
GDKFVQQLAK RSGIAPEAAR DTLAANCGET GAAHSLVMLV HTLQKEAKPG DKVMVLHFGG
GCDALVFEVT DKLSSQAGKR GIVGSLANRT EETNYLKFLT FNGLIEWEKG MRAEQDKKTA
LTTLYRNEDM LMGLVGGRCT ETGVIQFPRS RISVNPNNHT VDTQEPYKFA EKKAKILSWS
ADFLSFSMNP PNHYGMVVFD EGGRIMMDFT DVEAGTVDSG MEVKLVFRIK EFDDKRGFRR
YFWKAVPVAA STAKSQTGQA AE
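
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the following shows one way to strip that layout, compute a GC fraction, and translate the gene sequence into protein. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon-to-amino-acid mapping used here is the standard one, which translation table 11 shares for internal codons; table 11 differs in which codons may act as start codons, and this sketch does not handle alternative starts (this gene begins with ATG, so that does not matter here).

```python
from itertools import product

# Standard codon table in TCAG order; translation table 11 uses the same
# amino-acid assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_block: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq_block.split()).upper()


def gc_fraction(seq_block: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty input)."""
    seq = clean(seq_block)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq_block: str) -> str:
    """Translate codon by codon from position 1, stopping at the first stop codon."""
    seq = clean(seq_block)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGGCCCAGC AGACGGTGGG ACTTACAGCA TTTGGCGGAT ATATCCCGCG CCTTCGCCTG"
print(translate(first_line))  # MAQQTVGLTAFGGYIPRLRL
```

The translated output matches the first twenty residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.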