Gene Plav_1024 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_1024 
Symbol 
ID5454124 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp1122701 
End bp1124116 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content59% 
IMG OID640876594 
Productaldehyde dehydrogenase 
Protein accessionYP_001412303 
Protein GI154251479 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.736803 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTTCC ACGATGACTA TGTGATGACC ATCGACGGTA CGGCTGTGGC GAGTGAAGCG 
ACGATCGATG TGGTCAATCC CGCGACTGGC AAGCCCTTCG CTTTCGCTCC CGACTGTTCG
AAAGCACAGC TGGATGCGGC GGTGGCCGCC GCCGGCAGCG CCTTCAAAAA CTGGCGGCGC
ACGCCGATTG CGGAGCGTCA GGCCATGGTG GCGAAGGCCG GCGACCTGCT GGTTGCCCAT
GCCGATGAAA TGGCCCGCCT TTTCACGCGC GAGCAAGGTC GGCCCGTAGA GCTGGCAAAG
CGGGAGATTG TGGGCGCGGG AATGTGGATG ACGGCGGTTG CCGAGATGAC CCCACCCGTG
CACGTGTCCG AAGACAGCGA CAAGCAATTC ATCGAAACCC GCTATGTGCC GCTCGGCGTG
ATCTGTGCCC TTGCGCCGTG GAATTTTCCG GTCAACCTCG CCATGTGGAA GGTCGCTCCT
GCGCTGGTTG CAGGCAATAC CATGGTACTG AAGCCCAGTC CCTTCACCCC GCTATGCACT
CTGAAGATCG GTGAGCTTTT TGCCGACGTT TTTCCCGCCG GTGTGTTCAA CGTAATCAGC
GGCGGCGATG AGCTTGGTCC GATGATGACC CGCCACCCCG GCTTCGCCAA GATCAGCTTC
ACTGGCTCAA CCGCCACGGG CAAGCGGGTG ATGGAAAGCG CAGCCAGGGA TCTGAAGCGC
GTGACGCTGG AATTGGGTGG CAACGACGCT GCAATTGTAC TTCCCGACGT GGATCTGGAT
GCTGTGGCCC AGAACATATT TCTCGGTGCT TTCCTAAACA CGTCCCAGAT ATGCGTAGCA
ACCAAGCGGC TCTACGTGCA CGAAGACATA TATGACGGGC TGCGGGATCG ACTGGTCGCT
ATTGCCCGTA CAACCAAGGT GGGCGACGGT GCCGAGCAGG GTACAGTGCT GGGGCCGATC
CAGAACAAGC GTCAGTACGA TCGTGTAGTT GCATTGTTGG AAGACGCAAA AGCCAACCGA
CTGACTCTGA TCCACGGCGC AGCTATTCCC GAAAGCGATG GATATTTTGT CCCTGTCACC
ATCGTAGACA ACCCACCGGA AGATTCCCGC GTTGTGCAGG AGGAGGCGTT CGGCCCAATC
TTGCCGATGC TGAAATTCTC CGACATCGAC GATGTGATTG ATCGAGCCAA CGCCAGCGAA
TATGGCCTCG GCGGGCAGGT GTGGTCTGCA GATACGGACA AGGCCATTGA GATTGCACGG
CGCCTGGAAA CGGGAACGGT CTGGGTGAAT CAAATGCTCA ATCTGCGCGC CGATACTCCC
TTCGGCGGAC ATAAGCAGAG CGGCTTTGGT GTCGAGAACG GTATGGAGGG CCTACTTGAA
TATATGGTGC CCCAAGCGGT TTACGTGGCC CGGTAG
 
Protein sequence
MNFHDDYVMT IDGTAVASEA TIDVVNPATG KPFAFAPDCS KAQLDAAVAA AGSAFKNWRR 
TPIAERQAMV AKAGDLLVAH ADEMARLFTR EQGRPVELAK REIVGAGMWM TAVAEMTPPV
HVSEDSDKQF IETRYVPLGV ICALAPWNFP VNLAMWKVAP ALVAGNTMVL KPSPFTPLCT
LKIGELFADV FPAGVFNVIS GGDELGPMMT RHPGFAKISF TGSTATGKRV MESAARDLKR
VTLELGGNDA AIVLPDVDLD AVAQNIFLGA FLNTSQICVA TKRLYVHEDI YDGLRDRLVA
IARTTKVGDG AEQGTVLGPI QNKRQYDRVV ALLEDAKANR LTLIHGAAIP ESDGYFVPVT
IVDNPPEDSR VVQEEAFGPI LPMLKFSDID DVIDRANASE YGLGGQVWSA DTDKAIEIAR
RLETGTVWVN QMLNLRADTP FGGHKQSGFG VENGMEGLLE YMVPQAVYVA R