Gene Plav_0166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_0166 
Symbol 
ID5455004 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp183572 
End bp184993 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content59% 
IMG OID640875727 
Productaldehyde dehydrogenase 
Protein accessionYP_001411446 
Protein GI154250622 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGTTT TCGACAGATA TTTTATCGGC GGTGAGTGGG TGGACGCCCC TTCGCGGGCC 
AGACAAAAGC TTTTCAACCC CGCGACCGAT ACGCCTTACG GGGAGATTGC GCTTGGTACG
GCGAAGGACG TGGACGCCGC CGTGAGGGTT GCATCGGAAG CCTTTACCAG CTTCTCCGAA
ACCACGGTTG CCGAGCGGCT GGCCCTGCTC AAACGAATAC TCAACATATA TGAGCGCCGC
CGCCCCGAAA TCGCCGGGGC AATCACCGTC GAGATGGGCG CCCCAAGGAG GCTTTCGCGA
GAGGCACAAG CTGCGGCGGG GTCAGATCAT CTGCGCGCTG CCATAGAAGC CTTGGAAGAG
TTTCAAATTG AACGGCCGCT GAGCGGCGGG GTTCTTCGCA GGGAGCCTGT GGGCGTTTGC
GGGCTCATCA CCCCCTGGAA CTGGCCCATG AACCAGATCG CCGGCAAAGT TGCACCCGCC
ATCGCGGCAG GATGCACGAT GGTACTCAAG GCGAGTGAAC TGACCCCCAT GTCGGCGGTG
CTTTTCGCGG AAATCCTGGA AGAGGCCGGC GTCCCCAAAG GCGTGTTCAA CCTGCTGCAT
GGTGAGGGAA GGACCGTTGG CGCGGCGATA GCCACCCATC CGCTCATCGA TATGGTTTCG
TTCACCGGAT CGACGGCGGC AGGTGTTCAG GTCGCGATCA ACGCCGCGCC GACCGTGAAG
CGCGTGACAC AGGAACTGGG AGGCAAATCG CCGAATATCC TGCTGCGCGA TGCCGATTTT
GACGTCGCTG TCCGCCAGGC GGTCGAAGCC TGCATGGAAA ATTCCGGACA ATCCTGCAAT
GCGCCGACGA GACTTATCGT GCCGTCGGAC CGTCATGACG AAATCGCCGA ACGTGCCGAA
GAGGCCGCGA AGGCACTTCG CGTCGGAGAC CCGGAGAATG AGGAAACGGA TCTCGGTCCC
GTCGTATCTC GCGCTCATCT CAAGCGAGTA AGAAATTATA TCGAGGTCGG ACAGCAGGAA
GGCGCTATTT TAATAGCGGG TGGAGCAGAG CCCATCGATG ATCTACCGCC TGGCTATTTT
GTGAAGCCGA CTGTCTTCGC TCACGTCACA CCGGACATGA CAATTGCCCG GGAGGAAATC
TTCGGGCCGG TGCTTTCGAT TCTCTCATAT GACGATGAGG AAGAAGCCGT GCGGATCGCA
AACGATACGC CATATGGGCT TGCGGCCTAT GTTTCCTCTC AAGACAGAGA ACACGCAATC
GCCATCGCGC GCCGGCTGCG CGCGGGGCAA GTCCATGTCA ATATGACGAT CCCCGGCGCG
GATATGCCTT TTGGCGGCTT TCGTCAGTCC GGCAATGGGC GCGAAGGTGG GATTTTCGGC
ATCGAGGACT TTACCGAGAT CAAGGCGATC GCTCTTCCCT GA
 
Protein sequence
MKVFDRYFIG GEWVDAPSRA RQKLFNPATD TPYGEIALGT AKDVDAAVRV ASEAFTSFSE 
TTVAERLALL KRILNIYERR RPEIAGAITV EMGAPRRLSR EAQAAAGSDH LRAAIEALEE
FQIERPLSGG VLRREPVGVC GLITPWNWPM NQIAGKVAPA IAAGCTMVLK ASELTPMSAV
LFAEILEEAG VPKGVFNLLH GEGRTVGAAI ATHPLIDMVS FTGSTAAGVQ VAINAAPTVK
RVTQELGGKS PNILLRDADF DVAVRQAVEA CMENSGQSCN APTRLIVPSD RHDEIAERAE
EAAKALRVGD PENEETDLGP VVSRAHLKRV RNYIEVGQQE GAILIAGGAE PIDDLPPGYF
VKPTVFAHVT PDMTIAREEI FGPVLSILSY DDEEEAVRIA NDTPYGLAAY VSSQDREHAI
AIARRLRAGQ VHVNMTIPGA DMPFGGFRQS GNGREGGIFG IEDFTEIKAI ALP