Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plav_0092 |
Symbol | |
ID | 5456355 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Parvibaculum lavamentivorans DS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009719 |
Strand | + |
Start bp | 94039 |
End bp | 95442 |
Gene Length | 1404 bp |
Protein Length | 467 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640875651 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001411372 |
Protein GI | 154250548 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.204667 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 0.62822 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGAAT TCAAGCTCCT CATCGGCGGC AAACTGGTCG CCGGCGCCAA ATCGATGGAC GTCATCAATC CGGCCACGGA GGAAGTCGTG GCCTCTTGCC CTCGCGCTTG CGAGGCGCAG TTGAACGAGG CGGTTGCCGC CGCCAAATCC GCCTTCCCCG GCTGGAGCGC AACGCCCATC GCCGAGCGCA AGAAGGTGCT GAACGCCATC GCCGACGCGA TCGAGGCGAA CGCCGCCGAT CTCGCGCGGC TCCTCACCCA GGAACAGGGC AAGCCGATCG GCGACGCCAC GGGCGAGGTC TATGGCACCG CCGCCTTCTT CCGCTATTTC ACCATGCTCG ACATGCCGGT GAAATTGATC GAGGACAGCG AGGGCAAGCG CGTCGAGGCG CATCGCCGTC CGCTCGGCGT CATCGGCGCC ATCGTGCCGT GGAACTTCCC GATGATCCTC ATGGCCTTCA AGCTGCCGCC GGCGCTGCTT GCCGGTAACA CGGTGGTGCT GAAGCCCGCG CCGACAACGC CGCTCACCTC GCTGAAACTC GGCGAGCTGA TCAAGGACAT CGTCCCCGCC GGCGTGGTGA ACATCATCGC CGATGCAAAC GACCTCGGCG CCGCCCTCAC CGCGCATCCG GACGTCCGCA AGATTTCCTT CACCGGCTCC ACGGCGACCG GCGCCAAGGT CATGGCGGGC GCCGCCGGAC TTCTCAAGCG CATCACCCTC GAACTCGGCG GCAACGACGC CGGCATCGTG CTCGACGATG TGAACCCGAA GGAAGCCGCG CCCAAGCTCT TCCAGAGCGC CTTCCAGAAT TCCGGCCAGG TCTGCATCGC CATGAAGCGG CTCTATGTGC ATGAAAAGAT CTACGACGAA ATCTGCGACG AGCTCGCCGC CATCGCCAAC AACACCATTG TCGGCGATGG GCTGAAGCAG GGCACGCAGC TCGGGCCGCT GCAAAACAAG ATGCAGTTCG ACAAGGTGAA GGAACTGATC GAGGATTCGA AAAAACACGG CAAGATCATC GCCGGCGGCG ATACGCCGGA AGACAAGGGC TATTTCATCC GCCCCACCAT CGTCCGCGAC ATAACGGATG GCGCCCGCCT CGTCGACGAG GAACAATTCG GCCCCGTCCT CCCCGTCATC AAATATTCCG ACGCCGACGA CGCCGTCGCC CGCGCCAACG CCTCGCCCTA TGGCCTCGGC GGCTCCATCT GGTCGTCCGA TCCCGCTCGC GCCTACGCGC TGGCCGAGAA GCTCGACAGC GGCACCGTCT GGATCAACAA ACACGCCGAC CTCGCCCCCA ACATCCCCTT CGGCGGCGCC CGCATGTCCG GCCTCGGCAC AGAACTCGGC GAAGAAGGCT TGGCGGAATT CACGCAGTTG AAGATCGTCA ACATGGCAAA ATGA
|
Protein sequence | MSEFKLLIGG KLVAGAKSMD VINPATEEVV ASCPRACEAQ LNEAVAAAKS AFPGWSATPI AERKKVLNAI ADAIEANAAD LARLLTQEQG KPIGDATGEV YGTAAFFRYF TMLDMPVKLI EDSEGKRVEA HRRPLGVIGA IVPWNFPMIL MAFKLPPALL AGNTVVLKPA PTTPLTSLKL GELIKDIVPA GVVNIIADAN DLGAALTAHP DVRKISFTGS TATGAKVMAG AAGLLKRITL ELGGNDAGIV LDDVNPKEAA PKLFQSAFQN SGQVCIAMKR LYVHEKIYDE ICDELAAIAN NTIVGDGLKQ GTQLGPLQNK MQFDKVKELI EDSKKHGKII AGGDTPEDKG YFIRPTIVRD ITDGARLVDE EQFGPVLPVI KYSDADDAVA RANASPYGLG GSIWSSDPAR AYALAEKLDS GTVWINKHAD LAPNIPFGGA RMSGLGTELG EEGLAEFTQL KIVNMAK
|
| |