Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dde_3539 |
Symbol | |
ID | 3758518 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio desulfuricans subsp. desulfuricans str. G20 |
Kingdom | Bacteria |
Replicon accession | NC_007519 |
Strand | + |
Start bp | 3496915 |
End bp | 3499638 |
Gene Length | 2724 bp |
Protein Length | 907 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637784453 |
Product | aldehyde dehydrogenase, molybdenum-binding subunit apoprotein |
Protein accession | YP_390027 |
Protein GI | 78358578 |
COG category | [C] Energy production and conversion |
COG ID | [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs [COG2080] Aerobic-type carbon monoxide dehydrogenase, small subunit CoxS/CutS homologs |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.232426 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCAAAC GTTCGCTCAT GATCAACGGA GCGCCCCGGA TGGCCATTGC CAAGCCGGAC ACAACGCTTG CAACGTACCT GCGCGAAAGC CTGGGGCTTA CCAGTGTCAA GATCGGTTGC GGAACCGGTC AGTGCGGCAG CTGCAGCGTC ATTGTCGACG GCAAGGTTGT CCGCTCGTGC AGCTACAAGA TGAGCCGCCT TGCCGACGGC GCGGTCATCA CCACGCTGGA AGGCATAGGC ACTCCCGACC ACCTGCACCC CCTGCAGGTG GCATGGCTGG CCCACGGCGC CGCACAGTGC GGTTTCTGCT CGCCGGGATT CATCGTTTCC GCCAAGGCTC TGCTTGATCA GAACACCTCG CCCACCCGCG AGGATGTACG CGACTGGTTC CAGAAACATC GCAACGCCTG CCGCTGCACC GGATACAAGC CGCTGGTGGA CGCCGTGATG GATGCCGCAG CTGTGCTGCG CGGCGACATG AAAGTCAACG AGCTGCTGTT CAAAATTCCC GAAGACGGAC GCATCTGGGG CACCAAATAC CCCCGTCCTT CCGGCATTGC CAAGGTTACC GGCACGCTGG ACTACGGCGC CGATCTGGGC ATAAAAATGC CCGAAGGCAC CCTGCGCTGC GCTCTGGTGC AGGCCGAAGT CTCGCACGCA CGCATTCTCG GCATCGACAC CTCCGAAGCG GAAACCATGC CCGGCGTCGC CAAGGTTGTG ACCCACAAAG ACATCAAGGG CAAAAACCGC ATCACCGGTC TCATCACCTT TCCCACCAAC AAAGGTGACG GCTGGGACCG CCCCATCCTC TGCGATGACA AGGTATACCA GTACGGCGAC GCGATAGCCA TCGTATGCGC CGACACCGAA GAACACGCCA AGGCTGCTGC CGCCAAGGTC AAGGTGGATC TGGAAGTCCT GCCCGCCTAC ATGAGCGCTC CGGAAGCCAT GGCCGAAGAC GCCATGGAAA TTCATCCCGG CACCCCCAAC GTGTACTTTG AACAGAAAAT CGCCAAGGGC GAGGAAACAG CATCCTACTT TGAAAAAGCC GATGTGGTGG CCGAAGGCGA CTTCTATGTG GGCCGTCAGC CCCATATGCC CATCGAGCCC GACGTGGGTT TTGCCTTCTT CAACGAAGAC GGCAAGCTGT GCATCCATTC CAAGTCCATC GGCCTGCACC TGCACCTGTA TATGATCGCT CCCGGTCTGG GCATTGAACC CGAAAACGTC ATCATGGTGC AGAACCCCGC AGGCGGCACC TTCGGCTACA AGTTCAGCCC CACCATGGAA GCGCTGGTCG GCGCGGCATG CATGGCCACC GGCAAGCCCG TATACCTTAA CTACACATGG TACCAGCAAC AGACCTACAC CGGCAAACGC TCGCCTTTCT TCATGAATGT GCGTTTTGCT GCCGACAAGC AGGGCAAGCT GCTTGCCATG GAAAGCGACT GGTCGGTGGA CCACGGTCCC TATTCCGAAT TCGGCGACCT GCTGACCCTG CGCGGCGCAC AGTTCATCGG TGCCGGCTAC GACATTCCCA ACATCCGCGG CGAAGGCCGC ACCGTCTGCA CCAACCACGC ATGGGGTTCG GCATTCCGCG GCTACGGTTC GCCCCAGAGT GAGTTCGCCT CTGAAGTGCT TATGGACATG CTGGCCGAAA AACTGGGCAT GGATCCGCTG GAACTGCGCT ACAAAAACGC ATACCGCCCC GGAGCCACCA CGCCCACCGG CCAGAATCCC GAAGTATTCA GCCTGCCCGA CATGCTTGAC ACCGCGCGGC CCAAGTACAA GGCCGCACTG GAAAAAGCAG CCAAAAACAG CACCGCCGAA GTAAAACGCG GTGTGGGTAT ATCGCTGGGC GTTTACGGCT GCGGTCTTGA CGGCCCCGAC ACCGCCGAAG TCGATGCAGA ACTGCATCCC GACGGCACCG TCAGCATCTA CACCACATGG CACGACCACG GACAGGGTGC CGACATCGGC GCGCTGACCA CCGCCCACGA GGCGCTGCGC CCGCTGGGCA TCGCTCCTGA AAACATCAAG CTGGTGCTCA ACGATACCGA ATTCTGCCCC AACGGCGGCC CCGCAGGCGG CAGCCGCTCG CAGGTTGTGG TGGGACGCGG CATAAAAGCG GCATGTGAGC TGCTGCTCGG CGGCATGCGC AAGAGCGACG GCACCTACCG CACCTATCAG GAAATGGTCG ATGAAAAGAT AGCCACCAAG TACAACGGTA AATGGACAGC CCCCTGCACC GACTGCGACA CCAACGGTCA GGGCAACCCC TTTGCCGTGT ACATGTACGG CATCTTCATG GCCGAAGTGG CCGTGGAAGT GGCCACCGGC AAGACCACCG TGGAAAAAAT GACCATGATC GCCGACGTGG GCAAAATCGT GAACAAGCTG GCCGTGGACG GACAGATGTA CGGCGGTGTT GCTCAGGCCA TCGGTCTGGC CCTCACTGAA GACTTCGAAG ACATCAAAAA GCACTCCACC ATGAAGGGTG CGGGCTTCCC CTACATCAAG CAGGTGCCCG ATGAAATCGA ACTGATCTAC CACGAATCGG ACCGGCCTGA AGGGCCCTTC GGCGCAGCGG GTGTGGGCGA ACTGCCGCTC ACCTGCCCCC ATGCAGCCGT GATCAACGCC ATATACAACG CATGCGGCGT ACGCATCACC CGACTGCCGG CCCTGCCGGA AAAGGTGCTT GCAGGCCTGC AGGCCAAGGC ATAG
|
Protein sequence | MIKRSLMING APRMAIAKPD TTLATYLRES LGLTSVKIGC GTGQCGSCSV IVDGKVVRSC SYKMSRLADG AVITTLEGIG TPDHLHPLQV AWLAHGAAQC GFCSPGFIVS AKALLDQNTS PTREDVRDWF QKHRNACRCT GYKPLVDAVM DAAAVLRGDM KVNELLFKIP EDGRIWGTKY PRPSGIAKVT GTLDYGADLG IKMPEGTLRC ALVQAEVSHA RILGIDTSEA ETMPGVAKVV THKDIKGKNR ITGLITFPTN KGDGWDRPIL CDDKVYQYGD AIAIVCADTE EHAKAAAAKV KVDLEVLPAY MSAPEAMAED AMEIHPGTPN VYFEQKIAKG EETASYFEKA DVVAEGDFYV GRQPHMPIEP DVGFAFFNED GKLCIHSKSI GLHLHLYMIA PGLGIEPENV IMVQNPAGGT FGYKFSPTME ALVGAACMAT GKPVYLNYTW YQQQTYTGKR SPFFMNVRFA ADKQGKLLAM ESDWSVDHGP YSEFGDLLTL RGAQFIGAGY DIPNIRGEGR TVCTNHAWGS AFRGYGSPQS EFASEVLMDM LAEKLGMDPL ELRYKNAYRP GATTPTGQNP EVFSLPDMLD TARPKYKAAL EKAAKNSTAE VKRGVGISLG VYGCGLDGPD TAEVDAELHP DGTVSIYTTW HDHGQGADIG ALTTAHEALR PLGIAPENIK LVLNDTEFCP NGGPAGGSRS QVVVGRGIKA ACELLLGGMR KSDGTYRTYQ EMVDEKIATK YNGKWTAPCT DCDTNGQGNP FAVYMYGIFM AEVAVEVATG KTTVEKMTMI ADVGKIVNKL AVDGQMYGGV AQAIGLALTE DFEDIKKHST MKGAGFPYIK QVPDEIELIY HESDRPEGPF GAAGVGELPL TCPHAAVINA IYNACGVRIT RLPALPEKVL AGLQAKA
|
| |