Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | DvMF_0641 |
Symbol | |
ID | 7172528 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris str. 'Miyazaki F' |
Kingdom | Bacteria |
Replicon accession | NC_011769 |
Strand | - |
Start bp | 766428 |
End bp | 769145 |
Gene Length | 2718 bp |
Protein Length | 905 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643539141 |
Product | aldehyde oxidase and xanthine dehydrogenase molybdopterin binding |
Protein accession | YP_002435066 |
Protein GI | 218885745 |
COG category | [C] Energy production and conversion |
COG ID | [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs [COG2080] Aerobic-type carbon monoxide dehydrogenase, small subunit CoxS/CutS homologs |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 65 |
Fosmid unclonability p-value | 0.135402 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCAACA AGCATTTCAT CGTCAACGGC ATACCCCGCA ACCTTGTCGT CGATCCGGAA GCGACGCTGG CGGACGTGCT GCGCGAACAG CTGCTGCTCA CCGGCGTCAA GGTGGGCTGC GGCGAAGGCC AGTGCGGCGC GTGCAGCGTC ATCCTTGACG GCAAGGTGGT CCGCTCCTGC GCGTACAAGA TGCGTCGCCT GCCCGACGGG GCATCGGTGA CCACCATCGA AGGCGTCGGC AGCCCCGACT GCCTGCACCC GCTGCAACTG GCGTGGACGG CCCACGGCGG CGCGCAGTGC GGCTTCTGCA CGCCGGGCTT CATCGTTTCG GCCAGGCAGT TGCTGGAAGA AAACAAGAGC CCCAGCCGCG ACGATGTGCG CGACTGGTTC CAGAAACACC GCAACGTCTG CCGCTGCACC GGCTACAAGC CTCTGGTCGA TGCGGTGATG GACGCCGCCA AGGTGCTGCG CGGCGAGATG AGCGCCGACG ACCTCTGCTT CAAGATTCCC GCCGATGGCC GCATCTGGGG ATCGAAGTAC CCCCGCCCCT CCGCCATTGC CAAGGTAACG GGCACCTGCG ACTACGGTGC GGACCTGGGC CTGAAGCTGC CCTCCGATGC GCTGCACCTT GCGTTGGTGC AGGCGGAAGT GTCGCATGCG AAGATCCTGT CCATCGACAC CTCCGAAGCG ATGAAGATGC CCGGCGTGCA CAGCGTGCTC ACCCACAAGG ACGTCAAGGG CAAGAACCGC ATCACCGGCC TGATCACCTT CCCCTCCAAC AAGGGCGACG GCTGGGACCG GCCCATCCTG TGCGACGAGA AGGTGTTCCA GTACGGTGAC GCCCTGGCCA TCGTCTGCGC CGACAGCGAA AAGCACGCCC GCGCCGCCGC CGAAAAGGTG AAGGTGCAGC TGGAAGAACT GCCCGCCTAC ATGAGCGCCC CGGCGGCCAT GGCCGAAGAC GCCATGGAAA TCCACCCCGG CACGCCCAAC GTGTACTTCA TCCAGAACAT CGCCAAGGGA GCCGACACCA GGCCCATCTT CGACGGGGCC GACGTGGTGG TGGAAGGCGA CTACTACGTG GGCCGCCAGC CGCACCTGCC CATCGAGCCG GACGTGGGCT TTGCCTACAC CGACGGCGAA GGGCGCCTGG TCATCCACTC CAAGTCCATC GGCCTGCACC TGCACCTGTA CATGATCGCG CCCGGCCTTG GCCTTGAGCC GGAAAAGATC GTCATGGTGC AGAACCCGGC GGGCGGCACC TTCGGCTACA AGTTCAGCCC CACCATGGAA GCCCTGGTGG GCGTGGCGGC CATGGCCACC GGACGTCCGG TGCACCTGCG CTACAACTAC CGCCAGCAGC AGAACTACAC CGGCAAGCGT TCGCCGTTCT TCTTCAACGT GCGCTACGCC GCCGACAAGA CCGGCAAGAT CAAGGCGATG GAAACCGACT GGACCGTGGA CCACGGCCCG TACTCCGAGT TCGGCGACCT TTTGACCCTG CGCGGCGCGC AGTTCGCCGG GGCGGGCTAC GGCATCGCCA ACATCCGGGG CCAGGGCCGC ACGGTGTGCA CCAACCACGC CTGGGGTTCG GCCTTCCGCG GCTACGGCGC GCCGGAAAGC GAGTTTCCGT CAGAAGTGCT GATGGATGAA CTGGCCGAAA AGCTGGGCAT GGACCCGCTG GAGCTGCGCT ACCTGAACGT CTACCGCAAG GGCGACACCA ACCCCACCGG GCAGGACCCG GAAGTGTACA GCCTGCCGGA AATGATCGAC ATCCTGCGGC CCAAGTACAA GGCGGCGCTG GATGCGGCCA GGGCCGGTTC CACGGCGGCG GTCAAGAAGG GTGTGGGCGT TGCCGTGGGC GTGTACGGCT GCGGCCTTGA CGGGCCGGAC ACCTCGGAAT GCGATGCCGA ACTGAACCCC GACGGCACCG TGACCATCTA CAACTGCTGG GAAGACCACG GCCAGGGCGC GGACATGGGC ACCCTGGGCA CCGCCCACGA GGCGCTGCGT CCGCTGGGCA TCGCCCCGCA GAACATCCGC CTGGTGCTCA ACGACACCAG CAAGGCTCCC AACAGCGGCC CGGCGGGCGG CAGCCGGTCG CAGTACGTCA CCGGCAACGC CGTGCGCGTG GCCTGCGAAA ACCTGGTGGA AGCCATGCGC AAGCCCGGCG GCTTCCGCAC CTACCAGGAA ATGGTGGACG AAAAGATCGC CACCAAGCAT CGCGGCGCCT GGACGGCGGC GGGCACCCAC TGCGACGAAA ACGCGCAGGG CAAGCCCTTC ACCGCGTACA TGTACGGCGT GTTCATGGCC GAAGTGGCCG TGGAGGTCGC CACCGGCAAG ACCACCGTGG ACAAGCTGAC CATGGTGGCC GACATCGGCA AGATCAACAA CAAGCTGCTG GTTGACGGCC AGCTGTACGG CGGGTTGGCC CAGGGCATCG GCCTGGCCCT GTCCGAAGAC TACGAGGACC TCAAGAAGCA CGCCACAATG GCGGGAGCGG GCGTTCCGTA CATCAAGGAC ATTCCCGACA ACATCGAACT GATCTACGTG GAAACCCCGC GCGGCGACGG CCCGTTCGGC GCGTCGGGCG TGGGTGAACT GCCGCTCACC GTGCCGCATG CGGCCATCAT CAACGGCATC TACAAGGCGT GCGGCGTGCG CATCCGTCAC CTGCCCGCCC TGCCGGAAAA GGTGCTGGCG GGCCTGAAGG GCGCGTAA
|
Protein sequence | MINKHFIVNG IPRNLVVDPE ATLADVLREQ LLLTGVKVGC GEGQCGACSV ILDGKVVRSC AYKMRRLPDG ASVTTIEGVG SPDCLHPLQL AWTAHGGAQC GFCTPGFIVS ARQLLEENKS PSRDDVRDWF QKHRNVCRCT GYKPLVDAVM DAAKVLRGEM SADDLCFKIP ADGRIWGSKY PRPSAIAKVT GTCDYGADLG LKLPSDALHL ALVQAEVSHA KILSIDTSEA MKMPGVHSVL THKDVKGKNR ITGLITFPSN KGDGWDRPIL CDEKVFQYGD ALAIVCADSE KHARAAAEKV KVQLEELPAY MSAPAAMAED AMEIHPGTPN VYFIQNIAKG ADTRPIFDGA DVVVEGDYYV GRQPHLPIEP DVGFAYTDGE GRLVIHSKSI GLHLHLYMIA PGLGLEPEKI VMVQNPAGGT FGYKFSPTME ALVGVAAMAT GRPVHLRYNY RQQQNYTGKR SPFFFNVRYA ADKTGKIKAM ETDWTVDHGP YSEFGDLLTL RGAQFAGAGY GIANIRGQGR TVCTNHAWGS AFRGYGAPES EFPSEVLMDE LAEKLGMDPL ELRYLNVYRK GDTNPTGQDP EVYSLPEMID ILRPKYKAAL DAARAGSTAA VKKGVGVAVG VYGCGLDGPD TSECDAELNP DGTVTIYNCW EDHGQGADMG TLGTAHEALR PLGIAPQNIR LVLNDTSKAP NSGPAGGSRS QYVTGNAVRV ACENLVEAMR KPGGFRTYQE MVDEKIATKH RGAWTAAGTH CDENAQGKPF TAYMYGVFMA EVAVEVATGK TTVDKLTMVA DIGKINNKLL VDGQLYGGLA QGIGLALSED YEDLKKHATM AGAGVPYIKD IPDNIELIYV ETPRGDGPFG ASGVGELPLT VPHAAIINGI YKACGVRIRH LPALPEKVLA GLKGA
|
| |