Gene Information |
Locus tag | Dvul_1133 |
Symbol | |
ID | 4662037 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris DP4 |
Kingdom | Bacteria |
Replicon accession | NC_008751 |
Strand | - |
Start bp | 1378603 |
End bp | 1380492 |
Gene Length | 1890 bp |
Protein Length | 629 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639819362 |
Product | carbon-monoxide dehydrogenase, catalytic subunit |
Protein accession | YP_966580 |
Protein GI | 120602180 |
COG category | [C] Energy production and conversion |
COG ID | [COG1151] 6Fe-6S prismane cluster-containing protein |
TIGRFAM ID | [TIGR01702] carbon-monoxide dehydrogenase, catalytic subunit |
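The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. Neither assumption is stated in the record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1378603, 1380492)
print(length)                  # 1890
print(protein_length(length))  # 629
```

Both results agree with the Gene Length (1890 bp) and Protein Length (629 aa) fields.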
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.394682 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.139391 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTCTT CCAAGACAAT CCGTAGCCGT TCGATTTGGG ATGACGCACA CGCCATGCTC GAAAAGGCGA AGGCCGAGGG TATCAGCACC GTCTGGGATC GAGCTGCCGA ACAGACGCCG TCCTGTAAAT TCTGCGAATT GGGCACCACC TGCCGCAACT GCATCATGGG CCCGTGCCGT ATCGCCAACC GCAAGGACGG CAAGATGCGC CTTGGCGTGT GCGGTGCCGA TGCCGATGTC ATCGTGGCGC GCAATTTCGG CCGTTTCATC GCCGGCGGCG CGGCGGGGCA CTCCGACCAC GGACGCGACC TGATCGAAAC GCTCGAAGCC GTAGCCGAGG GCAAGGCTCC CGGCTATACG ATCCGCGACG TTGCAAAACT CAGGCGCATC GCGGCCGAAC TCGGCGTCGT CGACGCGGCG ACACGCCCCG CCCATGACGT GGCCGCCGAC CTCGTCACCA TCTGCTACAA CGACTTCGGC AGCCGCCGTA ATGCGCTGGC CTTTCTTGCG CGTGCGCCGC AGGTGCGGCG CGACCTCTGG CAACGCCTTG GCATGACCCC CCGTGGCGTA GACCGCGAGA TTGCCGAAAT GATGCACCGC ACCCATATGG GCTGCGACAA CGACCACACA AGCCTGCTCG TCCATGCCGC GCGGACGGCG CTCGCCGATG GATGGGGCGG TTCCATGATC GGCACGGAAT TGTCCGACAT CCTCTTCGGC ACGCCTCGCC CTCGCCAGTC CACAGTCAAT CTCGGGGTCT TGCGCAAGGA TGCCGTCAAC ATCCTCGTGC ACGGACACAA CCCCGTCGTC TCCGAAATGA TTCTGGCCGC CACCCGTGAA CCGGCCGTAA GGCAGGCGGC ACAGGACGCC GGGGCAGCAG ACATCAACGT GGCGGGGCTA TGCTGCACGG GTAACGAACT GCTCATGCGA CAGGGCATTC CCATGGCGGG CAACCACCTC ATGACCGAAC TCGCCATTGT CACAGGTGCG GCCGATGCCA TTGTCGCAGA CTATCAGTGT ATCATGCCCA GCCTTGTGCA GATTGCCGCG TGCTACCACA CCCGCTTCGT GACGACGTCT CCCAAGGGGC GTTTCACGGG GGCCACTCAT GTGGAAGTGC ACCCGCACAA TGCGCAAGAG AGGTGCCGCG AAATCGTGAT GCTCGCCATC GATGCCTACA CCAGACGAGA CCCCGCCCGG GTCGACATCC CGTCGCAACC CGTGTCCATC ATGTCCGGGT TCTCCAACGA GGCCATCCTC GAGGCCCTTG GCGGCACTCC CAAGCCCCTC ATCGACGCTG TTGTGGCAGG GCAGATACGG GGATTTGTGG GCATCGTGGG CTGCAACAAC CCAAAGATAC GTCAGGATTC AGCCAATGTG ACGCTCACGC GGGAACTGAT ACGCCGCGAC ATCATGGTGC TTGCCACAGG ATGCGTCACG ACGGCTGCTG GCAAGGCCGG ACTGCTGGTC CCGGAAGCCG CATCGAAAGC CGGCGAGGGG CTTGCCGCCG TGTGCCGCAG TCTTGGCGTG CCTCCCGTGC TGCACATGGG CAGCTGCGTG GACAACTCCC GCATCCTCCA GTTGTGCGCC CTGCTGGCAA CCACGCTGGG CGTTGACATA AGCGACCTGC CCGTGGGGGC CTCGTCGCCC GAGTGGTATT CCGAGAAGGC AGCGGCCATT GCCATGTACG CCGTGGCAAG CGGCATTCCC ACGCATCTTG GTCTTCCCCC CAACATCCTC GGCAGCGAGA ACGTCACCGC CATGGCCCTG CATGGCCTAC AGGATGTGGT AGGCGCGGCC 
TTCATGGTTG AACCGGACCC CGTCAAGGCC GCGGACATGC TCGAGGCGCA TATCGTGGCA CGCCGCGCAA GGCTTGGTCT CACATCCTAG
|
Protein sequence | MSSSKTIRSR SIWDDAHAML EKAKAEGIST VWDRAAEQTP SCKFCELGTT CRNCIMGPCR IANRKDGKMR LGVCGADADV IVARNFGRFI AGGAAGHSDH GRDLIETLEA VAEGKAPGYT IRDVAKLRRI AAELGVVDAA TRPAHDVAAD LVTICYNDFG SRRNALAFLA RAPQVRRDLW QRLGMTPRGV DREIAEMMHR THMGCDNDHT SLLVHAARTA LADGWGGSMI GTELSDILFG TPRPRQSTVN LGVLRKDAVN ILVHGHNPVV SEMILAATRE PAVRQAAQDA GAADINVAGL CCTGNELLMR QGIPMAGNHL MTELAIVTGA ADAIVADYQC IMPSLVQIAA CYHTRFVTTS PKGRFTGATH VEVHPHNAQE RCREIVMLAI DAYTRRDPAR VDIPSQPVSI MSGFSNEAIL EALGGTPKPL IDAVVAGQIR GFVGIVGCNN PKIRQDSANV TLTRELIRRD IMVLATGCVT TAAGKAGLLV PEAASKAGEG LAAVCRSLGV PPVLHMGSCV DNSRILQLCA LLATTLGVDI SDLPVGASSP EWYSEKAAAI AMYAVASGIP THLGLPPNIL GSENVTAMAL HGLQDVVGAA FMVEPDPVKA ADMLEAHIVA RRARLGLTS
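The gene and protein sequences above are shown in space-separated blocks of 10 characters. The sketch below shows one way to strip that spacing, compute GC content, and translate the coding sequence. It assumes the gene sequence is given in coding orientation (it begins with ATG and ends with TAG) and uses the standard codon assignments, which translation table 11 shares with the standard code. Alternative start codons permitted by table 11 are not handled. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
fragment = "ATGAGTTCTT CCAAGACAAT CCGTAGCCGT"
print(translate(fragment))  # MSSSKTIRSR
```

The fragment translates to the first ten residues of the listed protein sequence. Applied to the full gene sequence, the same functions should give a 629-residue protein and a GC content close to the 64% reported, provided the sequence is copied intact.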