Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dde_2965 |
Symbol | |
ID | 3757970 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio desulfuricans subsp. desulfuricans str. G20 |
Kingdom | Bacteria |
Replicon accession | NC_007519 |
Strand | + |
Start bp | 2948110 |
End bp | 2949390 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637783866 |
Product | dihydroorotase |
Protein accession | YP_389454 |
Protein GI | 78358005 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0044] Dihydroorotase and related cyclic amidohydrolases |
TIGRFAM ID | [TIGR00857] dihydroorotase, multifunctional complex type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.322465 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGATC ATATTTCCGG CACACTGTTT GTTTCGGGCG CGCTTCTTTC CGGGCGCACG GTTGATGTCA CCGTATCGGG CGGACGCATT GCCGCCGTCA GCGAACACGG GGCGGCCCCC GCGCCGCAGC ACGCCGAAAC TGTGGAAGCC GCGGGCAAGA TACTGTTTCC CAGTTTTATT GACTGCCACG TGCACCTGCG CGAACCCGGT TTTGAGTACA AGGAAGACAT TGCCTCCGGA CTCGCAGCCG CTGCGCACGG CGGATTCGGG GCGGTACTGC CCATGGCCAA TACCAGTCCC GTCAACGATC AGGGCAGTGT GACCGAGCTG ATGCTGGAAC GCGCCCGCAA GGCATGGCCC CACGGCCCGC GCGTGCATCC TGTGGGTGCC GCCACCAAAG GACTGAAAGG CGAAGAACTG GCCCCCATGG GCGAACTGGC CGCCGCGGGC TGCGTCGCCT TTTCCAACGA CGGACTGCCG GTGGGCGGTG CCGAAATGTT CCGCCGCTGC ATGGAATACG CCGCAGACTT CGGTAAAATA GTCATCGATC ACTGCGAAGA TCCTTCACTG GCGCGCGGCA CACACATGAA CGAAGGGGTG ACCAGCGGAC GCCTGGGAGT CAAAGGACAG TCGGTGGTGG CCGAATCGGT ACAGGTGGCC CGCGACATTC TGCTGGCCGA ATATCTGGGC ATTCCGGTCC ATCTGGCACA TATCAGCTGC CGTCAGTCCG TGGAGCTTAT CGCGTGGGCC AAGCAGCGCG GCGTAAGAGT AACGGCCGAA ACCTGCCCGC ACTATCTGCT GCTGGACGAT CTGGCACTGG AACAGTACTC CACAGCAGCC AAGGTCAACC CGCCGCTGCG CACCCCCGAC GACGTGGCGG CCATGCGCCG TGCCGTGGCT GACGGCACCA TAGACATACT GGTGACAGAC CACGCCCCGC ATGCCGCGCA CGAAAAGGAT ACCCCGCTGG ATGAAGCGCC CAACGGCATT TCCGGTCTGG ATACGGCGGT GGCTCTGACA TGGCGTCTGG TGCAGGAAGG CCTGCTGACC GAAGCCGACA TGGTGCGCCT GTGGTGCCAT GCTCCGGGCA GCCTGTTCCG TCTGCCGGTA AACCGCTTCA CGGCGGGCGA CCCTGCAGAT TTTTTCCTGT TCGACCCTGC GCATGAATGG ACTGTGACCC CTGAGGCGAT GCATTCAAAA GGCAAAAACA CGCCCTTTAC CGGGTGGAAG CTTACCGGTA AGGTGACATC CCACTGGATG GGCGGTCACA GAATAGCATG A
|
Protein sequence | MSDHISGTLF VSGALLSGRT VDVTVSGGRI AAVSEHGAAP APQHAETVEA AGKILFPSFI DCHVHLREPG FEYKEDIASG LAAAAHGGFG AVLPMANTSP VNDQGSVTEL MLERARKAWP HGPRVHPVGA ATKGLKGEEL APMGELAAAG CVAFSNDGLP VGGAEMFRRC MEYAADFGKI VIDHCEDPSL ARGTHMNEGV TSGRLGVKGQ SVVAESVQVA RDILLAEYLG IPVHLAHISC RQSVELIAWA KQRGVRVTAE TCPHYLLLDD LALEQYSTAA KVNPPLRTPD DVAAMRRAVA DGTIDILVTD HAPHAAHEKD TPLDEAPNGI SGLDTAVALT WRLVQEGLLT EADMVRLWCH APGSLFRLPV NRFTAGDPAD FFLFDPAHEW TVTPEAMHSK GKNTPFTGWK LTGKVTSHWM GGHRIA
|
| |