Gene Information |
Locus tag | RPD_3029 |
Symbol | |
ID | 4023532 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 3374537 |
End bp | 3375838 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637963228 |
Product | dihydroorotase |
Protein accession | YP_570156 |
Protein GI | 91977497 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0044] Dihydroorotase and related cyclic amidohydrolases |
TIGRFAM ID | [TIGR00857] dihydroorotase, multifunctional complex type |
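
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the last codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon (assumed present)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3374537, 3375838)
print(length, protein_length(length))  # 1302 433
```

Both values agree with the Gene Length and Protein Length fields of this record.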
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.678009 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.507033 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGAACG ACCGCCGCCC GATCCTGCTC GCCAATGCTC GCCTGATCGA TCCGGCGCGC GACTTCGACG GCGTCGGCGA CGTGCTGATC GCCGACGGCG TGATCCGCGA CGCCCGCCGC GGCATCGGCG CCGCCGGCGT GCCGGAAGGC ACCGACATCA TCAATTGCGG CGGCATGATC GTGGCCCCCG GCCTGATCGA CATGCGCGCC TTCGTCGGCG AACCCGGCGC CAGCCATCGC GAGACCTTCG CCTCGGCCAG CCAGGCCGCA GCCGCCGGCG GCATCACCAC CATCATCTGC CAGCCGAACA CTTCGCCTGT GATCGACAAT TCGGCGACGG TGGACTTCGT GATGCGCCGC GCCCGCGACA CCGCGATCGT CAACATTCAC CCGATGGCGG CGCTGACCAA GGGTCTCGCC GGCGCGGAGA TGACCGAGAT CGGCCTGTTG AAGGCCGCCG GCGCCGTGGC CTTCAGCGAC GGCGATCGCA GCGTCATGAA CGCGCGGGTG ATGCGCAGCG CGCTGACCTA CGCCCGCGAT TTCGACGCCC TGATCGTTCA CCACACCGAA GACCCCGATC TGGTCGGCGA AGGCGTGATG AACGAGGGTG AATTCGCCAC CCGCCTCGGG CTCTCCGGTA TGCCGAACGC CGCCGAGGCC GTGGTGCTGG AGCGCGACGT CCGCCTCGCC GCACTGACCG GCGGCCGCTA TCACGCCGCG TCGCTGACCT GCATCGAGTC GCTGGAGATT TTGCAGCGCG CGCGAGACAC CGGCATCAAC GTCTCGGCCT CGGTATCGAT CAATCATGTC TCGCTGAATG AGAACGACAT CGGGCCGTAC CGCACGTTCC TCAAGCTGTC GCCGCCGCTG CGCACCGAGA ACGACCGCAA GGCTCTGATC GCCGCCGTCG CTTCGGGTCT CGTCGACGTC ATCATGTCGG ACCACAATCC GCAGGACGTC GAGGTCAAGC GGCTGCCGTT CGCCGAGGCC GCCGCCGGCG CGATCGGCCT GGAGACGATG CTGCCGGCCG GCTTGCGGCT GGTGCACAAT GGCGAGCTGG ACCTGCTGAC CCTGATCCGT GCGATGTCGA CCCGCCCGGC CGAATTGCTC GGCCTGCCCG GCGGCACGCT GCGCGCAGGC TCGCCAGCCG ATCTGATCAT GATCGACATC GACACCCCGT GGGTGGTCGA TCCGAACGAA CTGAAATCGA AGTGCAAGAA TACCCCGTTC GACGAAGCTC GGTTCTCGGG ACGGGTGACG CGGACCATCG TCGGCGGACG CACCGTCTAC GAACATGTGT GA
|
Protein sequence | MLNDRRPILL ANARLIDPAR DFDGVGDVLI ADGVIRDARR GIGAAGVPEG TDIINCGGMI VAPGLIDMRA FVGEPGASHR ETFASASQAA AAGGITTIIC QPNTSPVIDN SATVDFVMRR ARDTAIVNIH PMAALTKGLA GAEMTEIGLL KAAGAVAFSD GDRSVMNARV MRSALTYARD FDALIVHHTE DPDLVGEGVM NEGEFATRLG LSGMPNAAEA VVLERDVRLA ALTGGRYHAA SLTCIESLEI LQRARDTGIN VSASVSINHV SLNENDIGPY RTFLKLSPPL RTENDRKALI AAVASGLVDV IMSDHNPQDV EVKRLPFAEA AAGAIGLETM LPAGLRLVHN GELDLLTLIR AMSTRPAELL GLPGGTLRAG SPADLIMIDI DTPWVVDPNE LKSKCKNTPF DEARFSGRVT RTIVGGRTVY EHV
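
The sequences above are displayed in space-separated blocks of ten characters. The sketch below assumes that this grouping is purely cosmetic, strips it, and computes a GC percentage; the helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, so its result is not expected to match the 68% reported for the full gene.

```python
def clean_sequence(display_text: str) -> str:
    """Remove the display whitespace and normalise to upper case."""
    return "".join(display_text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two ten-base blocks of the gene sequence shown above.
example = clean_sequence("ATGCTGAACG ACCGCCGCCC")
print(len(example), round(gc_percent(example)))  # 20 70
```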