Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_2423 |
Symbol | |
ID | 3909557 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 2779053 |
End bp | 2780363 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637884322 |
Product | dihydroorotase |
Protein accession | YP_486039 |
Protein GI | 86749543 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0044] Dihydroorotase and related cyclic amidohydrolases |
TIGRFAM ID | [TIGR00857] dihydroorotase, multifunctional complex type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.148654 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGATCG ACCGCCGCCC GATCCTGCTC GCCAACGCCC GCCTGATCGA TCCGGCGCGC GACTTCGACG GCCTCGGCGA CGTGCTGATC GCCGACGGCG TGATCCGCGA CGCGCGCCGC GGCATCGGCG CCGCCGGCGT GCCCGAGGGC ACCGACATTA TTAATTGCGC CGGCATGGTG GTCGCCCCCG GGCTGGTCGA TATGCGCGCC TTCGTCGGCG AGCCCGGCGC CAGCCACCGC GAGACCTTCG CCTCGGCCAG CCAGGCCGCC GCCGCCGGCG GCATCACCAC CATCATCTGC CAGCCCAACA CCTCGCCGGT GATCGACAAT TCGGCGACGG TCGATTTCGT GATGCGTCGC GCCCGCGACA CCGCGATCGT CAACATCCAT CCGATGGCCG CGCTGACCAA GGGTCTGAAC GGAATCGAGA TGACCGAGAT CGGGCTGTTG AAGGCCGCCG GCGCCGTGGC GTTCAGCGAC GGCGACCGCA GTGTGATGAA TGCGCGGGTG ATGCGCAGCG CGCTGACCTA TGCGCGCGAT TTCGATGCGC TGATCGTCCA TCACACCGAA GACCCCGATC TGGTTGCCGA AGGCGTGATG AACGAAAGCG AGTTCGCCAC CCGCCTCGGC CTGTCCGGCG TGCCGAGTGC CGCAGAAGCC GTGGTGCTGG AGCGCGACGT CCGCCTCGCC GCATTGACCG GCGGGCGCTA TCACGCCGCC TCGCTGACCT GCATCGAGTC GCTGGAGATC CTGCAGCGCG CTCGCGACGC CGGGATCAAC GTCTCGGCGT CGGTGTCGAT CAATCACGTC ACGCTCAACG AGAACGACAT CGGCCCGTAT CGCACCTTCC TCAAGCTGTC GCCGCCGCTG CGCAGCGAGG ACGACCGCAA GGCGCTGATC GCCGCGGTCT CGTCCGGTCT GATCGACGTC ATCATGTCGG ACCACAATCC GCAGGATGTC GAGGTCAAGC GGCTGCCCTT CGCCGAGGCC GCCGCCGGCG CGATCGGGCT GGAGACGATG CTGCCGGCCG GCCTGCGATT GCTGCACGCC GGCGAACTCG ATCTCTTGAG TCTGATCCGC GCGATGTCGA CCCGCCCGGC CGAACTGCTC GGCCTGCCCG GCGGCACGCT GCGCGCAGGC AGCCCGGCCG ACCTGATCGT GATCGACCTC GACACGCCGT GGATCGTCGA TCCGAACGAA CTGAAATCGA AGTGCAAGAA CACCCCGTTC GACGAGGCGC GGTTCTCCGG ACGGGTGGTC CGCACCATCG TCGGCGGACG CACGGTGTAC GAGCACGTCA GCGCACATTG A
|
Protein sequence | MLIDRRPILL ANARLIDPAR DFDGLGDVLI ADGVIRDARR GIGAAGVPEG TDIINCAGMV VAPGLVDMRA FVGEPGASHR ETFASASQAA AAGGITTIIC QPNTSPVIDN SATVDFVMRR ARDTAIVNIH PMAALTKGLN GIEMTEIGLL KAAGAVAFSD GDRSVMNARV MRSALTYARD FDALIVHHTE DPDLVAEGVM NESEFATRLG LSGVPSAAEA VVLERDVRLA ALTGGRYHAA SLTCIESLEI LQRARDAGIN VSASVSINHV TLNENDIGPY RTFLKLSPPL RSEDDRKALI AAVSSGLIDV IMSDHNPQDV EVKRLPFAEA AAGAIGLETM LPAGLRLLHA GELDLLSLIR AMSTRPAELL GLPGGTLRAG SPADLIVIDL DTPWIVDPNE LKSKCKNTPF DEARFSGRVV RTIVGGRTVY EHVSAH
|
| |