Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_4622 |
Symbol | |
ID | 3912439 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 5223342 |
End bp | 5224436 |
Gene Length | 1095 bp |
Protein Length | 364 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637886526 |
Product | dihydroorotate dehydrogenase 2 |
Protein accession | YP_488216 |
Protein GI | 86751720 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0167] Dihydroorotate dehydrogenase |
TIGRFAM ID | [TIGR01036] dihydroorotate dehydrogenase, subfamily 2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATCCGCG CCTTCGACGC GTTCTCGCTC CCGCTGCTGC GTCTGCTCGA CGCCGAAGAC GCCCACCGCC TCGCCATCCA GGGGTTGCGG CTGCTGCCGC AGGTGAAGCC GCGCCCGGAC GATTCCAAGC TCGCGGTGCG CGCCTTCGGG CTGAACTTCC CCAATCCGGT CGGCATCGCC GCCGGTTTCG ACAAGAATGC CGAAGCGCCG GATGCGCTGC TGCGGCTCGG CTTCGGCTTC GTCGAGATCG GCACGGTGAC GCCGAAGCCG CAGGCCGGCA ATCCGCGGCC GCGGTTGTTC CGGCTGGAGC GCGACGAGGC TATCATCAAC CGGATGGGCT TCAACAATGA CGGCGCCGAG GCCGTGCTAC GCCGGCTTGC GGCGCGGGCG CAGCAGGGCG GCATCGTCGG CGTCAATGTC GGCGCCAACA AGGACAGCAC CGATCGCGTC GCCGACTACG TGTCGCTGAT CGAGACCTTT GCGCCGGTGG CGAGCTATTT CACCGTCAAC GTGTCGTCGC CGAATACGCC GGGCCTGCGC AATCTGCAGC AGGCGGCGGC GCTCGACGAT CTGCTGGCGC GGGTGATCGA GGCCCGCGAA CGGGTCCGCG CCAGCGCCGG CGACACTCCT GTGCTGCTGA AGATTGCGCC CGACCTCACG CTCAGTGAAC TCGACGACGT GGTGCACATC GCCCGCTCGC GCCGGGTCGA CGGCATGATC GTCGCCAACA CCACGCTGTC GCGCTCCCCG ATGCTGCGCG AACGGACGCG GCTGAACGAG CAGGGCGGCC TCAGCGGCCG GCCGCTGTTC CGGCTGTCGA CCCGGATGGT GGCGGAGACC TATGTCCGGG CCGAGGGCGC ATTTCCGCTG ATCGGCGTCG GCGGCATCGA TTCCGGCGGC GCGGCGCTGA CCAAGATCCG CGCCGGCGCC AGCCTGGTGC AGCTGTATTC GGCGCTGATC TACAAGGGCC TCGGCCTCGT CGACAGCATC AAGGCCGATC TCGCCTCGAC GCTGCTGCGC ACCGGGCGTG ACTCGCTTTC CGAAATCGTC GGTGCCGACG CGCCGACCAT CACCGCGGAA GAGTGGCCGG TGTAA
|
Protein sequence | MIRAFDAFSL PLLRLLDAED AHRLAIQGLR LLPQVKPRPD DSKLAVRAFG LNFPNPVGIA AGFDKNAEAP DALLRLGFGF VEIGTVTPKP QAGNPRPRLF RLERDEAIIN RMGFNNDGAE AVLRRLAARA QQGGIVGVNV GANKDSTDRV ADYVSLIETF APVASYFTVN VSSPNTPGLR NLQQAAALDD LLARVIEARE RVRASAGDTP VLLKIAPDLT LSELDDVVHI ARSRRVDGMI VANTTLSRSP MLRERTRLNE QGGLSGRPLF RLSTRMVAET YVRAEGAFPL IGVGGIDSGG AALTKIRAGA SLVQLYSALI YKGLGLVDSI KADLASTLLR TGRDSLSEIV GADAPTITAE EWPV
|
| |