Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_2588 |
Symbol | |
ID | 4023084 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 2899977 |
End bp | 2901038 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637962785 |
Product | dihydrouridine synthase TIM-barrel protein NifR3 |
Protein accession | YP_569718 |
Protein GI | 91977059 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0042] tRNA-dihydrouridine synthase |
TIGRFAM ID | [TIGR00737] putative TIM-barrel protein, nifR3 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.225137 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00524476 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGGTGGC GACAGAATGA GCAGCTTGTG ATCCACCCAA CAGTAACAGC TTTGCCGGCT TTGAGGATTG GCAATATTGC GGTGGCCAAT CGGGTGCTGC TGGCGCCGAT GTCCGGCATT ACCGACGCGC CTTTCCGCAA GCAGGTCGCA GCTCTCGGGG CCGGATTGGT GGTGTCCGAG ATGACCGCGA GCGAAGACCT TGTCCAGGGG CGCGCAATGT CGGTCCGCCG CTGCGACGCC ATCGACGGTG CTCCGCACGT TGTCCAGCTC GCCGGCTGCG AACCGCATTG GATGGCGGAA GGCGCCCGGA TCGCCGAGGC GGGCGGCGCC GACATCATCG ATATCAATAT GGGCTGTCCG GCCCGGCACG TGACCGGCGG CCAATCCGGT TCGGCGTTGA TGCGTGATCT CGATCACGCA CTGACGCTGA TCGAGGCCAC GATCGACGCG GTGCGCGTGC CGGTGACGCT GAAGATGCGG CTCGGCTGGG ATGAGCGCTC GCTCAACGCG CCGGAATTGG CGCGGCGGGC CGAGGCCGCC GGCGTCCAAT TAGTGACGGT TCACGGCCGC ACCCGCAGTC AGTTCTACAA AGGCGAGGCC GACTGGCGCG CGGTCCGCGC CGTTCGCGAG GCGATCAGCA TTCCACTGGT CGTCAACGGC GATATCACGA CGTATCACAT GGCGGTCGAA GCGCTCGACC AGTCCGGCGC CGACGCGGTA ATGATCGGCC GCGGTGCGCA GGGGCAGCCC TGGCTGCCGG GCCAGATCGG CCGGCGGCTG CAGACCGGGA TCGCCGAGGC GATGCCGTCG CTCGCCGAAC AGTTCGACTA TCTCCGCAGC CTCTATGACG GCGTGCTGAG TTTGTACGGA CAACGCATTG GGCTGCGCCA CGCTCGCAAG CATCTCGGCT GGTCGCTCGA CGTCGCCGCA GCGGCGAGCG GCGCGCCGCC GGCGGCGCTG AAAAGCTGGC GGGCCCAGAT CCTGACCGAG GAAAATCCGG TCCGTGTGCA TCGTGCGCTT GCCGATGCCT ACGACGATTT CGCCTGGAGA GCCGCAGCAT GA
|
Protein sequence | MRWRQNEQLV IHPTVTALPA LRIGNIAVAN RVLLAPMSGI TDAPFRKQVA ALGAGLVVSE MTASEDLVQG RAMSVRRCDA IDGAPHVVQL AGCEPHWMAE GARIAEAGGA DIIDINMGCP ARHVTGGQSG SALMRDLDHA LTLIEATIDA VRVPVTLKMR LGWDERSLNA PELARRAEAA GVQLVTVHGR TRSQFYKGEA DWRAVRAVRE AISIPLVVNG DITTYHMAVE ALDQSGADAV MIGRGAQGQP WLPGQIGRRL QTGIAEAMPS LAEQFDYLRS LYDGVLSLYG QRIGLRHARK HLGWSLDVAA AASGAPPAAL KSWRAQILTE ENPVRVHRAL ADAYDDFAWR AAA
|
| |