Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_2576 |
Symbol | |
ID | 3970915 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 2796921 |
End bp | 2797991 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637925686 |
Product | dihydrouridine synthase TIM-barrel protein NifR3 |
Protein accession | YP_532445 |
Protein GI | 90424075 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0042] tRNA-dihydrouridine synthase |
TIGRFAM ID | [TIGR00737] putative TIM-barrel protein, nifR3 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.04538 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.387025 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTTGCGAT CGCCGAACCA CGAGAAGGCT GTGACCGCCT CTCCTTTAAC ATGTCCTCGC GCGCTGCAAA TTGGCGTTGT TGAGGTGGTG AACAGGGCGT TCCTGGCCCC TATGTCCGGC ATCACCGATG CGCCGTTCCG GCGGCTCACC GCGGCGCTCG GCGCCGGCCT GGTGGTATCG GAAATGACCG CCAGCGACGA TCTGGCGCGC GGCCGGCCGA TGTCGGTGCG GCGTTGCGAT ACCGCGGGGA TCGGGCCGCA TGTGGTGCAG CTCGCCGGCT GCCAGGCGCG CTGGATGGCG GAGGGCGCGC GGATTGCGGA AGCCGGCGGC GCCGACCTGA TCGATATCAA CATGGGCTGT CCGGCCCGCC ACGTCACCGG CGGCAGCGGG CAGTCCGGCT CGGCCTTGAT GCGCGACCTC GACCACGCGC TGACGCTGAT CGAAGCCACG GTCGGCGCGG TGCGGGTGCC GGTGACCCTG AAAATGCGGC TCGGCTGGGA TCGCGATTGC CTTAACGCGC CGGAATTGGC GCGCCGCGCC GAGGCCGCCG GCGTGCAGCT GATCACCGTG CACGGCCGCA CCCGCAATCA GTTCTACAAG GGCACGGCGG ATTGGGCCGC GGTGCGCGCA GTGCGCGAGG CGACCTCGCT GCCGCTGGTG GTCAACGGCG ATATCACCTC GGTCGAAGCC GCGCGCGAAG CGCTGCGGCA GTCCGGCGCC GATGCGGTGA TGATCGGCCG CGGCGCGCAG GGCCAGCCCT GGCTGCCGGG GCAGATCGGC CGGCAATTGC AGACCGGCAT CGCCGAAGCG CCGCCGTCGC TCGCCGAACA GCTGAGCTAT CTGCAGACGC TGTATGACGA ACTGTTGGAG CTTTACGGCC TGCACGTCGG CTTGCGTCAC GCCCGCAAGC ATCTCGGCTG GTCGCTCGAA GTCGCCGCCA GCGCAACCGA CGCCGAGCCC GCGACGCTGA AATCCTGGCG CGAACGGATC CTGCGGTCGG AGGATCCCGC CGCGGTCCGT CGCGCGCTGG TCGAGGCGTT TGACGATTTT GCCTGGAGGG CCGCCGCATG A
|
Protein sequence | MLRSPNHEKA VTASPLTCPR ALQIGVVEVV NRAFLAPMSG ITDAPFRRLT AALGAGLVVS EMTASDDLAR GRPMSVRRCD TAGIGPHVVQ LAGCQARWMA EGARIAEAGG ADLIDINMGC PARHVTGGSG QSGSALMRDL DHALTLIEAT VGAVRVPVTL KMRLGWDRDC LNAPELARRA EAAGVQLITV HGRTRNQFYK GTADWAAVRA VREATSLPLV VNGDITSVEA AREALRQSGA DAVMIGRGAQ GQPWLPGQIG RQLQTGIAEA PPSLAEQLSY LQTLYDELLE LYGLHVGLRH ARKHLGWSLE VAASATDAEP ATLKSWRERI LRSEDPAAVR RALVEAFDDF AWRAAA
|
| |