Gene RPB_2884 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2884 
Symbol 
ID3910678 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3284224 
End bp3285288 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content68% 
IMG OID637884785 
Productdihydrouridine synthase TIM-barrel protein NifR3 
Protein accessionYP_486497 
Protein GI86750001 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0042] tRNA-dihydrouridine synthase 
TIGRFAM ID[TIGR00737] putative TIM-barrel protein, nifR3 family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.423298 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCGCT GGCGACAGAA TGAGCAGCTT GTGACCCGAC CTACAGTAAC AGGTTCGCCG 
CCTTTGAGAA TTGGCAAAAT TGCGGTGGCC AATCGCGTGC TGCTGGCGCC GATGTCCGGC
ATTACAGATG CGCCGTTTCG CAAACAGGTC GCAGCTCTCG GCGCCGGGCT CGTGGTATCC
GAGATGACCG CCAGCGAAGA TCTTGTGCAG GGGCGCGCGA TGTCTGTCCG CCGCTGCGAC
GCCATCGACG ACGGGCCGCA TGTGGTTCAG CTCGCCGGCT GCGAGCCGCA CTGGATGGCG
GAAGCCGCCC GGATCGCGGA AGCCGGCGGC GCCGATATCA TCGACATCAA TATGGGCTGC
CCGGCGCGGC ATGTGACCGG CGGCCAGTCC GGATCGGCCT TGATGCGTGA TCTCGATCAC
GCGCTGACGC TGATCGAGGC AACGATCGGC GCGGTGCGCG TGCCGGTGAC GCTGAAGATG
CGGCTCGGCT GGGACGATCG CTCGCGCAAC GCGCCGGAAC TGGCGCGGCG AGCCGAGGCC
GCCGGCGTGC AACTCGTGAC CGTGCATGGC CGCACCCGCA GTCAGTTCTA CAAGGGCGAA
GCCGACTGGC GTGCCGTCCG CGCGGTCCGC GACGCCGTCG GCATTCCGCT GGTCGTCAAT
GGCGATATCA CCTCGTATCA GATGGCGGTC GAGGCGCTGG ATCAGTCCGG CGCCGACGCG
GTGATGATCG GCCGCGGGGC GCAGGGGCAG CCGTGGCTGC CGGGCCAGAT CGGTCGCCGC
CTGCAGACCG GGATCGCCGA AGCGATGCCG TCGCTCACCG ATCAGCTCGC TTATCTGCGC
GCGCTGTATG ACGGCGTGCT CGGCCTGTAT GGCGAGCGCA TCGGGCTGCG CCATGCGCGC
AAACATCTCG GCTGGTCGCT CGACGTCGCC GCCGCCGCCA GCGGCGCGCC GCCGGCGACG
CTGAAGCGGT GGAGGACGAC GATCCTCACC GACGACAGCC CGACCCGTGT GCATCATGCG
CTCACTGACG CCTACGACGA TTTCGCCTGG AGAGCCGCCG CATGA
 
Protein sequence
MTRWRQNEQL VTRPTVTGSP PLRIGKIAVA NRVLLAPMSG ITDAPFRKQV AALGAGLVVS 
EMTASEDLVQ GRAMSVRRCD AIDDGPHVVQ LAGCEPHWMA EAARIAEAGG ADIIDINMGC
PARHVTGGQS GSALMRDLDH ALTLIEATIG AVRVPVTLKM RLGWDDRSRN APELARRAEA
AGVQLVTVHG RTRSQFYKGE ADWRAVRAVR DAVGIPLVVN GDITSYQMAV EALDQSGADA
VMIGRGAQGQ PWLPGQIGRR LQTGIAEAMP SLTDQLAYLR ALYDGVLGLY GERIGLRHAR
KHLGWSLDVA AAASGAPPAT LKRWRTTILT DDSPTRVHHA LTDAYDDFAW RAAA