Gene RPD_2588 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_2588 
Symbol 
ID4023084 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2899977 
End bp2901038 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content67% 
IMG OID637962785 
Productdihydrouridine synthase TIM-barrel protein NifR3 
Protein accessionYP_569718 
Protein GI91977059 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0042] tRNA-dihydrouridine synthase 
TIGRFAM ID[TIGR00737] putative TIM-barrel protein, nifR3 family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.225137 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00524476 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGGTGGC GACAGAATGA GCAGCTTGTG ATCCACCCAA CAGTAACAGC TTTGCCGGCT 
TTGAGGATTG GCAATATTGC GGTGGCCAAT CGGGTGCTGC TGGCGCCGAT GTCCGGCATT
ACCGACGCGC CTTTCCGCAA GCAGGTCGCA GCTCTCGGGG CCGGATTGGT GGTGTCCGAG
ATGACCGCGA GCGAAGACCT TGTCCAGGGG CGCGCAATGT CGGTCCGCCG CTGCGACGCC
ATCGACGGTG CTCCGCACGT TGTCCAGCTC GCCGGCTGCG AACCGCATTG GATGGCGGAA
GGCGCCCGGA TCGCCGAGGC GGGCGGCGCC GACATCATCG ATATCAATAT GGGCTGTCCG
GCCCGGCACG TGACCGGCGG CCAATCCGGT TCGGCGTTGA TGCGTGATCT CGATCACGCA
CTGACGCTGA TCGAGGCCAC GATCGACGCG GTGCGCGTGC CGGTGACGCT GAAGATGCGG
CTCGGCTGGG ATGAGCGCTC GCTCAACGCG CCGGAATTGG CGCGGCGGGC CGAGGCCGCC
GGCGTCCAAT TAGTGACGGT TCACGGCCGC ACCCGCAGTC AGTTCTACAA AGGCGAGGCC
GACTGGCGCG CGGTCCGCGC CGTTCGCGAG GCGATCAGCA TTCCACTGGT CGTCAACGGC
GATATCACGA CGTATCACAT GGCGGTCGAA GCGCTCGACC AGTCCGGCGC CGACGCGGTA
ATGATCGGCC GCGGTGCGCA GGGGCAGCCC TGGCTGCCGG GCCAGATCGG CCGGCGGCTG
CAGACCGGGA TCGCCGAGGC GATGCCGTCG CTCGCCGAAC AGTTCGACTA TCTCCGCAGC
CTCTATGACG GCGTGCTGAG TTTGTACGGA CAACGCATTG GGCTGCGCCA CGCTCGCAAG
CATCTCGGCT GGTCGCTCGA CGTCGCCGCA GCGGCGAGCG GCGCGCCGCC GGCGGCGCTG
AAAAGCTGGC GGGCCCAGAT CCTGACCGAG GAAAATCCGG TCCGTGTGCA TCGTGCGCTT
GCCGATGCCT ACGACGATTT CGCCTGGAGA GCCGCAGCAT GA
 
Protein sequence
MRWRQNEQLV IHPTVTALPA LRIGNIAVAN RVLLAPMSGI TDAPFRKQVA ALGAGLVVSE 
MTASEDLVQG RAMSVRRCDA IDGAPHVVQL AGCEPHWMAE GARIAEAGGA DIIDINMGCP
ARHVTGGQSG SALMRDLDHA LTLIEATIDA VRVPVTLKMR LGWDERSLNA PELARRAEAA
GVQLVTVHGR TRSQFYKGEA DWRAVRAVRE AISIPLVVNG DITTYHMAVE ALDQSGADAV
MIGRGAQGQP WLPGQIGRRL QTGIAEAMPS LAEQFDYLRS LYDGVLSLYG QRIGLRHARK
HLGWSLDVAA AASGAPPAAL KSWRAQILTE ENPVRVHRAL ADAYDDFAWR AAA