Gene Rsph17029_1458 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1458 
Symbol 
ID4896454 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp1521361 
End bp1522341 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content74% 
IMG OID640112046 
ProductnifR3 family TIM-barrel protein 
Protein accessionYP_001043340 
Protein GI126462226 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0042] tRNA-dihydrouridine synthase 
TIGRFAM ID[TIGR00737] putative TIM-barrel protein, nifR3 family 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.425307 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCCCTTC GCATCGCTCA GCTTCCGGTG GATCCGCCGG TGCTCCTCGC GCCGATGGCG 
GGAATTACGG ATCTTCCCTT CCGGCGCCTT GCCGTCCGCT TCGGCGCCGG GCTGGTCGTC
TCCGAGATGG TGGCGAGCCG CGAGGTGGTC TGCGCGGGCG CCGAGGCGCG GGCGCGGGCC
GAGATCGGCG CCGCCGAGGG ACGCACGGCG GTGCAGCTGG CGGGGTGCGA TCCGCACTGG
ATGGCCGAGG CCGCGCGGCT GGTCGAGGCG CAGGGGGCCC GCATCATCGA CATCAACATG
GGCTGCCCCG CCAAGCGGGT CACGAACGGC TGGTCGGGCT CGGCCCTGAT GCGCGAGCCC
GACCGGGCGC TTGCGCTGAT CGAGGCGGTG GTGGGGGCGG TCGCGGTGCC GGTCACGCTC
AAGATGCGGC TGGGCTGGGA CGAGGGGATG CTGAATGCGC CCGAGATCGC GCGGCGCGCC
GAAGCGGCGG GGGTGGCGAT GATCACGATC CATGGCCGCA CGCGCTGCCA GTTCTACACC
GGCCGCGCCG ACTGGGCCGC GATCCGCCCG GTGGTCGAGG CGGTCTCGGT GCCGGTGGTG
GCCAATGGCG ACATCACGGG GCCCGAGGAG GCGCAGGCGG CGCTGGCGGC CTCGGGGGCT
GCGGGCGTCA TGGTCGGGCG CGGCGCGCAG GGCCGGCCCT GGCTTCTCGG ACAGGTGGCC
TCGGCGCTCG ACGGGCGCGC GGCACCCGAC GTGCCCGAGG GAGAGGCGCT CGCCGATCTG
GTCGTGGCCC ATTACGAGGA GATGCTCTCT TTCTACGGGC GCGACCTCGG GCTGCGCGTC
GCGCGCAAAC ATCTCAACTG GTATCTCGAG GCCGCGGGTC TGGCGGCGCA CCGGGGCCCC
ATCGTGACCG GAACCGATCC GGCCCGGGTG GTCCGCGCGC TGCGGCAGGC CTTCGGCGCA
CAGGAAAGGG CCGCCGCATG A
 
Protein sequence
MALRIAQLPV DPPVLLAPMA GITDLPFRRL AVRFGAGLVV SEMVASREVV CAGAEARARA 
EIGAAEGRTA VQLAGCDPHW MAEAARLVEA QGARIIDINM GCPAKRVTNG WSGSALMREP
DRALALIEAV VGAVAVPVTL KMRLGWDEGM LNAPEIARRA EAAGVAMITI HGRTRCQFYT
GRADWAAIRP VVEAVSVPVV ANGDITGPEE AQAALAASGA AGVMVGRGAQ GRPWLLGQVA
SALDGRAAPD VPEGEALADL VVAHYEEMLS FYGRDLGLRV ARKHLNWYLE AAGLAAHRGP
IVTGTDPARV VRALRQAFGA QERAAA