Gene TM1040_1363 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1363 
Symbol 
ID4076380 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1457721 
End bp1458707 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content61% 
IMG OID638006673 
Productdihydrouridine synthase TIM-barrel protein nifR3 
Protein accessionYP_613358 
Protein GI99081204 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0042] tRNA-dihydrouridine synthase 
TIGRFAM ID[TIGR00737] putative TIM-barrel protein, nifR3 family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.511706 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCCTTTT CAGTTGGACC CACACATCTC GATCCCCCCG TCGCTCTGGC GCCCATGGCC 
GGGATCACTG ACCGCCCGTT TCGGGATCTG GTGCGCTCTT TCGGGGCGGG CCTAGTGGTG
AGCGAGATGG TCGCCAGTCA GGAGATGGTT CAAGCCAAGC CCGGCGTGCG CGAGAAGGCG
GAACTGAGCG CCGATGTGGA GAATACCTCG GTTCAGATCG CCGGGCGCGA CGCGTATTGG
ATGGCAGAGG CCGCGCGTCA GGTGGCAGAT CGTGGGGCGC GGATGATCGA CATCAACATG
GGATGTCCGG CAAAGAAAGT GACCAACGGC TATTCGGGCT CTGCGCTCCT GAAGACCCCC
GATCACGCGC TGTCGTTGAT TGAGGCAGTC GTTCAGGCGG TGGATGTGCC TGTCACGCTC
AAGACCCGGT TGGGGTGGGA CGATAACTGT CTCAATGCCG CTGATGTGGC GCGCCGCGCC
GAAGCCGCGG GTGTCCAGAT GGTCACTATC CATGGTCGTA CCCGGTGCCA GTTTTACAAA
GGTCATGCTG ACTGGGCTGC GATCTCGGAG ATCAAGAATG CGATCTCTGT TCCCTTGCTG
GCCAATGGCG ATATTGTCGA TGCAAAAAGC GCGGGCAAGG CGCTCTCAGA CTCCGGAGCG
GATGGCGTCA TGATCGGGCG CGGTGTGCAG GGAAGACCCT GGCTTCTGGC TCAGATCGCG
CATGATCTCT GGGGCACGGC TGCTCCGGAC GTTCCCGAAG GGCGCGCATT TATTGATCTG
GTTTCAAAGC ACTACGAGGC GATGCTTGCC TTTTATGGGG CGGAGTTGGG CCTCCGCGTC
GCGCGCAAGC ACCTAGGCTG GTATATGGAT GAGGCCGGGA CACCTGCGGC CCTGCGGCGC
GAGGTTCTGA CGGCCAAATC CCCCTCTGAT GTGTTGCGAT TGCTCCCGAG TGCGCTTCAG
GGAACTGAGC AGGAGACTGC CGCATGA
 
Protein sequence
MSFSVGPTHL DPPVALAPMA GITDRPFRDL VRSFGAGLVV SEMVASQEMV QAKPGVREKA 
ELSADVENTS VQIAGRDAYW MAEAARQVAD RGARMIDINM GCPAKKVTNG YSGSALLKTP
DHALSLIEAV VQAVDVPVTL KTRLGWDDNC LNAADVARRA EAAGVQMVTI HGRTRCQFYK
GHADWAAISE IKNAISVPLL ANGDIVDAKS AGKALSDSGA DGVMIGRGVQ GRPWLLAQIA
HDLWGTAAPD VPEGRAFIDL VSKHYEAMLA FYGAELGLRV ARKHLGWYMD EAGTPAALRR
EVLTAKSPSD VLRLLPSALQ GTEQETAA