Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_1363 |
Symbol | |
ID | 4076380 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 1457721 |
End bp | 1458707 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638006673 |
Product | dihydrouridine synthase TIM-barrel protein nifR3 |
Protein accession | YP_613358 |
Protein GI | 99081204 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0042] tRNA-dihydrouridine synthase |
TIGRFAM ID | [TIGR00737] putative TIM-barrel protein, nifR3 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.511706 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTCCTTTT CAGTTGGACC CACACATCTC GATCCCCCCG TCGCTCTGGC GCCCATGGCC GGGATCACTG ACCGCCCGTT TCGGGATCTG GTGCGCTCTT TCGGGGCGGG CCTAGTGGTG AGCGAGATGG TCGCCAGTCA GGAGATGGTT CAAGCCAAGC CCGGCGTGCG CGAGAAGGCG GAACTGAGCG CCGATGTGGA GAATACCTCG GTTCAGATCG CCGGGCGCGA CGCGTATTGG ATGGCAGAGG CCGCGCGTCA GGTGGCAGAT CGTGGGGCGC GGATGATCGA CATCAACATG GGATGTCCGG CAAAGAAAGT GACCAACGGC TATTCGGGCT CTGCGCTCCT GAAGACCCCC GATCACGCGC TGTCGTTGAT TGAGGCAGTC GTTCAGGCGG TGGATGTGCC TGTCACGCTC AAGACCCGGT TGGGGTGGGA CGATAACTGT CTCAATGCCG CTGATGTGGC GCGCCGCGCC GAAGCCGCGG GTGTCCAGAT GGTCACTATC CATGGTCGTA CCCGGTGCCA GTTTTACAAA GGTCATGCTG ACTGGGCTGC GATCTCGGAG ATCAAGAATG CGATCTCTGT TCCCTTGCTG GCCAATGGCG ATATTGTCGA TGCAAAAAGC GCGGGCAAGG CGCTCTCAGA CTCCGGAGCG GATGGCGTCA TGATCGGGCG CGGTGTGCAG GGAAGACCCT GGCTTCTGGC TCAGATCGCG CATGATCTCT GGGGCACGGC TGCTCCGGAC GTTCCCGAAG GGCGCGCATT TATTGATCTG GTTTCAAAGC ACTACGAGGC GATGCTTGCC TTTTATGGGG CGGAGTTGGG CCTCCGCGTC GCGCGCAAGC ACCTAGGCTG GTATATGGAT GAGGCCGGGA CACCTGCGGC CCTGCGGCGC GAGGTTCTGA CGGCCAAATC CCCCTCTGAT GTGTTGCGAT TGCTCCCGAG TGCGCTTCAG GGAACTGAGC AGGAGACTGC CGCATGA
|
Protein sequence | MSFSVGPTHL DPPVALAPMA GITDRPFRDL VRSFGAGLVV SEMVASQEMV QAKPGVREKA ELSADVENTS VQIAGRDAYW MAEAARQVAD RGARMIDINM GCPAKKVTNG YSGSALLKTP DHALSLIEAV VQAVDVPVTL KTRLGWDDNC LNAADVARRA EAAGVQMVTI HGRTRCQFYK GHADWAAISE IKNAISVPLL ANGDIVDAKS AGKALSDSGA DGVMIGRGVQ GRPWLLAQIA HDLWGTAAPD VPEGRAFIDL VSKHYEAMLA FYGAELGLRV ARKHLGWYMD EAGTPAALRR EVLTAKSPSD VLRLLPSALQ GTEQETAA
|
| |