Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1926 |
Symbol | |
ID | 3917149 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 2037905 |
End bp | 2038960 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640444672 |
Product | dihydrouridine synthase TIM-barrel protein nifR3 |
Protein accession | YP_497200 |
Protein GI | 87199943 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0042] tRNA-dihydrouridine synthase |
TIGRFAM ID | [TIGR00737] putative TIM-barrel protein, nifR3 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.0994429 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCAGG TTCCCGTTCC GCCCCGCCTC AAGCCCATCC AGGTTGGCCC CGTCACCGTG AGCTGCCCGG TCGTGCTCGC GCCAATGACG GGCGTTTCGG ACCTGCCATT CCGCACGATC GTGCGCCGCT TCGGCTCGGG GCTCAACGTG ACCGAGATGA TTGCGAGCCC TGCGGCCATC CGTGAGACGC GGCAATCCAT CCAGAAGGCC GCATGGCACC CGACGGAAGA ACCGGTCTCC ATGCAACTCG TTGGCTGCGA GCCCGAGCAG ATGGCCGAAG CTGCCAAGCT TTCGGAAGAC AAAGGCGCGG CAATTATCGA CATCAACATG GGCTGCCCCG TGCGCAAGGT CGTTAACGGG GATGCCGGCT CGGCCTTGAT GCGCGACATT CCGTTAGCTA CCCGCCTGAT CGAGGCCACG GTGAAGGCGG TGAAGGTGCC CGTCACCGTA AAGATGCGCA TGGGCTGGTG CCACGACAGC CTGAATGCTC CCGAACTCGC GCGCATTGCC GAGGATCTCG GCGCGAAGAT GATTACCGTG CACGGACGCA CCCGCAACCA GATGTACAAG GGCAGTGCCG ACTGGGCCTT CGTCCGCAGC GTCAAGGAGG CGGTGTCAAT TCCAGTGATC GTCAACGGCG ACATCTGCGG GATCGAGGAT GTCGCTACCG CCATAGAGCA GAGCGGCGCG GACGGCGTGA TGATCGGGCG GGGGTCCTAT GGTCGTCCGT GGCTTCTCGG CCAGATCATG CATTGGCTCG ACACCGGCGA GCGTCTGGCG GACCCCCCCC TTGCCGAACA GTACGCCCTC ATTGTCGAGC ACTACCACGC GATGCAGGAG CATTACGGCG AGATGACCGG CGTCAACATG GCGCGCAAGC ATCTCGGCTG GTATACCAAG GGGCTGCACG GCTCAGCCGA TTTCCGCAAC AAGGTCAACT TCATCGACGA CCCAAAGGCC GTCCTGGCAA GTCTGGCGGA TTTCTACGGA CGAGCGATCG AACACCAGCA CTCCGCCGCC GAGGCTTCCG CTGCGGAGCG GCGCGCAGCC GCATGA
|
Protein sequence | MSQVPVPPRL KPIQVGPVTV SCPVVLAPMT GVSDLPFRTI VRRFGSGLNV TEMIASPAAI RETRQSIQKA AWHPTEEPVS MQLVGCEPEQ MAEAAKLSED KGAAIIDINM GCPVRKVVNG DAGSALMRDI PLATRLIEAT VKAVKVPVTV KMRMGWCHDS LNAPELARIA EDLGAKMITV HGRTRNQMYK GSADWAFVRS VKEAVSIPVI VNGDICGIED VATAIEQSGA DGVMIGRGSY GRPWLLGQIM HWLDTGERLA DPPLAEQYAL IVEHYHAMQE HYGEMTGVNM ARKHLGWYTK GLHGSADFRN KVNFIDDPKA VLASLADFYG RAIEHQHSAA EASAAERRAA A
|
| |