Gene Saro_1926 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1926 
Symbol 
ID3917149 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2037905 
End bp2038960 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content64% 
IMG OID640444672 
Productdihydrouridine synthase TIM-barrel protein nifR3 
Protein accessionYP_497200 
Protein GI87199943 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0042] tRNA-dihydrouridine synthase 
TIGRFAM ID[TIGR00737] putative TIM-barrel protein, nifR3 family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.0994429 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCCAGG TTCCCGTTCC GCCCCGCCTC AAGCCCATCC AGGTTGGCCC CGTCACCGTG 
AGCTGCCCGG TCGTGCTCGC GCCAATGACG GGCGTTTCGG ACCTGCCATT CCGCACGATC
GTGCGCCGCT TCGGCTCGGG GCTCAACGTG ACCGAGATGA TTGCGAGCCC TGCGGCCATC
CGTGAGACGC GGCAATCCAT CCAGAAGGCC GCATGGCACC CGACGGAAGA ACCGGTCTCC
ATGCAACTCG TTGGCTGCGA GCCCGAGCAG ATGGCCGAAG CTGCCAAGCT TTCGGAAGAC
AAAGGCGCGG CAATTATCGA CATCAACATG GGCTGCCCCG TGCGCAAGGT CGTTAACGGG
GATGCCGGCT CGGCCTTGAT GCGCGACATT CCGTTAGCTA CCCGCCTGAT CGAGGCCACG
GTGAAGGCGG TGAAGGTGCC CGTCACCGTA AAGATGCGCA TGGGCTGGTG CCACGACAGC
CTGAATGCTC CCGAACTCGC GCGCATTGCC GAGGATCTCG GCGCGAAGAT GATTACCGTG
CACGGACGCA CCCGCAACCA GATGTACAAG GGCAGTGCCG ACTGGGCCTT CGTCCGCAGC
GTCAAGGAGG CGGTGTCAAT TCCAGTGATC GTCAACGGCG ACATCTGCGG GATCGAGGAT
GTCGCTACCG CCATAGAGCA GAGCGGCGCG GACGGCGTGA TGATCGGGCG GGGGTCCTAT
GGTCGTCCGT GGCTTCTCGG CCAGATCATG CATTGGCTCG ACACCGGCGA GCGTCTGGCG
GACCCCCCCC TTGCCGAACA GTACGCCCTC ATTGTCGAGC ACTACCACGC GATGCAGGAG
CATTACGGCG AGATGACCGG CGTCAACATG GCGCGCAAGC ATCTCGGCTG GTATACCAAG
GGGCTGCACG GCTCAGCCGA TTTCCGCAAC AAGGTCAACT TCATCGACGA CCCAAAGGCC
GTCCTGGCAA GTCTGGCGGA TTTCTACGGA CGAGCGATCG AACACCAGCA CTCCGCCGCC
GAGGCTTCCG CTGCGGAGCG GCGCGCAGCC GCATGA
 
Protein sequence
MSQVPVPPRL KPIQVGPVTV SCPVVLAPMT GVSDLPFRTI VRRFGSGLNV TEMIASPAAI 
RETRQSIQKA AWHPTEEPVS MQLVGCEPEQ MAEAAKLSED KGAAIIDINM GCPVRKVVNG
DAGSALMRDI PLATRLIEAT VKAVKVPVTV KMRMGWCHDS LNAPELARIA EDLGAKMITV
HGRTRNQMYK GSADWAFVRS VKEAVSIPVI VNGDICGIED VATAIEQSGA DGVMIGRGSY
GRPWLLGQIM HWLDTGERLA DPPLAEQYAL IVEHYHAMQE HYGEMTGVNM ARKHLGWYTK
GLHGSADFRN KVNFIDDPKA VLASLADFYG RAIEHQHSAA EASAAERRAA A