Gene Saro_3088 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3088 
Symbol 
ID3916703 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3308092 
End bp3309804 
Gene Length1713 bp 
Protein Length570 aa 
Translation table11 
GC content64% 
IMG OID640445871 
Productdihydroorotase 
Protein accessionYP_498357 
Protein GI87201100 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3653] N-acyl-D-aspartate/D-glutamate deacylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCGCGT TCGACATCGT CCTTCGCGAG GGCACCGTGG TTGATGGGAC CGGCGCCGAC 
CCATTCGTGG CCGATGTCGC GATTTCAGGA GGACGGATCG CAGCGATCGG ACCGAACCTG
TCGCGCGGCA CGACGGAGAT CAACGCGAGT GGCCAGATCG TCACGCCCGG CTTTGTCGAT
CTTCACACCC ACTATGACGG ACAGGTCACA TGGGATAACC GCCTGGCGCC GTCGTCGTAT
CATGGGGTGA CGACGGCCGT GATCGGCAAT TGCGGGGTCG GCTTCGCCCC GTGCCGCCGC
GATGCGGCAT CCCGCGATGC CATGGTCCGC CTGATGGAAG GGGTGGAGGA CATCCCGCAC
CCGGTCTTGA CCGAAGGCCT GCCGTGGACC TGGGAAACGT TCCCCGACTA CCTCGATTTC
CTCGCTTCGC GCACATACGA CATCGACGTT GCCGCCTATG TTCCGCACGC ACCGCTGCGA
GTGTACGTGA TGGGGCAGCG TGGAGCTGAT CGCGAACCGG CTACGCAAAC CGACATTGCG
GAAATGTGCC GCCTTCTGGA ACAGGCGCTC GACCGGGGCG CACTGGGGAT CGCAACGTCG
CGCAGTCTCT TCCACCGCTC CAGCGACGGT ACGGCAATCC CAACCTACCA GGCTGCTCAG
GCCGAACTGA TGGCCTTTGC AGAAGTCCTG CGTGCCAAGG GCAAAGGCGT GTTCCAGATC
GTCGAGGATA TCCATGTCCC CGGCGCCAGT CTGGACAACA TGCGAGAGCT GGCCCGCACA
TCCGGCCGGC CACTGACCTT CTCCATCGGC ACCGGCAATA CCGGGCCCTA TGGCTATCCG
CGTCTTCTCG ACGAACTGGC CGCGGCGAAT GCCGAGGGGC TGGTAATGAA GGGCCAGCTG
ATGCCCCGCG GAATCGGGAT GATCCTGGGT TTCGAGCTGA CTTTGAACCC GTTCTATACG
ACGTCCACCT TTGCGCGGCT CGCGCCACTG CCACTTGCGG AGCGCCTGGA GCAACTGCGC
CGTCCCGAAA TCCGTGCTGC AATCCTTTCC GAACCGATGG ATCCGGATCC GGCTCTGGTC
CTGGGCCGCG CGGTGCGCGA TTTCGATCAC ATGTTCCTGC TCGGCGATGA TCCGGATTAC
GAGCAGCCGC CGGAACGCAG TATTGCGGGT CGTGCCCGGG AGGCCGGAAT TACACCGGAG
GAACTGGCCT ACGACGTGAT GACGGAGGGC GAAAGCGGCG GCTTGCTCTA CCTTGCAATG
GCCAATTATG CCGATGGGAG CCTTGATGCG GTAGGCGACA TCCTGTCGCA CCCGGACGTG
GTGCTGGGGC TGGGCGATGG CGGTGCCCAT GTCGGCACGA TTTGCGATGC GAGCTATTCG
ACATTCGCGC TTTGCCACTG GGCGCGAGAC CGGCAGCGCG GACGAAAGAC AGTGGCTGAC
ATGGTTCACC GGATGACCCA GGCAACGGCC CGAGTGATCG GACTGGAAGA CCGCGGGGAA
CTCGCCGTTG GCAAGCGGGC CGACATCAAC GTGATCGATT TGGCACAGCT GGCGCTTCGC
CCGCCACAGG TTTGCCATGA TCTGCCTGCC GGTGGGCGCC GCCTGATCCA GCGTGCGACC
GGCTATAGTC TGACAATGCT AGCGGGAGAG ATTGTCCTGC GCGATGACGA GCCGACCGGT
CTGTTGCCCG GTCGTCTGAT CCGCGCGTCC TGA
 
Protein sequence
MSAFDIVLRE GTVVDGTGAD PFVADVAISG GRIAAIGPNL SRGTTEINAS GQIVTPGFVD 
LHTHYDGQVT WDNRLAPSSY HGVTTAVIGN CGVGFAPCRR DAASRDAMVR LMEGVEDIPH
PVLTEGLPWT WETFPDYLDF LASRTYDIDV AAYVPHAPLR VYVMGQRGAD REPATQTDIA
EMCRLLEQAL DRGALGIATS RSLFHRSSDG TAIPTYQAAQ AELMAFAEVL RAKGKGVFQI
VEDIHVPGAS LDNMRELART SGRPLTFSIG TGNTGPYGYP RLLDELAAAN AEGLVMKGQL
MPRGIGMILG FELTLNPFYT TSTFARLAPL PLAERLEQLR RPEIRAAILS EPMDPDPALV
LGRAVRDFDH MFLLGDDPDY EQPPERSIAG RAREAGITPE ELAYDVMTEG ESGGLLYLAM
ANYADGSLDA VGDILSHPDV VLGLGDGGAH VGTICDASYS TFALCHWARD RQRGRKTVAD
MVHRMTQATA RVIGLEDRGE LAVGKRADIN VIDLAQLALR PPQVCHDLPA GGRRLIQRAT
GYSLTMLAGE IVLRDDEPTG LLPGRLIRAS