Gene Saro_3062 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3062 
Symbol 
ID3916676 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3280706 
End bp3281935 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content71% 
IMG OID640445844 
Productdihydroorotase 
Protein accessionYP_498331 
Protein GI87201074 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCGCGC AGCCTCTCAC GATCATAGGC GGCCGGCTGG TGCCGGGCTC GGGCGAGCCG 
GTCGCCGGGA ACCTGCGCTG CGAGGGCGGA CGCATCGTCG CGCTGGGCGA TGTGGCGCCG
CAGGACGGCG ACACGGTGTT CGACGCCGGG GGCGCGCTGG TCGCGCCGGG GCTGGTGGAC
CTCGGCGTCT TCGCGGTCGA CAAGCCCGCG TTCCATTTCG GCGGGATCAC GCGCGCGGCG
CTCATGCCCG ACCAGGCGCC GCCGCTCGAC CATCCGGCGC GGGTGCGCTT CGCCGCACAG
TCGGGCAAGC CGGACATGTG GGTCCACCCG CTGGCCGCCG CAACGCGCGG GCTGGAAGGA
ACGGAGTTGG CCGAACTGGC GCTGATGCGC GACGCGGGCG CCAAGGCTGT CTCCACCGGA
CGGGCATGGA TCGGCGATTC AGGCGTGATG CTGCGCCTGC TGCGCTACTG CGCGATGCTG
AGGCTGGTCG TGGTCACGCA TGCAGAGGAT GCGGCGATAA CCGGATCGGC CGTCGCGACG
GCAGGCGAGG TGGCGACGCG ACTCGGCCTG CCGAGCGCGC CGGCCGAGGC CGAAGCGCTG
GCCGTGGCTC GCGACATCGC GCTTGCCGAA ATGTCGGGCT GCCACGTCCA TTTCCGGCAG
GTGACGACGG CGCAGGCGCT GGATCTCGTG CGGACGGCCA AGGCGCGCGG CGTGCGCGTG
ACGGCAGGGG TCACGCCTGC GCACTTCGTC CTTTCCGATC TCGAACTCGT CGGCTTCCGC
ACCTTCTGCC GTCTCTCGCC GCCGCTGCGC TCGGACGCGG ATCGCAAGGC CGTGATCGCG
GCGATTGCCG ATGGCACCAT CGACGTCATC GCATCGGGTC ACGACCCGCG CGGGCCGGAA
GACAAGCGCC TCCCCTTTGC CGACGCGGAG CCCGGCATGG CGGGCGCCGA AACGCTCCTG
CCGCTTACCT TGACGCTCGT GCGCGACGGC GTGATCGATC TGGCACGCGC GTTCGAGCTG
CTCGCCGGAA ACCCGGCGCG GCTACTGGGG GTCGATGCCG GACGACTCGA GACAGGGGCC
GAGGCCGACA TCGCGATCGT CGATCCGGCG CGGCCGTGGA TCGTCAATTC GGGCAAGATG
GCCGCCAGCG CCGGAAACAC CCCGTTCGAC CGACGACCCG TCGAAGGCCG GGTCACCGCG
CTGTTCAAGG GCGGCAAGCA GGTCCACTGA
 
Protein sequence
MIAQPLTIIG GRLVPGSGEP VAGNLRCEGG RIVALGDVAP QDGDTVFDAG GALVAPGLVD 
LGVFAVDKPA FHFGGITRAA LMPDQAPPLD HPARVRFAAQ SGKPDMWVHP LAAATRGLEG
TELAELALMR DAGAKAVSTG RAWIGDSGVM LRLLRYCAML RLVVVTHAED AAITGSAVAT
AGEVATRLGL PSAPAEAEAL AVARDIALAE MSGCHVHFRQ VTTAQALDLV RTAKARGVRV
TAGVTPAHFV LSDLELVGFR TFCRLSPPLR SDADRKAVIA AIADGTIDVI ASGHDPRGPE
DKRLPFADAE PGMAGAETLL PLTLTLVRDG VIDLARAFEL LAGNPARLLG VDAGRLETGA
EADIAIVDPA RPWIVNSGKM AASAGNTPFD RRPVEGRVTA LFKGGKQVH