Gene Saro_0538 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0538 
Symbol 
ID3918668 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp587562 
End bp588896 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content64% 
IMG OID640443268 
ProductRieske (2Fe-2S) protein 
Protein accessionYP_495819 
Protein GI87198562 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID[TIGR03229] benzoate 1,2-dioxygenase, large subunit 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGCCG CGCTGAAAAC CCGCATTGAA ACGGCCGTCG TCGATGACGC GGAAACCGGG 
CAGTTCCGCT GCCGCCGCGA CATCTTCACC GATCCCGACC TGTTCGAACT CGAGATGAAG
CACATCTTCG AGGGGAATTG GGTCTACCTG GCGCACGAGA GCCAGATCCC CGCCGCCAAC
GACTGGTTCG CGGTCACGGT GGGGCGGACG CCGGTGGTCA TCACGCGCAC GAAGGATGGC
TCGTTCAACG CCTTCATCAA TGCGTGTTCT CACCGTGGCG CGATGTTGTG CCGTCGCAAG
CGGGGCAACG GTCGGGTCAT GGTCTGCCCG TTCCACGGCT GGAGCTTCAC CAACGACGGC
AAGCTGCTGA AGGCAAAGGA CGAAGGTTCG GGGGCCTATC CGGAGTCGTT CAACCGCGAT
GGTTCGCACG ATCTTGCGCG GCTGGCGCGG TTCGAGAGCT ATCGCGGCTT CCTGTTCGGC
AGTGTCAATC CGGACGTCGC GCCGCTGGCG GAATACCTTG GCGAGACGCG GGTGATAATC
GACCAGATCG TCGATCAGGC GCCGGAGGGC ATTGAGGTGC TGGCCGGAAG CTCTTCCTAT
ATCTTCGACG GCAACTGGAA GTTGCAGATG GAGAACGGGT GCGACGGATA TCACGTCAGC
TCGGTCCACT ACAATTACGC CAGGACAATG GGGCGGCGCG CCGAGGGCGG CACCAAGGCA
GTGGACGCCA ATGGCTGGTC GAAGGCAGTC AGCGGCGTCT ACGGGTTCGA CAACGGGCAC
ATCCTGCTGT GGACGCGGGT GCTCAATCCC GAGGTCCGGC CGGTGTGGTC GCGCAAGGAT
GAGCTGGAGC AGCGGCTGGG CAAGGAGCGC ACGCGCTTCA TCGTCGAGCA GAGCCGCAAC
CTGGCGCTCT ATCCGAACGT CTTCCTGATG GACCAGTTCT CGACCCAGAT CCGCGTGGTA
CGCCCGATCG ATGCCGACCA CACCGAAGTG ACGATCTACT GCTTTGCGCC CAAGGGGGAG
AGCCCGGAAC TGCGGGCTGT CCGCATCCGC CAGTACGAGG ACTTCTTCAA CGTCTCGGGC
ATGGGCACGC CGGATGATCT CGAGGAGTTC CGTACTTGCC AGGCCTCCTA TGCGGGTGCG
GGCGGGTTGT GGAACGACCT CAGCCGCGGC GCGCGGCGCT GGATTGCCGG GCCGGACGAA
AATGCGCGCG AAATGGGCTT GAACCCGTTG CTCAGCAGCG AGCGCAGCGA GGATGAAGGC
TTGTTCGTCC GCCAGCACGA ATACTGGGCG AAGACCCTTC TCGAAGGCCT GGCCCGGCAG
GAGGAAATGG CATGA
 
Protein sequence
MSAALKTRIE TAVVDDAETG QFRCRRDIFT DPDLFELEMK HIFEGNWVYL AHESQIPAAN 
DWFAVTVGRT PVVITRTKDG SFNAFINACS HRGAMLCRRK RGNGRVMVCP FHGWSFTNDG
KLLKAKDEGS GAYPESFNRD GSHDLARLAR FESYRGFLFG SVNPDVAPLA EYLGETRVII
DQIVDQAPEG IEVLAGSSSY IFDGNWKLQM ENGCDGYHVS SVHYNYARTM GRRAEGGTKA
VDANGWSKAV SGVYGFDNGH ILLWTRVLNP EVRPVWSRKD ELEQRLGKER TRFIVEQSRN
LALYPNVFLM DQFSTQIRVV RPIDADHTEV TIYCFAPKGE SPELRAVRIR QYEDFFNVSG
MGTPDDLEEF RTCQASYAGA GGLWNDLSRG ARRWIAGPDE NAREMGLNPL LSSERSEDEG
LFVRQHEYWA KTLLEGLARQ EEMA