Gene Saro_3483 details


Gene Information

Locus tag: Saro_3483
Symbol:
ID: 5077632
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_009427
Strand:
Start bp: 85445
End bp: 86629
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 62%
IMG OID: 640481207
Product: Rieske (2Fe-2S) domain-containing protein
Protein accession: YP_001165869
Protein GI: 146275709
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
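
The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is an untranslated stop codon; neither assumption is stated by the record itself.

```python
# Values copied from the Gene Information table.
start_bp = 85445
end_bp = 86629

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values match the listed gene length (1185 bp) and protein length (394 aa), with no leftover bases.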


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGAAA TCTATGAAAA ATACGCCCAC GCGGCGGAAC GCATGCTCCA CTTCGTGGAG 
ACCAGGACGA CCGATCAGGC GAGCAGCACG ATGCGCGTGC CCGTCGCAGA CTATCTGGAC
CAGGGCCGCT TCGATCTCGA GATCGATCGC ATCTTCAAGC GGCTTCCGCT GATGCTGGCG
CTGACCATCG AACTGCCCGA GGTGAACGAC TACAAGGCGA TGGACGTCAT GGGCCTGCCG
GTGCTCATCA CCCGCGGCCG CGACGGCAAG GCGCGGGCCT TCCTCAACGT GTGCAAGCAC
CGCGCGATGC ATCTGGCCAA GGAAGGCAAG GGTAACTGCA AGGCCTTCGC CTGCCAGTAC
CATGGCTGGA CGTATGCCAA TGACGGCAAG CTGATGGGCA TCGCCGAGGC CAGCACGTTC
GGTGACGTCG ATCGCTCGAC GCTCAACATG ACCGAACTGC CTTGCGACGA GGCGGCGGGG
CTGATCTTCG TGATCCTCAC TCCGGGCCTG CCGATCAACG CGGTCGAGTG GATGGGGGGC
ATGTACGAGG ACTTCGCCGC CCTCAAGCTC GAGACGTGGT ATTACCACAA GAGCAAGGCC
ATGAAGGGCG CAAACTGGAA GGTCGCCTAC GACGGCTACC TCGAGGGCTA TCACTTCCAG
GCCGCGCACA CCAATACCGT TGCCACGCGC AGCCCTTCAA ACCGCGCAAG CTATGAAGGT
TTCGGGCCGC ACATCCGCCT GGGCTTTCCG CAGAACACGA TCACACGCCT CCATGAACTG
CCGCGTGACG AATGGGGCCG GCAGGAGAAC AAGGGCTACG ATTTCATCCG CATGCTCTTC
CCGAACATGA GCTTCTTCCT GGCGCCCGAA ATGGGGCAGC TCGCGCAGCT GTTCCCCGGA
CCGCAGGCGA ACCAGAACAC CACCGTGATG AACTACATCT TCCCGGTGAA GCCAGAGACC
GAGGAGGGCC TCGCCGCGCT CGACCAGATG TGCGACTTCT TCTTCGACGT GGTGGAGGAG
GAGGATTACT TCCTGGGCCT CAAGGTGCAG AACGGGCTGG AATCAGGCGC GATGACGCAC
CAGACCTTCG GCCGCAACGA ACCCGGCAAC CAGTTCTTCC ACAAATGGGT GGCCTACTAT
CTCGACGAAA GCGGCCAGAC CCCGATGCCG GTGATGAAGG AGTAG
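
The gene sequence is printed in space-separated blocks of ten bases, so it needs cleaning before any computation. The sketch below, with hypothetical helper names `clean_sequence` and `gc_fraction`, strips the spacing and computes the fraction of G and C bases; it is shown on the first printed line only, as an illustration rather than the method the database used for its GC content figure.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, exactly as printed in the record.
first_line = "ATGACCGAAA TCTATGAAAA ATACGCCCAC GCGGCGGAAC GCATGCTCCA CTTCGTGGAG"
dna = clean_sequence(first_line)
print(len(dna), f"{gc_fraction(dna):.0%}")
```

Passing the full gene sequence through the same two functions gives a whole-gene value that can be compared with the GC content listed in the table; how the database rounds that figure is not stated.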
 
Protein sequence
MTEIYEKYAH AAERMLHFVE TRTTDQASST MRVPVADYLD QGRFDLEIDR IFKRLPLMLA 
LTIELPEVND YKAMDVMGLP VLITRGRDGK ARAFLNVCKH RAMHLAKEGK GNCKAFACQY
HGWTYANDGK LMGIAEASTF GDVDRSTLNM TELPCDEAAG LIFVILTPGL PINAVEWMGG
MYEDFAALKL ETWYYHKSKA MKGANWKVAY DGYLEGYHFQ AAHTNTVATR SPSNRASYEG
FGPHIRLGFP QNTITRLHEL PRDEWGRQEN KGYDFIRMLF PNMSFFLAPE MGQLAQLFPG
PQANQNTTVM NYIFPVKPET EEGLAALDQM CDFFFDVVEE EDYFLGLKVQ NGLESGAMTH
QTFGRNEPGN QFFHKWVAYY LDESGQTPMP VMKE