Gene Saro_3431 details


Gene Information

Locus tag: Saro_3431
Symbol:
ID: 5077580
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_009427
Strand:
Start bp: 31008
End bp: 32597
Gene Length: 1590 bp
Protein Length: 529 aa
Translation table: 11
GC content: 65%
IMG OID: 640481155
Product: amine oxidase
Protein accession: YP_001165817
Protein GI: 146275657
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1233] Phytoene dehydrogenase and related proteins
TIGRFAM ID:
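
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, has_stop_codon=True):
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS length is a multiple of three and, by default,
    that the final codon is a stop codon with no residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values copied from the record above.
length = gene_length(31008, 32597)
print(length)                  # 1590
print(protein_length(length))  # 529
```

Both results agree with the Gene Length (1590 bp) and Protein Length (529 aa) fields of this record.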


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.478071
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCAGT ACGACATCGT CGTCATGGGC GCAGGCCACA ATGGCCTGAC CGCCGCCGCC 
TACATGGCGA AGGCCGGTAA GAAGGTGCTG GTGCTGGAGC GCAAGCCGCA TTTCGGCGGC
GGGGTTTCGA CGCGCGAACT GCTGCACCCC GGCTTCTGGC ACGACGAGCA TTCCAACGTG
CACATCATGA TCCAGGGCAA TCCGATGCTG CGCGAGGACG AGCTGGGATT GCTGTCCCGG
TTCGGCCTGG AATACATCTA TCCGGACCTC GTCCATGTCT CGATCTGGGA GGACGGGACG
GTCATCCGTT CCTACAAGGA CCTCGACCGG ACGTGCGAGG AACTGGCCCG CGTCGCCGGG
CCGAAGGACG CCGAAGCCTA TCGCCGGTTC GTGAAGATGA GCCAGACCGC GCTGCCGATG
CTGGTCAGCG GGCTCTATTC GCCGCCGTTC CCGCTGGGCG CGTTCGTGGC GATGATGGAC
CAGTCGGACG AGGGGCGCTT TCTACTCGAT CTCATGCAGC GCAGCGCGCT CGACATCGTC
GACGCCTATT TCGAGAGCGA CCTGCTGAAG CTCCACATCG TGCGCATGGT GACCGAGAAC
CTGCAGATGC CCGACGAACT GGGCACCGGC ATGGGCGCCT TCGTCATGCC CGGCATCATC
CACACTTACG GCTGTTCGAT GCCCAAGGGC GGTTCGGGCC AGCTTTCCAG GGCGCTGGTC
CGCGCGATCG AGCATTTCGG CGGCGAAGTG CGCTGCAATG CCGAAGTCGC GCGGGTGATC
GTGTCGGGCG GGAAGGCGGT CGGGCTCGAG CTGACGGACG GCGAGACCTT CATGGCGCGC
GACGGGGTGA TCGGGGCGAT CCACCCGCAC GTCCTGCGCA AGTTCGTGGG CGAAACGCCC
GAGCCGGTGC TGGAGCGGGC CGAGCGCGTG ACCCAGTCCA CCTTCTCGAT CAATCTCACG
CACATGACGC TGAAGGAGCG GCTGCGCCTC AAGGTCGGTA ACGACTGCAA CGCGATGATG
ACCGAGCTGA TGGACTTCTA CTCGATCCGG GAGATGCTGC TCGAATACGA CAAGCTTCGT
CGCGGCGAGG TCAGCGAGCG GTTGATCGCG GGCGGCGACA ACACGATTTT CGATCCCTCG
CGCGCGCCGG AAGGCGCCGG GGTGTTCTAC GGCGTGAACT TCGCGCCCTA TGACCTCCAT
CCCGGCGGAT CCGGATCGTG GGACGAACGC AAGGAGGAAG TCGCCGACCG CGCCTTGGCG
CAGTACCGCA AGTTCTACGC GAACCTTGCC GACGAGAACA TCACCGGCCG CCTGATCCGC
TCGCCAGTCG ACCATGAGCG CGACAGCCCG GCAAGCTTCG TGCGCGGCGA CATCCATGGT
TGCGCGCCAT TCTTCTACCA GTCTGCGGGA CACCGCCCGA CGCCGGATCT CGGCTCGTTC
CGGGTGCCGG GGGTGGAGGG GCTCTATCTG GTGGGGCCGT TCATGCACCC GGGCGGCGGC
GTGTTCGGCG CAGGACGGGC GACCGCCATC CAGATGATGG ACGACATGGG GATCGACTAC
GACAAGGTCT GCGGGAGGGC CGTGCAGTGA
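
The gene sequence above is laid out in blocks of 10 bases, so it needs to be joined before any analysis. The following is a minimal sketch of one way to do that and to compute a GC fraction comparable to the GC content field of this record. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text):
    """Strip all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(text):
    """Fraction of G and C bases in a (possibly block-formatted) DNA sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used as sample input.
first_line = "ATGAGCCAGT ACGACATCGT CGTCATGGGC GCAGGCCACA ATGGCCTGAC CGCCGCCGCC"
print(len(clean_sequence(first_line)))   # 60 bases in six blocks of 10
print(round(gc_fraction(first_line), 2))
```

Passing the full gene sequence instead of `first_line` gives the value to compare with the reported GC content.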
 
Protein sequence
MSQYDIVVMG AGHNGLTAAA YMAKAGKKVL VLERKPHFGG GVSTRELLHP GFWHDEHSNV 
HIMIQGNPML REDELGLLSR FGLEYIYPDL VHVSIWEDGT VIRSYKDLDR TCEELARVAG
PKDAEAYRRF VKMSQTALPM LVSGLYSPPF PLGAFVAMMD QSDEGRFLLD LMQRSALDIV
DAYFESDLLK LHIVRMVTEN LQMPDELGTG MGAFVMPGII HTYGCSMPKG GSGQLSRALV
RAIEHFGGEV RCNAEVARVI VSGGKAVGLE LTDGETFMAR DGVIGAIHPH VLRKFVGETP
EPVLERAERV TQSTFSINLT HMTLKERLRL KVGNDCNAMM TELMDFYSIR EMLLEYDKLR
RGEVSERLIA GGDNTIFDPS RAPEGAGVFY GVNFAPYDLH PGGSGSWDER KEEVADRALA
QYRKFYANLA DENITGRLIR SPVDHERDSP ASFVRGDIHG CAPFFYQSAG HRPTPDLGSF
RVPGVEGLYL VGPFMHPGGG VFGAGRATAI QMMDDMGIDY DKVCGRAVQ
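
The protein sequence can be reproduced from the gene sequence with translation table 11, which assigns the same amino acids as the standard genetic code and differs only in its permitted start codons. The following is a minimal sketch under that assumption. It does not handle alternative start codons, which is not an issue here because the gene begins with ATG. `translate` is an illustrative helper, not a library function.

```python
# Codon table in the NCBI ordering (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and the last line of the gene sequence above.
print(translate("ATGAGCCAGT ACGACATCGT CGTCATGGGC"))  # MSQYDIVVMG
print(translate("GACAAGGTCT GCGGGAGGGC CGTGCAGTGA"))  # DKVCGRAVQ
```

The two outputs match the first block and the final residues of the protein sequence listed above.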