Gene Saro_3533 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3533 
Symbol 
ID5077682 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009427 
Strand
Start bp150238 
End bp151434 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content67% 
IMG OID640481257 
Productcytochrome P450 
Protein accessionYP_001165919 
Protein GI146275759 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAGCGC CCGCTCACGT ACCCGCCGAC CGCGTCGTCG ATATCGACAT CTACATGCCG 
CCCGGCCTTG CAGAACACGG GTTCCACAAG GCGTGGAGCG ATCTGTCGGC CGGCAATCCC
GCCGTCGTCT GGACGCCCCG CAACGAGGGA CACTGGATCG CGCTGGGTGG CGAGGCCCTG
CAGGAAGTCC AGTCGGACCC CGAACGCTTC TCCTCGCGCA TCATCGTCCT GCCCAAGTCA
GTGGGCGAGA TGCACGGCCT GATCCCCACC ACCATCGACC CGCCCGAGCA CCGCCCGTAC
CGTCAGCTCC TCAACGCGCA TCTCAATCCC GGTGCGATAC GCGGGCTTTC CGAGAGCATC
CGCCAGACCG CGGTGGACCT GATCGAGGGC TTCGCGGCGC AAGGGCACTG CAACTTCACC
GCCCAGTATG CCGAGCAGTT CCCGATCCGG GTGTTCATGG CGCTCGTCGG CATCGAAGCA
TCCGAGGCGC CCAGGATACG CCACTGGGCC GAATGCATGA CCCGCCCCGG CATGGACATG
ACTTTCGACG AGGCCAAGGC GGTCTTCTTC GATTACGTCG GCCCACTGGT CGATGCCCGG
CGCGAGACGC CGGGCGAGGA CATGATCAGC GCGATGATAA ACGCCGACCT CGGAGATGGA
CGCCGCCTCA CCCGTGACGA AGCGCTGTCC GTCGTCACGC AGGTGCTGAT CGCGGGGCTC
GATACCGTGG TCAACGTGCT CGGCTTCATC ATGCGCGAGC TGGCCGGGAA CCCCGCCCTG
CGGGCCGATC TCCGGCAGCG CGGCGCGGAC ATCCTGCCCG TCGTCCATGA ACTGTTCCGC
CGCTTCGGCC TTGTCTCCAT CGCCCGCGAG GTGCGCCGCG ACATCGAGTT CCACGGCGTT
CACCTGAAGG CCGGCGACAT GATCGCCATC CCGACCCAGG TTCATGGTCT CGACCCGCGC
GTGAACCCCG ATCCTCTCGC CATCGATCCG TCGCGCAAGC GCGCGCGCCA TTCCACTTTC
GGCTCCGGCC CGCACATGTG CCCGGGCCAG GAACTCGCGC GCAAGGAGGT GGCGATCACG
CTCGAGGAAT GGCTGCGCCG CATCCCCGAT TTCGCGCTCG GGCCGAACTC GGACCTCTCG
CCCGTGCCCG GAATCGTCGG CGCCCTGCGC CGCGTGGAAC TGGTCTGGAA TACCTAG
 
Protein sequence
MEAPAHVPAD RVVDIDIYMP PGLAEHGFHK AWSDLSAGNP AVVWTPRNEG HWIALGGEAL 
QEVQSDPERF SSRIIVLPKS VGEMHGLIPT TIDPPEHRPY RQLLNAHLNP GAIRGLSESI
RQTAVDLIEG FAAQGHCNFT AQYAEQFPIR VFMALVGIEA SEAPRIRHWA ECMTRPGMDM
TFDEAKAVFF DYVGPLVDAR RETPGEDMIS AMINADLGDG RRLTRDEALS VVTQVLIAGL
DTVVNVLGFI MRELAGNPAL RADLRQRGAD ILPVVHELFR RFGLVSIARE VRRDIEFHGV
HLKAGDMIAI PTQVHGLDPR VNPDPLAIDP SRKRARHSTF GSGPHMCPGQ ELARKEVAIT
LEEWLRRIPD FALGPNSDLS PVPGIVGALR RVELVWNT