Gene Saro_3337 details

Gene Information

Locus tag: Saro_3337
Symbol:
ID: 3915984
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 3559319
End bp: 3560713
Gene Length: 1395 bp
Protein Length: 464 aa
Translation table: 11
GC content: 67%
IMG OID: 640446122
Product: cytochrome P450
Protein accession: YP_498606
Protein GI: 87201349
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2124] Cytochrome P450
TIGRFAM ID:
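
The length fields in this record can be cross-checked against the coordinates. The sketch below is a minimal illustration, assuming the start and end positions are 1-based and inclusive (an assumption that is consistent with the listed gene length) and that the CDS ends in a single untranslated stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 3559319
end_bp = 3560713

# Assumption: coordinates are 1-based and inclusive,
# so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1395 0 464
```

A remainder of 0 indicates the gene length is a whole number of codons, which matches the listed 1395 bp and 464 aa.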


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCAGCA TCGCGCCTGA CAGCAGGACA GATCTACATA CCGAACGCGC CAACCCGCAC 
TGGGTAAGGC TGGGCGGGGA CCACAAGCTG GACCATGTCC CCGGCGAGGA CGGCTGGCCG
GTGCTCGGCA CCACGCTGAT GCAACTGGCA GATCCGCTGG GGTTCCAGAG ACGCATGGTC
GAGACCCACG GCCCGGTGTT CCGGACGCGC AGCTTCGGAC GGCGCGGAGT GAACCTTATC
GGCGCGGACG CAAACGAACT GGTGCTGTTC GACCGCGACC GGCTGTTCTC CAACGAACAG
GGCTGGGGAC CGGTACTCAA CCTGCTGTTC CCCCGGGGCC TGATGTTGAT GGACTTCGAG
GCGCACCGGG TGGACCGCCG CGCGCTGTCC ATCGCTTTCA AGCCGGAGCC GATGCGCGCC
TATTGCAGCG TGCTCAACAC AGGCATCGCG CAGGCCGTGC AAGGCTGGGG CGGCCAGATG
CGGTTCTACG ACGCGATCAA GGCCCTGACG CTCGACACCG CCGCCTCCAG CTTCCTCGGC
CTTCCGCTCG GGCCCGAGGC CGACCGGCTC AACAAGGCCT TCGTCGACAT GGTCCAGGCG
TCGGGCGGGG TCGTTAGACG CCCTCTGCCC TTCACCAGGA TGGGCAAGGG CGTAGCGGGA
CGGCGCCTGA TGGTCGAATA CTTCGGCCGG CTGGTGCGCG AGCGGCGCGC GGATCCCGGG
CAGGACATGT TCAGCCAGTT CGCGCTCGCC ACGCGCGAGG ACGGCTCGCT CCTGCCCGAG
GACGTGGTGG TCGACCACAT GATCTTTCTG ATGATGGCCG CTCACGACAC CATCACCAGT
TCGGCCACGG TGCTGTTCTG GCAATTGGCC CGGAACCCCG ACTGGCAGGA CCGACTGCGC
GCCGAAGCCC GCGCCGTGAC CGGGGGCGAC GGCCTTCCAC TGGCCTACGA GGACCTCGGC
CGGATGGAAT TGACCGAGAT GGCGTTCAAG GAGGCGCTGC GCTTCATGCC GCCTGTGCCC
AACATGCCGC GCCGCGCGCT GCGTGACTTC GAGTTCGGCG GCTACCGCAT CCCGGCAGGG
ACGCCGGTGG GGATCAGCCC GGCGGCCGTC CACGCCGATC CTGCGCATTG GCCCGAACCG
GACCGATTCG ATCCGCTACG ATTCACCCCG GAAAACGTCT CGGGACGCCA CAAGTATGCC
TGGGTGCCCT TCGGCGGCGG CGCACACATG TGCCTCGGGC TGCATTTTGC CTATATGCAG
GTGAAGTTGC TGGTCAGTCA CATCCTGACC CGCTACGAGG TCGCCATGCA GCCGGGCCCC
GCGCCTTCGT GGCAGGCCTG GCCTATCCCG AAGCCCCGCG ATGGCCTGCG CGTGGAGATG
CGCCGAATCT GTTGA
 
Protein sequence
MASIAPDSRT DLHTERANPH WVRLGGDHKL DHVPGEDGWP VLGTTLMQLA DPLGFQRRMV 
ETHGPVFRTR SFGRRGVNLI GADANELVLF DRDRLFSNEQ GWGPVLNLLF PRGLMLMDFE
AHRVDRRALS IAFKPEPMRA YCSVLNTGIA QAVQGWGGQM RFYDAIKALT LDTAASSFLG
LPLGPEADRL NKAFVDMVQA SGGVVRRPLP FTRMGKGVAG RRLMVEYFGR LVRERRADPG
QDMFSQFALA TREDGSLLPE DVVVDHMIFL MMAAHDTITS SATVLFWQLA RNPDWQDRLR
AEARAVTGGD GLPLAYEDLG RMELTEMAFK EALRFMPPVP NMPRRALRDF EFGGYRIPAG
TPVGISPAAV HADPAHWPEP DRFDPLRFTP ENVSGRHKYA WVPFGGGAHM CLGLHFAYMQ
VKLLVSHILT RYEVAMQPGP APSWQAWPIP KPRDGLRVEM RRIC
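
The sequences above are printed in space-separated groups of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tool, and only the first and last lines of the gene sequence are used as a short sample.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence above, used as a short sample.
first_line = "ATGGCCAGCA TCGCGCCTGA CAGCAGGACA GATCTACATA CCGAACGCGC CAACCCGCAC"
last_line = "CGCCGAATCT GTTGA"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

print(len(head), head[:3])   # 60 ATG (start codon)
print(len(tail), tail[-3:])  # 15 TGA (stop codon)
```

Applying `clean_sequence` to the full gene block and passing the result to `gc_fraction` would give a value to compare against the GC content listed in the record.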