Gene Saro_0514 details


Gene Information

Locus tag: Saro_0514
Symbol:
ID: 3918644
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 555434
End bp: 556699
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 62%
IMG OID: 640443244
Product: cytochrome P450
Protein accession: YP_495795
Protein GI: 87198538
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2124] Cytochrome P450
TIGRFAM ID:
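
The coordinates and lengths listed above are mutually consistent, and the arithmetic can be sketched as follows. This is an illustrative check only, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database tool.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 555434
END_BP = 556699


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1266 421
```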


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGCCC AAACTTCTAC GGCGACCCAG AAGCATCGCG TTGCTCCGCC GCCACACGTG 
CCGGGCCATC TGATCCGGGA GATCGACGCA TACGACCTGG ACGGCCTGGA GCAGGGTTTC
CACGAAGCAT GGAAGCGGGT GCAGCAACCC GATACGCCGC CGCTCGTCTG GACGCCGTTC
ACTGGCGGGC ACTGGATCGC AACCCGCGGT ACCTTGATCG ACGAGATCTA TCGCAGCCCC
GAACGCTTCT CCAGCCGCGT GATCTGGGTC CCGCGCGAAG CGGGCGAGGC GTACGACATG
GTGCCGACCA AGCTCGATCC GCCCGAGCAT ACACCCTATC GCAAGGCGAT CGACAAGGGC
CTGAACCTTG CGGAAATCCG CAAGCTCGAG GACCAGATCC GGACCATCGC GGTCGAGATC
ATCGAAGGCT TCGCCGATCG CGGCCATTGT GAGTTCGGCA GCGAGTTCTC GACGGTGTTT
CCAGTCAGGG TGTTTCTCGC GCTGGCCGGG CTGCCGGTTG AAGATGCCAC GAAGCTTGGC
CTTCTGGCGA ACGAGATGAC GCGGCCCTCG GGCAACACGC CGGAAGAGCA GGGGCGGTCG
CTGGAAGCGG CAAACAAGGG ATTTTTCGAG TACGTCGCGC CGATCATCGC TGCGCGCAGG
GGAGGCAGTG GTACTGACCT CATCACGCGC ATTCTCAACG TCGAAATCGA CGGCAAGCCG
ATGCCCGACG ACCGTGCGCT AGGCCTGGTT TCGCTCCTGC TGCTCGGAGG GCTCGACACT
GTCGTCAACT TCCTCGGCTT CATGATGATC TACCTTTCCC GGCACCCCGA AACGGTTGCC
GAAATGCGGC GCGAACCATT GAAGCTGCAA CGCGGCGTTG AAGAGCTGTT CCGTCGCTTC
GCGGTCGTTT CGGATGCACG ATATGTCGTT TCGGACATGG AGTTCCATGG CACCATGCTT
AAGGAGGGCG ACCTCATCCT CCTGCCAACG GCTCTGCACG GGCTTGACGA CAGGCATCAT
GACGATCCCA TGACCGTCGA CCTGTCGCGG CGCGATGTCA CTCACTCGAC TTTCGCCCAG
GGGCCGCACC GCTGCGCGGG CATGCACCTC GCGCGCCTCG AGGTGACGGT CATGCTGCAG
GAATGGCTGG CCCGCATTCC GGAATTCAGG CTGAAGGACA GGGCAGTGCC AATCTACCAT
TCAGGCATCG TCGCGGCGGT CGAGAACATT CCACTGGAAT GGGAGCCTCA GAGGGTTTCG
GCATGA
 
Protein sequence
MNAQTSTATQ KHRVAPPPHV PGHLIREIDA YDLDGLEQGF HEAWKRVQQP DTPPLVWTPF 
TGGHWIATRG TLIDEIYRSP ERFSSRVIWV PREAGEAYDM VPTKLDPPEH TPYRKAIDKG
LNLAEIRKLE DQIRTIAVEI IEGFADRGHC EFGSEFSTVF PVRVFLALAG LPVEDATKLG
LLANEMTRPS GNTPEEQGRS LEAANKGFFE YVAPIIAARR GGSGTDLITR ILNVEIDGKP
MPDDRALGLV SLLLLGGLDT VVNFLGFMMI YLSRHPETVA EMRREPLKLQ RGVEELFRRF
AVVSDARYVV SDMEFHGTML KEGDLILLPT ALHGLDDRHH DDPMTVDLSR RDVTHSTFAQ
GPHRCAGMHL ARLEVTVMLQ EWLARIPEFR LKDRAVPIYH SGIVAAVENI PLEWEPQRVS
A
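
The relationship between the gene sequence and the protein sequence above can be checked with a short script. The sketch below is not the tool that generated this record; it assumes the sequence is pasted in the space-separated 10-letter blocks shown here, and the helper names (`clean`, `gc_content`, `translate`) are hypothetical. NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts, so a standard codon table is used; this gene begins with ATG, so no special start-codon handling is needed.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons enumerated in TCAG order
# (same codon assignments as the standard genetic code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used to display the sequence."""
    return "".join(blocked.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence above.
first_line = "ATGAACGCCC AAACTTCTAC GGCGACCCAG AAGCATCGCG TTGCTCCGCC GCCACACGTG"
print(translate(clean(first_line)))  # MNAQTSTATQKHRVAPPPHV
```

Passing the full gene sequence through `clean` and then `translate` or `gc_content` gives values that can be compared against the protein sequence and the GC content field in the record.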