Gene Saro_2608 details

Gene Information

Locus tag: Saro_2608
Symbol:
ID: 3917023
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 2819584
End bp: 2820849
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 64%
IMG OID: 640445367
Product: cytochrome P450 family protein
Protein accession: YP_497878
Protein GI: 87200621
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2124] Cytochrome P450
TIGRFAM ID:
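
As a consistency check on the coordinate fields above, the following minimal sketch recomputes the gene and protein lengths from the start and end positions. It assumes that the coordinates are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Start bp and End bp as listed in this record.
length_bp = gene_length(2819584, 2820849)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the sketch gives 1266 bp and 421 aa, which agrees with the Gene Length and Protein Length fields.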


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAATTTC CCTTCTCGCG CTCGACCAAC CCCAACGTCG ACCTGTCATC GCTCGACGCG 
TTCAACGAGG GCGCGCCCTT CGCGACTTTC GACCGCATGC GCCGGGAGGA CCCGATGGCG
TGGTCCGAAA TGGTCAACGG CGATCGCGGC TTCTGGTCGG TGACGCGCCA TGCCGATCTC
CTCGAACTGA ACCGACAGGC GGACCTCCTG TCCTCGGCCA AGGGCATCCG CATGGAGGAC
CAGACCGAGG AGGAATACGA GGCGCGCAAG ACCTTCCAGG AAACCGACGC GCCGCACCAC
CGTGGTTTCC GTGCGCTGGT TTCCAAGGCC TTCTCGAAAG GCACCGTCGC CGGCTTCGAG
GACCAGATCC GCAAGATCGT GACCGACCTG CTCGACGTGG CGCTGGCCGA GGGCGAGTTC
GACGCGGTCG ACCGCATCGC GCGACGCCTG CCGATGCAGA TGCTCGCGCA GATCATGGGC
GTACCGCAGG AAGACGGGCC GTGGCTGGTG GAAAAGGGCG ACGCGCTGAT TTCCAACTCC
GACCCCGACT ATACCGATTT CGTGGTCGAT CAGGTGGATA CGGAAGCCTA CCGGATGCTG
CCGTTCCGCT CGCCTGCGGC GGTCGAGTTG TTCGACTATG CCAACGGCCT GCTCGACCGG
ATGGACGCGG GCGAACAGAT CGGGGTGCTG AACCTTGTCC GCGAGCCGAC CAGCACCGGC
ACGCGAATGA GCCGCGACGA GTTCCGCAAC TTCTTCTGCC TTCTGGTCGC AGCCGGAAAC
GACACGACGC GCTACTCGAT CTCTGCGACG ATCCACGCGC TCGCCAACAA CCCCCACCTG
TTGCAGGCGC TGAAGGACGG CGACTTCACG AGCTGGGAAG CAGCCGCGGA CGAGATGATC
CGCTATGCCT CGCCCACGAC GCACTTCCGT CGCACCGCCA CCCGCGACTT CACCTTCCAC
GACCGGCACG TGAAGGCTGG CGACAAGGTG CTGCTGTGGT TCATTTCGGG CAACCGCGAC
GAGACCGCCA TTCTCGATCC CTACACGATC AACCTTCGCC GGGAACGCAA CCCGTTCCTC
TCGTTCGGCC AGGGCGGCCC GCACATCTGC CTTGGCATGT GGCTGGCCAA GCTCGAGGTC
GCGATCGTCA TGCAGGAACT CGCCAAGCGC CTCTCCAGCA TCGAGCAGGT CGCGGAGCAC
AGCTACCTGC GGTCAAACTT CATTCACGGC ATCAAGCACC TGCCGGTTCG CATTGTCGCC
CGCTGA
 
Protein sequence
MQFPFSRSTN PNVDLSSLDA FNEGAPFATF DRMRREDPMA WSEMVNGDRG FWSVTRHADL 
LELNRQADLL SSAKGIRMED QTEEEYEARK TFQETDAPHH RGFRALVSKA FSKGTVAGFE
DQIRKIVTDL LDVALAEGEF DAVDRIARRL PMQMLAQIMG VPQEDGPWLV EKGDALISNS
DPDYTDFVVD QVDTEAYRML PFRSPAAVEL FDYANGLLDR MDAGEQIGVL NLVREPTSTG
TRMSRDEFRN FFCLLVAAGN DTTRYSISAT IHALANNPHL LQALKDGDFT SWEAAADEMI
RYASPTTHFR RTATRDFTFH DRHVKAGDKV LLWFISGNRD ETAILDPYTI NLRRERNPFL
SFGQGGPHIC LGMWLAKLEV AIVMQELAKR LSSIEQVAEH SYLRSNFIHG IKHLPVRIVA
R
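
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It is a hand-rolled example, not the tool used to produce this record. It assumes the sequence is given in the coding orientation and reads it in frame from the first base. For ordinary codons, translation table 11 assigns the same amino acids as the standard code, so one 64-letter lookup string is enough here; alternative start codons, which table 11 also permits, are not handled because this gene starts with ATG. The names BASES, AMINO_ACIDS and translate are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in the order TTT, TTC, TTA, TTG, TCT, ...
# ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(dna: str) -> str:
    """Translate a coding sequence in frame, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group bases in blocks of 10.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        # Raises ValueError on any character other than T, C, A or G.
        b1, b2, b3 = (BASES.index(base) for base in seq[i:i + 3])
        amino_acid = AMINO_ACIDS[16 * b1 + 4 * b2 + b3]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed in this record.
print(translate("ATGCAATTTC CCTTCTCGCG CTCGACCAAC"))  # MQFPFSRSTN
```

The first ten residues match the start of the listed protein sequence, and the final six bases (CGCTGA) translate to R followed by a stop codon, consistent with the protein ending in R.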