Gene Saro_3832 details

Gene Information

Locus tag: Saro_3832
Symbol:
ID: 5077980
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_009427
Strand:
Start bp: 486063
End bp: 487253
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 65%
IMG OID: 640481555
Product: cytochrome P450
Protein accession: YP_001166217
Protein GI: 146276057
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2124] Cytochrome P450
TIGRFAM ID:
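
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon (both assumptions agree with the values in this record). The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues for an unspliced CDS whose length includes one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length = gene_length_bp(486063, 487253)
print(length)                     # 1191
print(protein_length_aa(length))  # 396
```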


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.0796027
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCACCGCG CCATGACCAC GACCGTGCAA GATTTCGACC CCGAGGTTCC CGAAGACTTC 
GACAGCCCGC ATGCCGAATA TGCGCGCCTG CGCCGCGAGT GTCCCGTTGC GCATACCAAT
GGCCTGGGCG GGTTCTGGGC GCTGACGCGC TATGAGGACG TCAAGCGCGC GGCTTCCGAT
TCGACCACGT TCATCACTTC GGTGCAGAAC GTGGTGCCCA AGGTGGCATT TACCGGACGC
CGCCCTCCGC TACATCTCGA TCCGCCCGAG CACACGCCCT ATCGCAAGGC GCTTAACCCG
CTGCTCTCGC TCGAGCGTTC CGAAGCGTTT GCCGGAAAGG CGCGCGAGCT GACGCGCAAG
CTTCTGGCAC CGATGGTGGA GAACGGCGGC GGCGACATCT GCGTCGAGCT TTCGAGCTAT
CTGCCCGTCC ACGTCTTCGG CGAATGGATG CGCATGCCGG AGGAATGGCT CGACACGCTG
CACGACGCGG GTCGCGCGTT CATTCTTGCG GTCCATTCGA ACACGCCGGA GCGGATGAAG
GAAACGTCGC TGCGGCTCTA CGACATGGCG CGCGGCCTGA TCGCCGTTCG TCGGGAAAAC
CCGCAGGATC CCGCGCTCGA TCCGACAAGC GCATTGCTTG CGGCCCGCCA CGAGGGCGAA
CCTCTGCCCG AGGAACTGCT GGTGGGCACG GTGCGGCAGG TGCTGGTCGT GGGCATGGTC
GCGCCGATGG TCATGATCGG CAACATCTGC GTCCACCTCT CGCGCGACAA GGCGCTGCAG
CAGCAGCTTC GTGCCGATCC CTCGCTGGTG CCGGCGGCAA TCGAGGAATT CCTGCGGCTC
TACACGCCCT ATCGCGGATT TGCCCGGACG GCGGTGTGCG ACGTGGATAT GGGCGGACGC
ACGATCCCCA AGGACGAGGC GATCGCGCTG GTCTATGCAT CGGCAAACCG CGACGAGGAC
GTGTTCCCGG ACGGCGACAA GTTCATCCTC AACCGCCCCA ACATCGCGCA GCACCTGGCT
TTCGGTCGCG GGCCGCATAA TTGCCCCGGC GTGCATCTGG GACGGATGCA GCTTCGCGTG
GCGCTGGAGG AAATCCTGGC CGCAACGCGC GAGTTCGAGC TTTCCGGGCC GGTAAGCGTG
AGCCGCTGGC CCGAGGTCGG CGCGCTTTCG GTGCCGCTGC GCTTCGTTTG A
 
Protein sequence
MHRAMTTTVQ DFDPEVPEDF DSPHAEYARL RRECPVAHTN GLGGFWALTR YEDVKRAASD 
STTFITSVQN VVPKVAFTGR RPPLHLDPPE HTPYRKALNP LLSLERSEAF AGKARELTRK
LLAPMVENGG GDICVELSSY LPVHVFGEWM RMPEEWLDTL HDAGRAFILA VHSNTPERMK
ETSLRLYDMA RGLIAVRREN PQDPALDPTS ALLAARHEGE PLPEELLVGT VRQVLVVGMV
APMVMIGNIC VHLSRDKALQ QQLRADPSLV PAAIEEFLRL YTPYRGFART AVCDVDMGGR
TIPKDEAIAL VYASANRDED VFPDGDKFIL NRPNIAQHLA FGRGPHNCPG VHLGRMQLRV
ALEEILAATR EFELSGPVSV SRWPEVGALS VPLRFV
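
The GC content and the protein sequence listed in this record can both be derived from the gene sequence. The sketch below shows one way to do that, assuming the sequence is pasted in as plain text with the same space-separated blocks of ten used above. The helper names are hypothetical. The codon table string is the standard amino acid assignment in TCAG base order; translation table 11 uses the same assignments and differs only in which codons may act as start codons, which this sketch does not model.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order (shared by translation tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(sequence: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(sequence.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(sequence)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(sequence: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(sequence)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First four codons of the gene sequence above.
print(translate("ATGCACCGCG CC"))  # MHRA
```

Passing the full gene sequence to `gc_fraction` and `translate` should give values comparable to the GC content and protein sequence fields in this record.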