Gene Saro_2249 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2249 
Symbol 
ID3916565 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2389017 
End bp2390207 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content67% 
IMG OID640445003 
Productcytochrome P450 
Protein accessionYP_497520 
Protein GI87200263 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.274245 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATACCGG CACACGTTCC GGCCGACCGG GTGGTCGATT TCGACATCTT CAATCCGCCG 
GGCGTGGAAC AGGACTACTT CGCAGCCTGG AAGACCCTGC TCGATGGGCC GGGGCTGGTC
TGGAGCACGG CCAACGGCGG GCACTGGATC GCCGCGCGTG GCGATGTGGT GCGCGAACTG
TGGGGAGATG CGGAGCGGCT CTCCAGCCAG TGCCTTGCCG TTACGCCCGG CCTTGGCAAG
GTCATGCAGT TCATCCCTCT CCAGCAGGAC GGGGCGGAGC ACAAGGCCTT CCGCACGCCG
GTGATGAAGG GGCTCGCCTC GCGCTTCGTG GTGGCGCTCG AGCCGAAGGT CCAGGCGGTT
GCGCGCAAAC TCATGGAAAG CCTGCGCCCG CGCGGATCGT GCGATTTCGT CAGCGATTTT
GCCGAGATCC TGCCCCTCAA CATCTTTCTG ACGCTGATCG ACGTGCCGCT GGAAGACCGT
CCGCGCCTGC GCCAGCTGGG CGTGCAGCTT ACCCGCCCCG ATGGCTCGAT GACGGTGGAG
CAATTGAAGC AGGCCGCCGA CGACTACCTC TGGCCCTTCA TCGAGAAGCG GATGGCCCAG
CCGGGCGACG ACCTGTTCAG CCGCATTCTC TCGGAACCGG TGGGCGGACG TCCGTGGACG
GTCGACGAGG CGCGGCGGAT GTGCCGCAAC CTGCTGTTCG GCGGGCTTGA TACCGTGGCC
GCAATGATCG GCATGGTCGC GCTGCATCTT GCACGCCATC CCGAGGACCA GCGGCTTCTG
CGGGAAAGGC CAGACCTGAT CCCGGCGGCG GCCGACGAAC TGATGCGCCG CTACCCGACC
GTTGCCGTCA GCCGCAACGC GGTGGCCGAT GTGGACGCCG ATGGCGTTAC CATCCGCAAG
GGTGACCTCG TCTACCTGCC CAGCGTGCTG CACAACCTTG ATCCGGCGAG TTTCGAGGCG
CCCGAGGAAG TGCGCTTCGA CCGGGGTCTC GCGCCGATCC GCCACACCAC GATGGGGGTG
GGTGCGCATC GTTGCGTCGG GGCGGGACTG GCGCGGATGG AGGTGATCGT GTTCCTGCGC
GAATGGCTTG GCGGAATGCC CGAATTCGCG CTGGCCCCGG ACAAGGCGGT GACGATGAAG
GGGGGCAACG TCGGCGCTTG CACGGCGCTG CCTCTGGTCT GGCGGGCCTA G
 
Protein sequence
MIPAHVPADR VVDFDIFNPP GVEQDYFAAW KTLLDGPGLV WSTANGGHWI AARGDVVREL 
WGDAERLSSQ CLAVTPGLGK VMQFIPLQQD GAEHKAFRTP VMKGLASRFV VALEPKVQAV
ARKLMESLRP RGSCDFVSDF AEILPLNIFL TLIDVPLEDR PRLRQLGVQL TRPDGSMTVE
QLKQAADDYL WPFIEKRMAQ PGDDLFSRIL SEPVGGRPWT VDEARRMCRN LLFGGLDTVA
AMIGMVALHL ARHPEDQRLL RERPDLIPAA ADELMRRYPT VAVSRNAVAD VDADGVTIRK
GDLVYLPSVL HNLDPASFEA PEEVRFDRGL APIRHTTMGV GAHRCVGAGL ARMEVIVFLR
EWLGGMPEFA LAPDKAVTMK GGNVGACTAL PLVWRA