Gene Saro_1648 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1648 
Symbol 
ID3918757 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1723704 
End bp1724894 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content64% 
IMG OID640444389 
Productcytochrome P450 
Protein accessionYP_496922 
Protein GI87199665 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTACCG TGATCGAGCG GCCGCAATTC CGCTTCGACC CATATTCCCC GGCAATCGAC 
GCCGACCCGT TCCCCGCCTA CAAGGTGCTG CGCGACGAAT ACCCCTGCTT CTGGTCCGAG
GAGGCCGGAA AGTGGGTGCT CTCGCGCTAT GACGACGTGC TTGCAGCGCT GCAGGACTGG
CGGACCTATT CTTCCGCCAA GGGCAACCTC GTGGACGAGT TTCCCGGTCG CGCCGGCTCG
ACGCTGGGAT CGAGTGATCC GCCGCGCCAT GACCGCCTGC GCGCCCTCAT CCAGTCGGCC
GTGACCAAGC GTGCGCTTGA ACACATTATC GCACCAGCCC GGGCATCGGC CCAGGCGCAT
CTGGCCGCGC TGGCGGACAA GCCGGTGTTC GACCTGGTGG GCGACTACAC GTCGAAGCTG
ACGGTCGACC TCCTCTTCTA CCTTTTCGCC CTGCCGGACG AAGGCGCGCA GCAGGTGCGC
GAGAACGCGG TGCTGATGGT CCAGACCGAT CCGGTCACGC GCCAGAAGAG CCCCGAACAT
CTCGCGGCGT TCCATTGGAT GGCGGACTAC GCCGAAAAGC TGGTCGCCTC GCGCAAGGCG
AACCCCGGCG ACGACCTCCT GTCCAGCTTC ATCACCGCCG AGATCGACGG GGAGAAGTTG
CTCGACAAGG AAGTCCAGCT TACCGTCACC ACGCTGATCA TGGCGGGCAT CGAAAGCCTT
TCGGGCTTCA TGGCAATGTT CGGCCTGAAC CTTGCCGACT ATCCCGAAGC GCGCAGCGCG
CTGGTTGCCG ACCCTTCGCT GATCCCCGAT GCGATCGAGG AATCGTTGCG GTTCAACACT
TCCGCCCAGC GATTCAAACG GACGTTGACG CGGGACGTGG AGCTTCACGG ACAGGTGATG
AAGGCTGGCG ACGCGGTGAT CCTCGCCTAT GGATCAGCCA ATCGCGACGA GCGGATGTTC
GAGAATCCGG ACGTCTACGA CATCACCCGC AAGCCGCGGC GCCACCTCGG CTTCGGCGGC
GGTGTCCACG CCTGCCTTGG CTCGATGATC GGGCGCCTGG CGACGCAGAT CGCCTACGAG
GAACTCCTGA AGGCGGTGCC CGATTTCCGG CGTGCCGACG CCCCGCTCGA CTGGGTGCCT
TCATCCAACT TCCGCAGTCC GAAGTCGCTC ATGCTCGAAA AGAAGGCCTG A
 
Protein sequence
MATVIERPQF RFDPYSPAID ADPFPAYKVL RDEYPCFWSE EAGKWVLSRY DDVLAALQDW 
RTYSSAKGNL VDEFPGRAGS TLGSSDPPRH DRLRALIQSA VTKRALEHII APARASAQAH
LAALADKPVF DLVGDYTSKL TVDLLFYLFA LPDEGAQQVR ENAVLMVQTD PVTRQKSPEH
LAAFHWMADY AEKLVASRKA NPGDDLLSSF ITAEIDGEKL LDKEVQLTVT TLIMAGIESL
SGFMAMFGLN LADYPEARSA LVADPSLIPD AIEESLRFNT SAQRFKRTLT RDVELHGQVM
KAGDAVILAY GSANRDERMF ENPDVYDITR KPRRHLGFGG GVHACLGSMI GRLATQIAYE
ELLKAVPDFR RADAPLDWVP SSNFRSPKSL MLEKKA